Zabbix Conference 2016 Agenda
Official agenda of Zabbix Conference 2016, which took place on September 9-10 in Riga, Latvia
Explore the agenda in full, with all the videos and presentations!
09:00 |
Registration for Conference Delegates |
10:00 |
Opening Speech |
10:40 |
Zabbix for HPC Cluster Support
Mikhail works on High Performance Computing platform support. He is in charge of managing a team remotely, analysing the current HPC platform and finding opportunities for further improvement, supporting and monitoring the existing HPC platform, and lifecycle management of HPC software. Conference video & presentations |
11:20 |
Insight in Mechanics of Zabbix Modules
Gleb will talk about loadable modules and the new "superpowers" they provide in Zabbix 3.2. He is going to shed some light on what's happening under the hood and how to make the coexistence of Zabbix and modules as harmonious as it can get. When you understand how this heavy machinery works, extending Zabbix becomes a joyful experience. Conference video & presentations |
11:40 |
Coffee Break |
12:10 |
Zabbix at Nokia - Case Study
We will explore a fairly complicated Zabbix environment at one division in Nokia. With several different Zabbix versions in use and a lot of custom products monitored, it is a place one can easily get lost in. We'll discuss JMX monitoring, approaches to keeping notification configuration simple and notifications useful, different use cases for the Zabbix API, and a lot of other topics. The importance of SSL compliance will be covered, along with some of the many ways custom solutions are monitored. Conference video & presentations |
12:50 |
SHORT TALK: Monitoring Cloud Applications Using Zabbix |
13:10 |
SHORT TALK: Zabbix Meets OPS Control / Rundeck
Zabbix is an excellent tool for network monitoring and for alerting if something bad happens. But Zabbix can do more. An underestimated feature of Zabbix is its ability to perform actions in addition to simple notifications. However, this requires setting up those actions precisely within Zabbix, which is not always an easy task and might duplicate existing work. So what if Zabbix actually worked in concert with an external task runner / job scheduler that is built to do exactly this: run a task or action against a host and report its outcome? Zabbix would perform the same well-defined steps that an ops member would perform with this kind of tool in case of certain failures. A well-known example of this kind of software is "Rundeck", which is licensed under the Apache License Version 2.0. Conference video & presentations |
13:30 |
Lunch |
15:00 |
Monitoring Mesos, Docker, Containers with Zabbix
At DBC we are running Docker and other container types in a Mesos/Marathon cluster environment. I will demonstrate how we collect statistics, logs etc. and monitor this environment, showing configuration examples, data flows and templates. Some of the covered topics:
Conference video & presentations |
15:40 |
Nagios to Zabbix Migration
A case study on how we learned to stop worrying and love Zabbix, in an in-depth migration process from Nagios to Zabbix. Conference video & presentations |
16:20 |
Coffee Break |
16:50 |
SHORT TALK: Trouble Ticket Integration with Zabbix in Large Environment
Large environments rely on a trouble ticket tool and a help desk for managing IT issues. Manually bridging a Zabbix installation of over 5000 servers with a help desk is a painful and impossible project. In this presentation we will cover how Zabbix can be integrated with a help desk, the architecture, and the issues that arise, especially in large environments. Conference video & presentations |
17:10 |
SHORT TALK: Zabbix Action Simulator
The action simulator used to be available for 2.0 and 2.2, but not for 2.4 and consequently 3.0, mostly due to the possible shift of the API into a different component and the introduction of custom expressions in action conditions. Now the first version is here. We will talk about the intention to bring the action simulator back, as it solves some important problems which may occur in more complex installations. Conference video & presentations |
08:30 |
Zabbix Workshops / Zabbix Exam
(IMPORTANT! All participants must bring their own laptops with Zabbix pre-installed)
EXAM
Zabbix Conference will also offer a fantastic opportunity to become Zabbix 3.0 Certified, as we will provide you with a chance to pass the Zabbix 3.0 exam right at the venue!
If you currently hold any of the following certificates, you are welcome to apply for the exam: |
10:00 |
Zen and The Art of Zabbix Template Design
The Zabbix monitoring solution can help bring balance to your organisation's IT landscape. However, success greatly depends on the templates you use to set up your monitoring system. As any Zabbix veteran will tell you, the default templates don't really suffice for any setup other than a proof-of-concept. How then do you set about creating your own templates? Following practical examples, we'll discuss some of the design decisions that need to be made to achieve template perfection. Conference video & presentations |
10:40 |
Log management ELISA controlled by Zabbix
Datasys ELISA log management is a robust, powerful, yet inexpensive solution for collection, correlation and analysis of logs. The core system consists of the Elasticsearch "NoSQL" database and the Kibana web user interface, which makes analysis of detected security incidents and relevant logs highly convenient. The Elasticsearch database is commonly distributed across multiple servers to achieve load balancing and high availability of indexed data. ELISA heavily utilizes Zabbix for user authentication and role-based access control, notifications and self-monitoring. Elasticsearch indices can be managed right in the Zabbix frontend. Zabbix "trapper" items and monitoring templates are used to centrally manage the configuration of a distributed environment of NXlog agents. Agents are capable of securely auto-registering as Zabbix "hosts". Conference video & presentations |
11:00 |
Coffee Break |
11:30 |
Lessons Learned While Being On-Site + Benefits of Zabbix Training |
12:10 |
Monitoring More Than 6000 Devices in Zabbix
Ryan will describe a Skunkworks project executed by Kinetic IT at the Department of Education to deliver an autonomous infrastructure monitoring solution for over 6000 devices distributed across WA. The team was given the opportunity to experiment with DevOps practices such as Scrum product development, Infrastructure as Code and Continuous Integration to determine where the value lay and which practices should be adopted at greater scale. Conference video & presentations |
12:50 |
Lunch |
13:50 |
Event Analysis Toolset
During outages in environments of 10k+ hosts, NOC and Operations teams may face hundreds of alerts while performing root cause analysis, remediation or escalation, and at the same time must log resolution progress to the Incident Management system for audit purposes. This presentation will describe the RingCentral approach to Incident and Problem Management in a large Zabbix-monitored cloud. Co-authors of the presentation: Dmitry Shchemelinin, Ph.D., Sr. Director of Operations, RingCentral, USA. Furthermore, it was necessary to know the limits of the new architecture by running test loads and to be sure that the new architecture will absorb load peaks. Conference video & presentations |
14:30 |
Manage Zabbix Proxies in Remote Networks
Monitoring multiple server farms spread all around the world is not an easy task. Many small problems have to be addressed, but with Zabbix it is all a breeze. We will talk about our experience setting up Zabbix proxies in very remote networks, the problems we encountered, and how we worked on fixing them. Conference video & presentations |
14:50 |
Coffee Break |
15:20 |
Zabbix at the University of Oslo
A case study showing the problems we have resolved with Zabbix and the challenges we had when we implemented Zabbix as the main monitoring tool at the University of Oslo. The challenges are numerous in an organization as heterogeneous as ours, with many thousands of servers and clients, all kinds of devices connected to our infrastructure, different operating systems, multiple locations and hundreds of IT staff. Full automation and delegation of privileges are the key words in the work we have done during the past year and a half. Conference video & presentations |
16:00 |
Lightning Talks
5-minute inspiration! Conference video & presentations |
16:40 |
Q&A Session with Zabbix Team |
17:20 |
Closing Speech |