Speaker
Mr
David Rothera
(Facebook)
Description
Want to learn how Facebook operates their global network to support more than 1.3 billion users? We will be describing the technologies and methods we use to manage Facebook's production network. The neteng org at Facebook has built/leverage several systems for managing and operating the production network, including an audit framework, alarms daemons, drainers, and an automatic remediation engine. This talk will focus on these technologies and how they have helped improve user experience, administer complexity, automate day-to-day operations, mitigate impact, and increase reliability.
Summary
A high level overview on some of the problems and some of the lessons learned onhow we deal with operating a large scale network.
Primary authors
Mr
David Rothera
(Facebook)
Mr
Jose Leitao
(Facebook)