Track: Evolving DevOps
Day of week:
Devops, SRE, TechOps, System Administration and the rest have a common goal toward stability, and reliability of deployment and operation of production applications. We’ll discuss how mechanization of SRE tasks, observability, consistency, and training are vital for speed and operational excellence in a scalable production environment. We’ll discuss the history of the competencies of DevOps as well as how we can work toward increasing the pipeline of new folks in the field.
by Lisa Phillips
VP, Site Reliability Engineering @Fastly
As a content delivery network, Fastly operates a large internetwork and a global application environment. Fastly developed its Incident Command protocol, which it uses to deal with large-scale events. Lisa will cover in detail the typical struggles a company Fastly’s size runs into when building around-the-clock incident operations and the things Fastly has put in place to make dealing with incidents easier and more effective. She will also cover common mistakes and lessons learned as Fastly...
by Cory Watson
Observability Specialist @Stripe
It's common to hear that an organization needs more observability, but what does that mean?
How do you change the culture of a company such that these needs are addressed sooner than later? I've got some ideas, and I've been trying them out at Stripe. Let's review how it's gone and talk about what worked at what didn't.
Let's talk about people, their needs and how to make them — and your observability — awesome.
by Sayli Karmarkar
Senior Software Engineer, Diagnostics and Remediation Engineering (DaRE) @Netflix
Netflix is a collection of microservices that all come together to enable the product you love. Operations for these microservices is distributed across the owning teams and their engineers. Ever wondered how we manage to achieve high availability and reliability without having a central operations team managing the operations of all these individual services? We believe that engineers who know their service inside out are the best people to manage its operations as well. So instead of...
by Pedro Canahuati
VP, Production Engineering & Site Reliability @Facebook
Development/Software orgs typically focus on shipping and building new features. Sometimes this happens at the expense of efficiency or stability. Operations orgs are typically built to enure services run smoothly 24x7 and to do it with the least amount of cost possible. Sometimes, this means each teams’ incentives aren't quite aligned and the situation can lead to an us versus them dynamic.
Facebook’s solution for this problem lies in the Production Engineering (PE) team. PE embeds...
by Franziska Bell
Data Science Manager @Uber
The Observability team at Uber focuses on providing intelligent real-time outage detection and root cause exploration at scale. This encompasses multiple building blocks: (i) a proprietary, scalable back-end store for application telemetry data that can service more than 500 million time series in real-time, (ii) a user-friendly and robust query language and UI for setting up alert configurations, (iii) the development of novel time series and machine learning models for fully automated,...
Monday Nov 7
Architectures You've Always Wondered About
You know the names. Now learn lessons from their architectures
Distributed Systems War Stories
“A distributed system is one in which the failure of a computer you didn't even know existed can render your own computer unusable.” - Lamport.
State of the art in Container deployment, management, scheduling
Art of Relevancy and Recommendations
Lessons on the adoption of practical, real-world machine learning practices. AI & Deep learning explored.
Next Generation Web Standards, Frameworks, and Techniques
Keeping life in balance is a challenge. Learn lifehacks, tips, & techniques for success.
Tuesday Nov 8
Next Generation Microservices
What will microservices look like in 3 years? What if we could start over?
Java: Are You Ready for This?
Real world lessons & prepping for JDK9. Reactive code in Java today, Performance/Optimization, Where Unsafe is heading, & JVM compile interface.
Big Data Meets the Cloud
Overviews and lessons learned from companies that have implemented their Big Data use-cases in the Cloud
Lessons/stories on optimizing the deployment pipeline
Software Engineering Softskills
Great engineers do more than code. Learn their secrets and level up.
Modern CS in the Real World
Applied, practical, & real-world dive into industry adoption of modern CS ideas
Wednesday Nov 9
Architecting for Failure
Your system will fail. Take control before it takes you with it.
Stream Processing, Near-Real Time Processing
Bare Metal Performance
Native languages, kernel bypass, tooling - make the most of your hardware
Culture as a Differentiator
The why and how for building successful engineering cultures
//TODO: Security <-- fix this
Building security from the start. Stories, lessons, and innovations advancing the field of software security.
Bots, virtual reality, voice, and new thought processes around design. The track explores the current art of the possible in UX and lessons from early adoption.