You are viewing content from a past/completed conference.
  
    
  
  
        
    
  
    
      
  
The Eternal Sunshine of the Toil-Less Prod
    
  
    
      
	
	
	
	
	
		
		
	
	
		
			
				
					
					                    Abstract
					
						One of the most important decisions in building an SRE practice is what kind of work should be assigned to the SRE team, and in what percentages. At Red Hat, we ship OpenShift both as a product and as a service, which can make it extra difficult to draw the lines between feature development and toil automation work. In addition, we face the usual SRE struggle between striving for toil minimization and unintentionally devaluing Ops-type work. In this talk, we will discuss the evolution from shipping products to running services, and what we've learned while trying different approaches. 
					 
					
						
					
					
					Speaker
    
    
    
            Sasha Rosenbaum
      Director of the Cloud Services Black Belt Team @RedHat
          
    Sasha is a Director of the Cloud Services Black Belt team at Red Hat, where she is helping enterprise customers successfully migrate to Managed OpenShift on customers’ favorite public cloud.
In her career, Sasha has worked in development, operations, consulting, and cloud architecture. Sasha is an organizer of DevOpsDays Chicago, a chair of DeliveryConf, and a published author.
 
    Read more
       
 
 
										
					
				 
				
			 
		 
	
			
			
				From the same track
				
					
    
        Session
        SRE
        Did the Chaos Test Pass?
        Wednesday Oct 26 / 11:50AM PDT
        
            
            People used to ask me all the time how to figure out if their chaos test has “passed,” and I’d always say “well, that’s a loaded question.” To confirm that a chaos test “passed,” we need to do verification of hypotheses - sometimes you’re trying to prove some system behavior occurred in response 
      
        
        	
		
		
			Christina Yakomin
			Senior Site Reliability Engineering Specialist @Vanguard_Group
		 
	 
 
        Did the Chaos Test Pass?
     
 
    
        Session
        SRE
        The Endgame of SRE
        Wednesday Oct 26 / 10:35AM PDT
        
            
            The containers are deployed and the builds are green. Yaml flows through the system, linted, reviewed, tested, and shipped with ease and regularity. Our intrepid SRE finds themself at a crossroads. The infrastructure is great but teams still struggle to maintain error budgets.
      
        
        	
		
		
			Amy Tobey
			Senior Principal Engineer and SRE practice Leader @Equinix
		 
	 
 
        The Endgame of SRE
     
 
    
        Session
        SRE
        Rethinking Reliability: What You Can (and Can't) Learn From Incidents
        Wednesday Oct 26 / 02:55PM PDT
        
            
            This talk presents research collected from the VOID—an open database of public incident reports. Containing over 2,000 reports for almost 700 organizations, the database allows for more structured review and research about software-related incident reporting.
      
        
        	
		
		
			Courtney Nash
			Internet Incident Librarian & Research Analyst, Previously @Verica, @Holloway, @Fastly, @O’Reilly Media, @Microsoft, & @Amazon
		 
	 
 
        Rethinking Reliability: What You Can (and Can't) Learn From Incidents
     
 
    
        Session
        
        [Panel] SRE: Is it Working?
        Wednesday Oct 26 / 01:40PM PDT
        
            
            How does SRE mature from a craft with a wide range of skills and levels of expertise to a mature discipline?
      
        
        	
		
		
			Courtney Nash
			Internet Incident Librarian & Research Analyst, Previously @Verica, @Holloway, @Fastly, @O’Reilly Media, @Microsoft, & @Amazon
		 
	 
	
		
		
			Amy Tobey
			Senior Principal Engineer and SRE practice Leader @Equinix
		 
	 
	
		
		
			Christina Yakomin
			Senior Site Reliability Engineering Specialist @Vanguard_Group
		 
	 
	
		
		
			Sasha Rosenbaum
			Director of the Cloud Services Black Belt Team @RedHat
		 
	 
 
        [Panel] SRE: Is it Working?