Friday, September 26, 2014

Box - Sr./Staff Site Reliability Engineer


Sr./Staff Site Reliability Engineer

Los Altos, CA
Technical Operations
Full-Time

If you are passionate about technologies and design approaches that disrupt incumbent Enterprise players, like to leverage Open Source, tooling, automation and agile concepts, and are interested in working at massive scale, we may just have the job for you. Site Reliability Engineering (SRE) is not a new discipline, but at Box we like to think that we approach it in a way that makes it one of the most important roles within our Technology organization.

As an SRE at Box, you will be responsible for running our production services and will work very closely with developers to ensure the reliability, scalability and performance of the next generation of systems. You will be embedded within one of the engineering teams and will be a core driver of operational excellence throughout the development cycle and go-live.

We are looking for smart Operations Engineers with a strong acumen for software development and for tuning distributed systems, gained in a highly scalable, consumer-facing web service. Multi-year experience with best practices around monitoring, deployment and configuration management is a must, as are deep technical skills in systems and web operations/management. If this sounds like you, please read on for a more detailed description of the scope and requirements for this role.

Responsibilities

    • Primary owner of one or more production services for the rapidly growing Box.com site.
    • Primary person responsible for the overall operability, resiliency, performance, and capacity of owned production services.
    • Collaborate with developers in the deployment and scaling of new product features to facilitate rapid iteration and massive growth.
    • Develop tools to improve our ability to rapidly deploy and effectively monitor production applications in a large-scale Unix environment.
    • Work closely with development teams to ensure that services and platforms are designed with 'operability' in mind.
    • Participate in a 24x7 rotation for third-tier escalations.

Qualifications

    • Minimum 5 years of Unix/Linux systems administrator level experience.
    • Extraordinary experience in web technologies.
    • Proven production service troubleshooting skills that span applications, systems and networks.
    • Demonstrated programming skills in one or more of: Python, Perl, PHP, Ruby, Java, C.
    • Minimum 5 years of experience working in a consumer web-scale Technical Operations role.
    • Solid understanding of operational principles, such as capacity planning, monitoring and incident handling.
    • Very comfortable working in an agile, DevOps-oriented capacity alongside Dev partners.
    • CS or equivalent university degree.
    • Passion for cloud technologies.

Nice to have

    • Committer to relevant Open Source projects.
    • Prior experience working in a DevOps capacity.

About Box: Box provides a secure way to share content and improve collaboration on any device: desktop, tablet or mobile. From huge corporations to mom-and-pop stores, Box believes technology should never limit anything you do. Businesses of any size can be more productive, inventive and powerful on Box. The company is well funded by top VC firms like Andreessen Horowitz, Draper Fisher Jurvetson and U.S. Venture Partners. Box is proud to be on Forbes’ list of America’s Most Promising Companies, is used in 240,000 businesses, including 99% of the Fortune 500, and is the go-to product of 27 million people.
