The Principal Reliability Engineer will lead the technical team efforts and mentor and lead other Reliability Engineers (RE), Product Managers, Software Engineers, and Architects to produce mission-critical infrastructure, tools, performance improvements, actionable and meaningful performance measurements, and communication to stakeholders. The Principal RE is expected to work with management, peers, and customers to architect, design, build and implement the technical vision, improve monitoring tools, error detection, defect elimination, improving Mean Time to Detection/Resolution, and overall service availability and customer satisfaction. The RE role at Dick’s Sporting Goods (DSG) provides an opportunity to blend system architecture, design and software engineering skills with a passion for troubleshooting and defect elimination to address an ever-changing applications and environments with scalability and reliability challenges.
This is a full time remote opportunity. We are hiring immediately!
Advance the state-of-the-art of Reliability Engineering at Dick’s Sporting Goods.
Improving and developing the reliability engineering platform, building out custom tools, infrastructure, and services. Automation of manual tasks to reduce toil.
Perform engineering and technical tasks as assigned by applying general engineering principles.
Perform independent research in support of technical tasks.
Contribute positively to open-source projects developed by DSG and join existing communities. Navigate this broader ecosystem and structure projects with upstream/ downstream opportunities in mind.
Design monitoring and observability solutions and set reference architectures for ongoing use.
Bring experience, pragmatism, empathy, and composure to interactions with teams outside of the RE organization.
Work frequently with Product teams on shared goals and cross-team projects.
Balance planned and reactive work using basic project planning techniques and technical roadmaps.
Negotiate SLIs, SLOs, and SLAs with product owners.
Supervise service reliability, metrics, sustainability, technical debt, and operational toil for live services running at scale.
Work across multiple project teams simultaneously to support rapid development efforts.
Identify and integrate with third-party solutions where it makes the most sense.
Use data to understand the availability, reliability, and sustainability of our software.
Our teammates know that there is an athlete behind every in-store and eCommerce transaction. We go beyond the expected to build technology that makes the DICK’S Sporting Goods’ experience innovative and hassle-free.
COMMITTED TO INCLUSION & DIVERSITY.
We actively seek to create an inclusive and diverse workforce, reflecting the communities we serve. Doing so strengthens our ability to serve all our athletes and drive innovation and growth.
HAVE A PASSION FOR SPORTS.
We believe that sports make people better and we’re determined to be the best sports company in the world. Whether you’re an athlete or sports enthusiast, we bring our passion for the game into everything we do.
GET BETTER EVERY DAY.
The journey is never over. We know that to be the best, we must get a little better each day. We focus on delivering 1% more in everything we do.
What we’re looking for
Bachelor's Degree in Computer Science, Software Engineering, Information Systems
Service Management Tools Like: Jira, Pivotal Tracker, xMatters
Intellectual curiosity, problem solving, and openness is key to its success. Mindset for solving production systems issues and understanding root cause while providing “Detective work” and automating away toil – doesn't like boring repetitive tasks. Enjoys digging into new problems.
Capable of digging into common system performance issues, such as "this is slow", and developing metrics and driving measurable improvements.
Can work on different tasks in different systems week to week
Knows when to ask for help and when to dig more on their own
Understanding of and comfort with the GNU/Linux operating system.
Proficiency in high-level languages such as Ruby, Python, and Bash.
Exposure to system-level languages such as Go, C/C++.
Familiarity with configuration management software such as Puppet, Chef, Ansible, or Salt.