Job Opportunity: SRE - Software Engineer at Dearborn, Michigan 48126

Job#: 2038983

Job Description:

Location:Hybrid to SE MI or Palo Alto, CA
Duration:12 month contract to hire
Description:
We are seeking a talented Full Stack / Site Reliability Engineer to play a key role in developing a comprehensive Internal Developer Platform (IDP) that includes CI/CD pipelines, managed infrastructure, observability, and a developer portal.
The Bedrock and Customer Success and SRE team is responsible for ensuring that our customers derive maximum value from our platform. This team acts as the primary point of contact for customers, helping them onboard, adopt, and optimize their use of our platform offerings. This team also works to ensure the stability of the platform that hosts the cloud applications that power our customer's connected vehicle experiences.
Additional for Data Engineer Specialist(s)
In this role you will be working with our customers to ensure that they are leveraging our Data Platform efficiently and effectively. You will partner with our Data Platform team to ensure that their data products are as valuable and useable as possible for our Bedrock customers.
Additional for Security Specialist
In this role, you will be a trusted advisor for our customers, ensuring their success with our security solutions, including SCA, SAST, DAST, Vulnerability Management, and CSPM. You will act as a technical expert, guiding customers through onboarding, implementation, and ongoing usage, while also proactively identifying and addressing security risks.
Responsibilities:

Strong background in software development and systems administration, as well as excellent problem-solving and communication skills.
Run a production environment by monitoring availability and taking a holistic view of system health.
Developing, improving, and operating the deployment and orchestration of a complex distributed system
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating to continually improve
Provide primary operational and engineering Support for multiple large, distributed software applications
Identify and reduce or eliminate toil via automation to maximize the time spent on engineering and innovation
Collaborating with development teams to design, build, and operate scalable and resilient software systems
Automating build, deployment, monitoring, and incident response processes
Performing root cause analysis of production incidents and implementing preventive measures
Conducting performance analysis and optimization of the system
Ensuring compliance with security and regulatory standards
Implementing and maintaining disaster recovery processes
Providing technical guidance and mentorship to other team members
Participating in an on-call rotation for incident response and support.

Additional Responsibilities for Data Engineer Specialist(s)

Design, develop, and maintain scalable data pipelines and systems
Collaborate with data scientists and analysts to understand data requirements
Ensure data integrity and quality across all data platforms
Optimize data processing workflows for performance and cost
Implement data security measures to protect sensitive information
Monitor and troubleshoot data pipeline issues and system performance
Document data processes and systems for cross-functional teams

Additional Responsibilities for Security Specialist

Collaborate with customers to understand their security needs and objectives
Leveraging SCA, SAST, DAST, Vulnerability Management, and CSPM tools to assist customers deliver secure services
Guide customers through vulnerability remediation processes, providing actionable steps and recommendations to mitigate risk.
Proactively monitor customer environments for security vulnerabilities
Advocate for customer needs and provide feedback to product development teams

Qualifications

4 Year College Degree in Computer Science or Equivalent Experience
5 - 6 years' experience with Golang, Java, NoSQL/SQL Datastore, Spring Boot, GCP/AWS/Azure, Docker/K8 in Maintenance and Development of multi-tier applications.
Understanding of gRPC & RESTful APIs, and microservices platform
4 - 5 Years of experience with any of APM and other monitoring tools such as Grafana Cloud, Dynatrace, New Relic, ELK, Splunk, Prometheus, Sensu, Nagios, Kafka, DataDog, PagerDuty.
Strong experience with product & development teams to establish error budgets by identifying the right SLOs (Service level objective), SLIs (Service level indicators), KPIs (Key performance indicators) and effectively drive the use of the budget to ensure maximum domain availability/uptime.
Regularly review key site technical metrics such as transactions errors, logging, response times, caching strategies, conversion/bounce rates, capacity & resource utilization.
Proactively identify stability risks & work with engineering leadership to establish appropriate mitigation plans
Experience in solving complex architecture/design & business problems, work to simplify, optimize, remove bottlenecks, etc.
Architect, design & develop automation to reduce toil, improve recoverability, availability, latency & scalability of supported applications with understanding of MTTD (Mean Time to Detection) & MTTR (Mean Time to Resolution)
Maintain knowledge repository that includes Standard operating procedure, Release checklists, Runbooks for incident recovery

Job Details

SRE - Software Engineer

Organization Profile

Job Summary

Job Tools

Session Time out

Job Details

SRE - Software Engineer

Organization Profile

Job Summary

Job Tools