Cloud Infrastructure Site Reliability Engineer (SRE) Job at LTIMindtree, Berkeley Heights, NJ

SlRjcmFnVTBuSFRZZDQrb25FbVFaNDJMcnc9PQ==
  • LTIMindtree
  • Berkeley Heights, NJ

Job Description

LTIMindtree is an equal opportunity employer that is committed to diversity in the workplace. Our employment decisions are made without regard to race, color, creed, religion, sex (including pregnancy, childbirth or related medical conditions), gender identity or expression, national origin, ancestry, age, family-care status, veteran status, marital status, civil union status, domestic partnership status, military service, handicap or disability or history of handicap or disability, genetic information, atypical hereditary cellular or blood trait, union affiliation, affectional or sexual orientation or preference, or any other characteristic protected by applicable federal, state, or local law, except where such considerations are bona fide occupational qualifications permitted by law.

A little about us...

Role : Cloud Infrastructure Site Reliability Engineer (SRE)

Location : Alpharetta, GA/Berkeley Heights, NJ

Job Description:

Position Summary:

As a Cloud Infrastructure Site Reliability Engineer (SRE) with expertise in multiple public cloud service provider platforms, you will be responsible for operating infrastructure solutions, following the principles and practices pioneered by Google’s SRE model. Your work will ensure our cloud services meet uptime, reliability, and performance targets, and you will drive automation and continuous improvement across our production environments. This role will involve collaborating with cross-functional teams to enhance our cloud reliability posture and streamline processes through automation.

Key Responsibilities:

Design, build, and maintain highly available, scalable, and secure cloud infrastructure on platforms such as AWS, GCP, or Azure.

Develop and implement automation for provisioning, monitoring, scaling, and incident response using Infrastructure-as-Code tools (e.g., Terraform, CloudFormation, Ansible).

Monitor system reliability, capacity, and performance; proactively detect and address issues before they impact users.

Respond to production incidents, participate in on-call rotations, and lead post-incident reviews to drive root cause analysis and reliability improvements.

Collaborate with software engineering and security teams to ensure new services and features are production-ready and meet reliability standards.

Build and maintain tools for deployment, monitoring, and operations; automate manual processes to reduce toil.

Document operational processes and system architectures to ensure knowledge sharing and repeatability.

Continuously evaluate and implement new technologies to improve system reliability, security, and efficiency.

Qualifications:

Bachelor’s degree in computer science, Engineering, or a related technical field, or equivalent practical experience.

3+ years of experience in software development with proficiency in at least one programming language (e.g., Python, Go, Java, C++).

Experience administrating cloud platforms (AWS, GCP, Azure), including networking, security, containerization, storage, data management, and serverless technologies.

Solid understanding of Linux systems, networking fundamentals, virtualized, and distributed systems, file systems, system processes and configurations.

Deep understanding of observability (monitoring, alerting, and logging) tools in cloud environments. Ability to set up and maintain monitoring dashboards, alerts, and logs.

Familiarity with Continuous Integration/Continuous Deployment (CI/CD) tools for automated testing, deployments, provisioning, and observability.

Ability to manage and respond to incidents, perform root cause analysis, and implement post-mortem reviews.

Understanding of setting, monitoring, and maintaining Service-Level Objectives (SLOs) and Service-Level Agreements (SLAs) for system reliability.

Additional Qualifications a Plus:

Experience working with enterprise-scale financial services or other regulated industries

5+ years of experience in SRE, DevOps, infrastructure, or cloud engineering roles, preferably supporting large-scale, distributed systems.

Excellent problem-solving, troubleshooting, and communication skills.

Experience leading technical projects or mentoring junior engineers.

Certifications: Certified Engineer, DevOps, SRE, CSREF

LTIMindtree is an equal opportunity employer that is committed to diversity in the workplace. Our employment decisions are made without regard to race, color, creed, religion, sex (including pregnancy, childbirth or related medical conditions), gender identity or expression, national origin, ancestry, age, family-care status, veteran status, marital status, civil union status, domestic partnership status, military service, handicap or disability or history of handicap or disability, genetic information, atypical hereditary cellular or blood trait, union affiliation, affectional or sexual orientation or preference, or any other characteristic protected by applicable federal, state, or local law, except where such considerations are bona fide occupational qualifications permitted by law.

Job Tags

Local area,

Similar Jobs

Sigma Design

Senior Manual Machinist Job at Sigma Design

 ...Their products are used in a variety of industries, including mining, steel mills, power generation, cement, pulp, and paper. What...  ...and reduce information security occurrences. Education and Experience: (Knowledge, Skills, & Abilities) Minimum of 10+ years of experience... 

Boys & Girls Clubs of America

Dir Partnership Marketing Job at Boys & Girls Clubs of America

Overview Boys & Girls Clubs of America named "One of the Best Nonprofits to Work for in 2017, 2018 and 2019" Indeed.comBoys & Girls Clubs...  ...Tuesday & Wednesday)JOB SUMMARYThe Director of Partnership Marketing leads integrated marketing for all partners headquartered in... 

Taco Bell

Cook Fast Food Job at Taco Bell

 ...The Cook Fast Food is the key to ensuring guest satisfaction. This is a very important position...  ...all guest inquiries and concerns in a timely manner. Maintain a safe, secure, and comfortable...  ...your career. Apply now and become a part of our highly skilled and motivated crew!... 

RBI Private Lending

Loan Officer Job at RBI Private Lending

 ...Position (Eligible candidates must reside in Orlando, Fl) The Loan Officer is responsible for originating, managing, and closing...  ...application and approval process, ensuring a smooth and compliant experience. Analyze financial documents and credit reports to assess... 

Hy-Vee Food Stores

Donut Finisher Job at Hy-Vee Food Stores

 ...workforce that is fully engaged and committed to supporting our customers and each other. Job Description: Job Title: Donut Finisher Department: Bakery FLSA: Non-Exempt General Function Responsible for finishing donuts, bagels, and danishes to be sold...