
Job Overview
Employment Type
Full-time
Work Schedule
Standard Hours
Benefits
Travel perks
health benefits
wellness programs
401(k) program
Employee assistance program
Pet insurance
Discounts on hotels, cars, cruises
Job Description
American Airlines is the largest airline in the world, recognized globally for its extensive network and commitment to excellence in air travel. With a fleet that connects over 365 destinations across six continents and more than 6,800 daily flights, American Airlines is a leader in the aviation industry, continually innovating to provide outstanding customer experiences. The company is known for fostering an inclusive and diverse workforce, encouraging team members to bring their unique perspectives and talents to contribute to a dynamic and supportive workplace environment. American Airlines offers expansive career opportunities that allow employees to grow professionally while enjoying exceptional travel benefits and comprehensive health and wellness programs. Together, American Airlines' employees uphold the mission to connect people and cultures while providing safe, reliable, and efficient air transport.
This particular opening is for an integral role within American Airlines' Supply Chain Division, specifically on the Information Technology Team. This position is designed for a cloud and infrastructure specialist focused on building and managing both on-premises and cloud-based infrastructure solutions. The ideal candidate will be highly skilled in software engineering, site reliability engineering (SRE), and DevOps practices, contributing to the evolution of the airline's technological backbone. Responsibilities include constructing end-to-end monitoring infrastructure to ensure system reliability, collaborating closely with product development and operations teams, and managing physical and virtual hardware assets, including servers, autonomous robots, and network equipment.
In this role, you will have the chance to engage in the full lifecycle of infrastructure implementation, from design adherence to deployment and continuous improvement of performance and availability. Your work will directly impact the operational efficiency and reliability of applications that support American Airlines’ supply chain and automated warehouse systems. The position requires hands-on expertise with monitoring and logging tools, proficiency in Azure cloud architecture, and strong knowledge of CI/CD pipelines using tools like Jenkins and GitHub. Experience with SQL Server and Mongo databases, alongside working knowledge of Kubernetes and Kafka, is highly advantageous.
As part of a company deeply committed to employee well-being, American Airlines offers extensive benefits that start from day one, including health, dental, prescription, and vision coverage, virtual healthcare options, and wellness programs tailored to help you be the best version of yourself. The company also supports long-term financial security with a 401(k) plan featuring employer contributions and additional perks such as travel discounts, employee assistance programs, pet insurance, and savings on hotels, cars, and cruises. Furthermore, American Airlines embraces a workplace culture that values inclusion and diversity, supported by more than 20 Employee Business Resource Groups aimed at fostering community connections and developmental opportunities.
Joining American Airlines means you will be part of a global team that values innovation, dedication, and personal growth while providing you the opportunity to travel the world and build a rewarding career. If you are motivated to solve complex challenges with flexibility and grace while impacting an essential global industry, this role invites you to embark on a fulfilling journey with a company that cares about its people and its mission.
This particular opening is for an integral role within American Airlines' Supply Chain Division, specifically on the Information Technology Team. This position is designed for a cloud and infrastructure specialist focused on building and managing both on-premises and cloud-based infrastructure solutions. The ideal candidate will be highly skilled in software engineering, site reliability engineering (SRE), and DevOps practices, contributing to the evolution of the airline's technological backbone. Responsibilities include constructing end-to-end monitoring infrastructure to ensure system reliability, collaborating closely with product development and operations teams, and managing physical and virtual hardware assets, including servers, autonomous robots, and network equipment.
In this role, you will have the chance to engage in the full lifecycle of infrastructure implementation, from design adherence to deployment and continuous improvement of performance and availability. Your work will directly impact the operational efficiency and reliability of applications that support American Airlines’ supply chain and automated warehouse systems. The position requires hands-on expertise with monitoring and logging tools, proficiency in Azure cloud architecture, and strong knowledge of CI/CD pipelines using tools like Jenkins and GitHub. Experience with SQL Server and Mongo databases, alongside working knowledge of Kubernetes and Kafka, is highly advantageous.
As part of a company deeply committed to employee well-being, American Airlines offers extensive benefits that start from day one, including health, dental, prescription, and vision coverage, virtual healthcare options, and wellness programs tailored to help you be the best version of yourself. The company also supports long-term financial security with a 401(k) plan featuring employer contributions and additional perks such as travel discounts, employee assistance programs, pet insurance, and savings on hotels, cars, and cruises. Furthermore, American Airlines embraces a workplace culture that values inclusion and diversity, supported by more than 20 Employee Business Resource Groups aimed at fostering community connections and developmental opportunities.
Joining American Airlines means you will be part of a global team that values innovation, dedication, and personal growth while providing you the opportunity to travel the world and build a rewarding career. If you are motivated to solve complex challenges with flexibility and grace while impacting an essential global industry, this role invites you to embark on a fulfilling journey with a company that cares about its people and its mission.
Job Requirements
- Four years of experience in software engineering, SRE or performance engineering role
- Two years of experience in Azure cloud architecture, networking, security and administration
- Expertise in Terraform and CI/CD tools like Jenkins and GitHub
- Experience with Event Hub client configuration and monitoring
- Experience with SQL Server and Mongo databases
- Hands-on expertise with monitoring and logging tools such as DynaTrace, Mezmo, LogInsight, ThousandEyes
- Knowledge of Kubernetes and Kafka is a plus
Job Qualifications
- Four years of experience in software engineering, site reliability engineering, or performance engineering role
- Two years of experience in Azure cloud architecture, networking, security, and administration
- Expertise in Terraform
- Proficiency with CI/CD tools such as Jenkins and GitHub
- Experience with Event Hub client configuration and monitoring
- Knowledge of SQL Server and Mongo databases
- Hands-on expertise with monitoring and logging tools like DynaTrace, Mezmo, LogInsight, ThousandEyes
- Knowledge of Kubernetes and Kafka is a plus
- Excellent communication and teamwork abilities
- Airline industry experience is preferred
- Previous automated warehouse or supply chain experience is preferred
Job Duties
- Build end-to-end monitoring infrastructure including logging, metrics, and tracing
- Work closely with other product teams to provide tooling that measures system reliability
- Collaborate with development and operations teams to ensure the availability and reliability of applications, hardware, and infrastructure
- Manage physical servers, virtual machines, network equipment, hardware control systems, autonomous mobile robots (AMRs), and autonomous guided vehicles (AGVs)
- Administrate SQL Server instances including backups, restores, data purges, and failovers
- Efficiently handle live production incidents and troubleshoot application, hardware, and infrastructure issues
- Implement and improve continuous integration and continuous deployment automation using DevOps tools
- Facilitate incident management, post-incident reviews, and remediation tasks to reduce incident frequency and severity
Job Criteria
Experience
Mid Level (3-7 years)
Job Location
Your Profile Is Visible To Hiring Managers Across OysterLink.
We'll match you with best jobs
Get job offers faster


Search For More Opportunities:
How Candidates Get Hired Faster
Apply to 2–3 similar roles
Complete profile & get best matches
Check new opportunities daily

