Technical Program Manager, AI Network Infra

Menlo Park, CA
This position will play a critical role in driving end-to-end AI product introductions and AI operations initiatives supporting Meta’s growing AI/HPC infrastructure for our Family of Apps. They will be responsible for overseeing the entire program lifecycle, from concept through planning, execution, and monitoring, ensuring successful delivery and implementation. This includes collaborating with cross-functional Engineering teams to define scope, goals, and timelines, as well as leading those teams in delivering the business outcomes. They will help solve some of the most challenging networking problems in the industry and drive innovative, creative, and ground-breaking solutions and technologies. The ideal candidate will have experience in AI/HPC product development and operations, a strong understanding of the network communications stack for AI solutions, fundamental knowledge of the hardware components, strong program management skills, and excellent communication and leadership abilities. As such, they need to understand the problem space and domain in depth, create roadmaps, prioritize based on impact, and drive product development from concept to production. They will operate in a multi-organization landscape.
Technical Program Manager, AI Network Infra Responsibilities
  • Lead technical program management of next-generation Artificial Intelligence/Machine Learning (AI/ML) platform(s) for Meta's Network Infrastructure in a matrix organization covering a range of areas (Data Center, Network, Hardware Systems, Infrastructure Engineering, Software Engineering, Capacity Management) and across multiple physical locations.
  • Collaborate with Engineering and business owners to define program requirements, set priorities, and establish scope, including the roadmap and long-term strategy of the teams that you are partnering with.
  • Manage cross-functional dependencies, risks, and changes effectively by optimizing scope, schedule, and resources accordingly.
  • Develop and own communication plans to effectively and proactively communicate program status, issues, and risks to stakeholders.
  • Partner with cross-functional teams to drive technical analysis, design, development, testing, implementation, and post-implementation phases.
  • Define and track key metrics and key quality and performance indicators, and drive cross-functional execution of program deliverables.
  • Proactively identify and analyze complex, long-term, critical infrastructure problems with engineering leaders and stakeholders.
  • Drive internal and external process improvements across multiple teams and functions, including reducing manual effort through automation.
  • Build strong and aligned program teams to efficiently deliver on shared goals.
Minimum Qualifications
  • B.S. in Computer Science or a related technical discipline, or equivalent experience.
  • 12+ years of software engineering, systems engineering, hardware engineering, or technical product/program management experience.
  • 8+ years of experience delivering network solutions/programs for data center applications.
  • Experience delivering tech programs or products from inception to delivery.
  • Experience operating autonomously across multiple teams, demonstrated critical thinking, and thought leadership.
  • Communication experience and experience working with technical management teams to develop systems, solutions, and products.
  • Analytical and problem-solving experience with large-scale systems.
  • Experience establishing work relationships across multi-disciplinary teams and multiple partners in different time zones.
  • Understanding of the network communication stack and network hardware (NICs, optics, and switches).
  • Experience developing and delivering AI cluster solutions for training and inference use cases.
Preferred Qualifications
  • Experience with network protocols (RoCE, InfiniBand, Ethernet).
  • Experience working with large-scale distributed systems.
  • Experience with data center architecture and deployment.
  • Experience working with ODMs and silicon vendors.
  • Experience with AI training and inference model deployments to physical infrastructure.
For those who live in or expect to work from California if hired for this position, please click here for additional information.
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

$167,000/year to $230,000/year + bonus + equity + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.


Equal Employment Opportunity and Affirmative Action
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.

Meta is committed to providing reasonable support (called accommodations) in our recruiting processes for candidates with disabilities, long term conditions, mental health conditions or sincerely held religious beliefs, or who are neurodivergent or require pregnancy-related support. If you need support, please reach out to accommodations-ext@fb.com.
We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. We may use your information to maintain the safety and security of Meta, its employees, and others as required or permitted by law. You may view Meta Pay Transparency Policy, Equal Employment Opportunity is the Law notice, and Notice to Applicants for Employment and Employees by clicking on their corresponding links. Additionally, Meta participates in the E-Verify program in certain locations, as required by law.

Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, you may contact us at accommodations-ext@fb.com.