Sr. Linux AI System Engineer
• Remote - TX and or CST preferred
• Must be US citizen, and all work must be done with from within the United States.
• Start ASAP - one month assignment
The AI environment that was built following less than best practices. Experiencing instability, and do not have the resources to debug and resolve these issues.
Technical Requirements/Scope
• Linux Systems Engineering
• GPU / HPC expertise
• AI/ML platform support
• Cloud + DevOps + MLOps
• Full AI Build Experience
• Configuration management, Monitoring, diagnostics and issue resolution of:
• Servers - Lenovo and Lenovo GPU nodes
• NVIDIA Cumulus- Ethernet - SN5000 Series
• Ubuntu Linux
• Kubernetes's
• Portworx SDS
• GPU Passthrough - OS to k8 - Pod
• Familiarity with Centific would be a huge plus
• Strong Debugging and Diagnostic Skills
• Ability to map out and documents an environment for future administrators.
• Strong written and verbal communication skills
• Proven experience working in a collaborative multi-vendor environment
• Full time - weekdays and daytime hours
• Potential overtime non-standard work hours, nights, and weekends, extended days during active issue resolution.
• Will be working with CJIS data. CJIS background check is a fingerprint-based, national criminal history search mandated by the FBI's Criminal Justice Information Services (CJIS) Division. It checks individuals accessing sensitive law enforcement data for compliance.
Additional Information
• All candidates are encouraged to apply, but many positions require a strict drug and background check by our customers.
• F2OnSite supports and adheres to all state laws regarding background checks.