Bestkaam Logo
Weekday AI (YC W21) Logo

Linux Administrator

Actively Reviewing the Applications

Weekday AI (YC W21)

India, Maharashtra, Pune Full-Time On-site INR 12–15 LPA
Posted 2 months ago Apply by April 29, 2026

Job Description

This role is for one of the Weekday's clients

Salary range: Rs 1200000 - Rs 1500000 (ie INR 12-15 LPA)

Min Experience: 3 years

Location: Pune

JobType: full-time

The Linux Administrator will be responsible for operating, maintaining, and optimizing on-premise Linux infrastructure that supports large-scale, distributed systems. This role requires deep expertise in Linux internals, release engineering, and system reliability, with hands-on ownership of deployments, troubleshooting, and performance in high-load, network-intensive environments.

Unlike cloud-centric roles, this position demands strong command-line diagnostics, kernel-level understanding, and the ability to resolve issues directly on bare-metal systems.

Requirements

Key Responsibilities

Linux Systems Administration

  • Operate, tune, and troubleshoot bare-metal Linux servers (CPU, memory, storage, and networking)
  • Perform deep OS-level diagnostics using system logs, process monitoring, and kernel tools
  • Identify and resolve performance bottlenecks and system stability issues

Release & Deployment Management

  • Own end-to-end release and deployment processes for on-prem systems
  • Manage versioning, staged rollouts, and rollback strategies to ensure stable production releases
  • Coordinate and execute controlled production deployments with minimal downtime

Container & Runtime Operations

  • Build, optimize, and debug Docker images
  • Integrate containerized applications into on-prem environments
  • Troubleshoot container runtime and system-level issues

Networking & Connectivity

  • Monitor, validate, and troubleshoot network performance including latency, packet loss, and jitter
  • Diagnose Wi-Fi performance issues in high-density, real-time environments
  • Apply strong understanding of TCP/UDP, routing, and VLANs for issue resolution

Automation & Tooling

  • Develop and maintain Bash and Python scripts for automation, system tooling, and log analysis
  • Improve operational efficiency through custom tooling and workflow automation

Monitoring & Reliability

  • Implement and maintain monitoring and alerting using tools such as Prometheus and Grafana
  • Analyze logs and metrics to perform Root Cause Analysis (RCA)
  • Ensure system visibility, reliability, and performance across environments

Required Skills & Experience

  • Expert-level experience administering Linux systems (Ubuntu Server preferred)
  • Strong understanding of Linux internals, processes, memory management, and networking
  • Hands-on experience with release engineering, deployment strategies, and rollback mechanisms
  • Proficiency with Docker, including image optimization and debugging
  • Strong scripting skills in Bash and Python
  • Solid networking fundamentals (TCP/IP, UDP, VLANs, routing, Wi-Fi troubleshooting)
  • Practical experience with monitoring, observability, and log analysis tools (ELK, Prometheus, etc.)

Good to Have

  • Background in Site Reliability Engineering (SRE) or Build/Release Engineering
  • Experience running Kubernetes in on-prem or edge environments
  • Operational experience with databases such as PostgreSQL, MongoDB, or Redis (availability, backups)

What Success Looks Like

  • Reliable and predictable system deployments with seamless rollback capability
  • Clear system visibility through actionable metrics and logs
  • Fast, accurate root-cause diagnosis at the OS, network, and application layers
Check Qualification

Quick Tip

Customize your resume and cover letter to highlight relevant skills for this position to increase your chances of getting hired.