Career opportunity Senior ML Ops Engineer

Senior ML Ops Engineer

systemengin

Job description

  • Building, maintaining, and operating MLOps pipelines to efficiently enable the training, evaluation, and prediction of machine learning models.
  • Designing, developing, and operating machine learning API servers.
  • Developing data pipelines for machine learning systems.
  • Designing, developing, and operating machine learning workflows.
  • Creating internal libraries to streamline ML-related development tasks.
  • Standardizing tech stacks and establishing operational frameworks in line with the latest technological trends to achieve the above objectives.
  • Building and managing infrastructure using Infrastructure as Code (IaC).
  • Automating deployments through CI/CD pipelines.
  • Designing and operating monitoring systems for services.
  • Implementing measures to enhance Site Reliability, including performance tuning.
  • Training, skill transfers, and recruitment efforts for internal data scientists and related roles.
  • Formulating and promoting strategies for utilizing the ML platform and driving company-wide efforts to foster an environment for machine learning utilization and awareness.

Requirement

Required Qualifications

  • Over 5 years of experience in system development, including data extraction and processing using SQL and programming languages like Python.
  • Development and operation experience with ML frameworks such as Vertex AI, Kubeflow, SageMaker, TFX, or Flyte.
  • Experience in development and operation with one of the following: dbt, Snowflake, or Databricks.
  • Team development experience using Git.
  • Experience with CI/CD automation and operation on GitHub/GitLab.
  • Development experience with Infrastructure as Code (IaC).
  • Over 3 years of recent experience in building and operating machine learning pipelines using multiple technology stacks.
  • Over 3 years of recent experience in building and operating machine learning features in production environments.
  • Development experience in cloud environments, particularly Google Cloud or AWS (recent experience of at least 3 years).
  • Ability to select technologies and design architectures with consideration for maintainability and operational requirements.
  • Practical experience in security measures at both the infrastructure (e.g., VPC configurations, inter-cloud authentication) and application levels.

Preferred Qualifications

  • Development and operation experience with Kubernetes (e.g., Pods/Services).
  • Practical experience in web development and operations.

Benefits

  • Annual Salary: ¥10,000,000 - ¥15,000,000 (*Consideration given to previous salary.)
  • Salary Revision: Twice a year based on performance reviews.
  • Performance Bonus: Based on company performance.

Insurance and Benefits:

  • Comprehensive social insurance (health insurance, pension, employment insurance, workers' compensation insurance).
  • Commuting allowance (up to ¥50,000/month).
  • Salary increases based on performance (twice a year).

Holidays and Leave:

  • 120 annual holidays, with full two-day weekends (Saturdays and Sundays) and national holidays.
  • Relaxation Leave: 5 days provided upon joining, with an additional 5 days granted annually (separate from annual paid leave).
  • Annual paid leave.
  • New Year’s holidays.
  • Bereavement leave.
  • Maternity and paternity leave.

Additional Perks:

  • Stock purchase program (requires enrollment in the company stock ownership plan).
  • Purchase of necessary books to deepen industry knowledge.
  • Financial support for attending work-related seminars.
  • Subsidies for hobby-related activities to encourage interaction.
  • Monthly support for team meals or activities to enhance communication within and outside the team.
  • Opportunities for interaction with employees from other departments through events like “同期会” (cohort meetings), “社長ランチ” (CEO lunch), and layer-specific exchange gatherings.
  • Celebration bonuses for employees getting married.
  • Growth support system within the Product and Engineering divisions.
  • Skill-Up Allowance: ¥120,000 annually to support learning and growth initiatives.
  • Dedicated Concierge System: Provides support for ancillary tasks to allow employees to focus on core work.
  • Remote HQ Program: Support to create a personalized and efficient remote work environment.

Work time

  • Flextime system (core time from 12:00 to 16:00)
  • Standard working hours: 10:00 AM to 7:00 PM
  • Total working hours in a month are calculated as 8 hours per day multiplied by the number of working days in the month.


Location

Tokyo

facebook-logo

Chat with us!