Career opportunity Senior ML Ops Engineer
Senior ML Ops Engineer
- Locations
- Tokyo
- Salary
- ¥10,000,000〜¥15,000,000 (Annual)
- Recruitment industry
- Backend Engineer, System Engineer
- Employment status
- Full time employee
Job description
- Building, maintaining, and operating MLOps pipelines to efficiently enable the training, evaluation, and prediction of machine learning models.
- Designing, developing, and operating machine learning API servers.
- Developing data pipelines for machine learning systems.
- Designing, developing, and operating machine learning workflows.
- Creating internal libraries to streamline ML-related development tasks.
- Standardizing tech stacks and establishing operational frameworks in line with the latest technological trends to achieve the above objectives.
- Building and managing infrastructure using Infrastructure as Code (IaC).
- Automating deployments through CI/CD pipelines.
- Designing and operating monitoring systems for services.
- Implementing measures to enhance Site Reliability, including performance tuning.
- Training, skill transfers, and recruitment efforts for internal data scientists and related roles.
- Formulating and promoting strategies for utilizing the ML platform and driving company-wide efforts to foster an environment for machine learning utilization and awareness.
Requirement
Required Qualifications
- Over 5 years of experience in system development, including data extraction and processing using SQL and programming languages like Python.
- Development and operation experience with ML frameworks such as Vertex AI, Kubeflow, SageMaker, TFX, or Flyte.
- Experience in development and operation with one of the following: dbt, Snowflake, or Databricks.
- Team development experience using Git.
- Experience with CI/CD automation and operation on GitHub/GitLab.
- Development experience with Infrastructure as Code (IaC).
- Over 3 years of recent experience in building and operating machine learning pipelines using multiple technology stacks.
- Over 3 years of recent experience in building and operating machine learning features in production environments.
- Development experience in cloud environments, particularly Google Cloud or AWS (recent experience of at least 3 years).
- Ability to select technologies and design architectures with consideration for maintainability and operational requirements.
- Practical experience in security measures at both the infrastructure (e.g., VPC configurations, inter-cloud authentication) and application levels.
Preferred Qualifications
- Development and operation experience with Kubernetes (e.g., Pods/Services).
- Practical experience in web development and operations.
Benefits
- Annual Salary: ¥10,000,000 - ¥15,000,000 (*Consideration given to previous salary.)
- Salary Revision: Twice a year based on performance reviews.
- Performance Bonus: Based on company performance.
Insurance and Benefits:
- Comprehensive social insurance (health insurance, pension, employment insurance, workers' compensation insurance).
- Commuting allowance (up to ¥50,000/month).
- Salary increases based on performance (twice a year).
Holidays and Leave:
- 120 annual holidays, with full two-day weekends (Saturdays and Sundays) and national holidays.
- Relaxation Leave: 5 days provided upon joining, with an additional 5 days granted annually (separate from annual paid leave).
- Annual paid leave.
- New Year’s holidays.
- Bereavement leave.
- Maternity and paternity leave.
Additional Perks:
- Stock purchase program (requires enrollment in the company stock ownership plan).
- Purchase of necessary books to deepen industry knowledge.
- Financial support for attending work-related seminars.
- Subsidies for hobby-related activities to encourage interaction.
- Monthly support for team meals or activities to enhance communication within and outside the team.
- Opportunities for interaction with employees from other departments through events like “同期会” (cohort meetings), “社長ランチ” (CEO lunch), and layer-specific exchange gatherings.
- Celebration bonuses for employees getting married.
- Growth support system within the Product and Engineering divisions.
- Skill-Up Allowance: ¥120,000 annually to support learning and growth initiatives.
- Dedicated Concierge System: Provides support for ancillary tasks to allow employees to focus on core work.
- Remote HQ Program: Support to create a personalized and efficient remote work environment.
Work time
- Flextime system (core time from 12:00 to 16:00)
- Standard working hours: 10:00 AM to 7:00 PM
- Total working hours in a month are calculated as 8 hours per day multiplied by the number of working days in the month.
Location
Tokyo
Other jobs
- All
- Tokyo