Research Scientist - Model Evaluation Job at Lumicity, San Francisco, CA

  • Lumicity
  • San Francisco, CA

Job Description

Base pay range: $180,000.00/yr - $240,000.00/yr (provided by Lumicity; actual pay will be based on your skills and experience)

Join a team at the forefront of AI model evaluation, setting the standard for how large language models are tested and validated. In this role, you'll assess the latest AI models, design new benchmarks, and develop advanced evaluation methodologies. You'll work closely with engineers, AI researchers, and enterprise clients to ensure cutting-edge AI systems meet the highest standards. This role is a bridge between research and practical implementation and will suit someone who enjoys taking academic papers and creating working models.

Key Responsibilities:
  • Analyze and benchmark newly released AI models (DeepSeek, Gemini, etc.)
  • Develop and implement novel evaluation frameworks
  • Build datasets, manage labeling processes, and publish findings
  • Enhance automated evaluation techniques for AI-generated content
  • Collaborate with top AI labs and enterprise partners to refine best practices

Who You Are:
  • MSc or PhD from a leading Computer Science or Machine Learning school
  • At least 3 years of experience in applied AI, with a focus on benchmarking or model evaluation
  • Passion for advancing AI assessment standards
  • Solid Python, PyTorch/TensorFlow, and Django skills

Make a real impact in AI research and development. Apply today!

Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Research and Engineering
Industries: Software Development

Job Tags

Full time

Similar Jobs

Equinox

Personal Trainer, Woodland Hills Job at Equinox

 ...As an Equinox personal trainer your career becomes an empowered lifestyle founded on maximizing both your personal and client performance. Under the guidance of two dedicated managers you will develop and refine an approach to programming, education, business, and... 

Access Healthcare

Travel Registered Nurse - MDS Coordinator Job at Access Healthcare

 ...Job Description Access Healthcare is seeking a travel nurse RN Clinical Coordinator, Case Management for a travel nursing job in Covina, California. Job Description & Requirements ~ Specialty: Case Management ~ Discipline: RN ~ Start Date: 07/14/2025~ Duration... 

Capital One - CA

Senior Associate, Data Analyst Job at Capital One - CA

 ...Job Description 161 Bay Street (93021), Canada, Toronto,Toronto, Ontario, Senior Associate, Data Analyst Our Capital One DA Team. Data is at the center of everything we do. It started in 1988, when we launched as a startup with the goal of using data to... 

Vaco by Highspring

Global Treasury Manager Job at Vaco by Highspring

 ...support the Company's growth and strategy. The Global Treasury Manager is an integral member of the Corporate Treasury Team and will oversee...  ...Treasury workstation. Responsibilities * Oversee global cash management including global cash positioning, global cash... 

Packaging Manufacturer

Operations Planner (Mandarin Chinese Required) Job at Packaging Manufacturer

 ...candidate will have strong planning skills, cultural fluency in Chinese business practices, and the ability to work across multiple...  ...international markets is a plus Fluency in Mandarin Chinese (speaking and writing) is highly preferred; proficiency in English is required...