Research Engineer, Production Model Post-Training

Thomson Reuters

Charing Cross, United Kingdom

2 days ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Senior

Compensation

£ 260K

Job location

Charing Cross, United Kingdom

Tech stack

Artificial Intelligence

Software Debugging

Distributed Systems

Python

Machine Learning

Software Engineering

Deep Learning

Information Technology

Job description

About Anthropic Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.Note: We are not actively hiring in this location for this team at the time, but we are keeping this up to collect expressions of interest. Once we are hiring again, we may reach out to you if we see a mutual fit. Please consider applying to our Zürich or US opening for this team.About The Role Anthropic's production models undergo sophisticated post-training processes to enhance their capabilities, alignment, and safety. As a Research Engineer on our Post-Training team, you'll train our base models through the complete post-training stack to deliver the production Claude models that users interact with.You'll work at the intersection of cutting-edge research and production engineering - advancing post-training techniques and captaining production runs at frontier scale. Your work will directly impact the quality, safety, and capabilities of our production models.Note: For this role, we conduct all interviews in Python. This role may require responding to incidents on short-notice, including on weekends.ResponsibilitiesImplement and optimize post-training techniques at scale on frontier modelsConduct research to develop and optimize post-training recipes that directly improve production model qualityDesign, build, and run robust, efficient pipelines for model fine-tuning and evaluationDevelop tools to measure and improve model performance across various dimensionsCollaborate with research teams to translate emerging techniques into production-ready implementationsDebug complex issues in training pipelines and model behaviorHelp establish best practices for reliable, reproducible model post-trainingYou May Be a Good Fit If YouThrive in controlled chaos and are energised, rather than overwhelmed, when juggling multiple urgent prioritiesAdapt quickly to changing prioritiesMaintain clarity when debugging complex, time-sensitive issuesHave strong software engineering skills with experience building complex ML systemsAre comfortable working with large-scale distributed systems and high-performance computingHave experience with training, fine-tuning, or evaluating large language modelsCan balance research exploration with engineering rigor and operational reliabilityAre adept at analyzing and debugging model training processesEnjoy collaborating across research and engineering disciplinesCan navigate ambiguity and make progress in fast-moving research environmentsStrong Candidates May AlsoHave experience with LLMsHave a keen interest in AI safety and responsible deploymentWe welcome candidates at various experience levels, with a preference for senior engineers who have hands-on experience with, Why PlayStation? PlayStation isn't just the Best Place to Play - it's also the Best Place to Work. Today, we're recognized as a global leader in entertainment producing The PlayStation family of products and services including PlayStation®5, PlayStation®4, PlayStation®VR,..., Diligent is the AI leader in governance, risk and compliance (GRC) SaaS solutions, helping more than 1 million users and 700,000 board members to clarify risk and elevate governance. The Diligent One Platform gives practitioners, the C-Suite and the board a consolidated..., Are you a curious and open-minded individual with an interest in conducting state-of-the-art foundational machine learning research? Thomson Reuters Labs is seeking Research Scientists with a passion for building complex AI systems in a data-rich, complex academic...

Requirements

frontier AI systems. However, proficiency in Python, deep learning frameworks, and distributed computing is required for this role.The annual compensation range for this role is listed below. For sales roles, the range provided is the role's On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.Annual Salary £260,000-£370,000 GBPEducation Requirements We require at least a Bachelor's degree in a related field or equivalent experience.Location-based Hybrid Policy Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.Visa Sponsorship We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.Safety Notice Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links-visit anthropic.com/careers directly for confirmed position openings.How We're Different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact-advancing our long-term goals of steerable, trustworthy AI-rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in

Benefits & conditions

common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.Come Work With Us Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.#J-18808-Ljbffr Similar jobs

About the company

Research Scientist/Engineer, Biological ModelsLondon, UK About the AI Security InstituteThe AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We're in the heart of the..., About the AI Security Institute The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We're in the heart of the UK government with direct lines to No. 10 (the Prime..., Software is eating the world, but AI is eating software. We live in unprecedented times - AI has the potential to exponentially augment human intelligence. Every person will have a personal tutor, coach, assistant, personal shopper, travel guide, and therapist throughout..., Synthesia is the world's leading AI video platform for business, used by over 90% of the Fortune 100. Founded in 2017, the company is headquartered in London, with offices and teams across Europe and the US.As AI continues to shape the way we live and work, Synthesia...