Software Engineer - Large Language Models
Role details
Job location
Tech stack
Job description
Full-time | Remote with trips to Silicon Valley office | Reports to FoundersIntroduction:Join us at Fastino as we build the next generation of LLMs. Our team, boasting alumni from Google Research, Apple, Stanford, and Cambridge is on a mission to develop specialized, efficient AI.Fastino's GLiNER family of open source models has been downloaded more than 5 million times and is used by companies such as NVIDIA, Meta, and AirbnbFastino has raised $25M (as featured in TechCrunch) through our seed round and is backed by leading investors including Microsoft, Khosla Ventures, Insight Partners, Github CEO Thomas Dohmke, Docker CEO Scott Johnston, and others.What You'll Work On:Experiment with novel language model architectures, helping drive and execute Fastino's research roadmapOptimize Fastino's multimodal models to improve response quality, instruction adherence, and overall performance metricsArchitect data processing pipelines, implementing filtering, balancing, and captioning systems to, Senior LLM, RAG & Agentic AI Consulting Engineer - Lead / Senior FDE Remote First, some trips to client offices and HQ Lead the design and delivery of complex, AI-native client engagements, spanning agentic systems, retrieval architectures and semantic layers. This is a..., Join our innovative team as an AI Engineer specializing in Natural Language Processing. In this role, you will be responsible for developing sophisticated language models and text analysis tools. Key tasks involve fine-tuning Large Language Models, optimizing prompt...
Requirements
ensure training data quality across diverse content categoriesImplement reinforcement learning techniques including Direct Preference Optimization and Generalized Reward Preference Optimization to align model outputs with human preferences and quality standardsBuild robust and real-world motivated evaluationsPartner with Fastino engineering team to ship model updates directly to customersEstablish best practices for code health and documentation on the team, to facilitate collaboration and reliable developmentWhat We're Looking For:Required - Great velocity for building and shipping agents / AI products.Optional - Advanced degree (Master's or PhD) in Computer Science, Artificial Intelligence, Machine Learning, or related technical discipline with concentrated study in deep learning and computer vision methodologiesOptional - Demonstrated ability to do independent research in Academic or Industry settingsOptional - Substantial industry experience in large-scale deep learning model training, with demonstrated expertise in at least one of Large Language Models, Vision-Language Models, Diffusion Models, or comparable generative AI architecturesOptional - Comprehensive technical proficiency and practical experience with leading deep learning frameworks, including advanced competency in one of PyTorch, JAX, TensorFlow, or equivalent platforms for model development and optimization Similar jobs, Innovation Software Engineer (Numerical Modelling, AI/ML, C++/Python) Guildford, Surrey (On-site) £65000 - £100,000 + 25% Bonus, 10% Pension, Private Medical. - A Masters or PhD Degree in Computing or STEM disciplines. - Can work full-time, 5 days a week in Guildford in...