Member of Technical Staff / Anthropic / Co-Founder / Essential AI
Niki Parmar is a pioneering AI researcher and entrepreneur, best known as a co-author of the seminal "Attention Is All You Need" paper, which introduced the Transformer model foundational to modern AI systems like ChatGPT. She has held significant roles at Google Brain and co-founded two prominent AI startups, Adept AI Labs (where she was CTO) and Essential AI. Currently, she serves as a Member of Technical Staff at Anthropic, focusing on frontier capabilities and RL research.
Master's Degree, University of Southern California
Leading RL research on hard exploration tasks, frontier capabilities and test-time scaling
Essential AI
Co-Founder
Jan 2023 - Sep 2024
Position
Co-Founder
A
Adept AI Labs
Co-Founder and CTO at Adept
Nov 2021 - Nov 2022
Responsibilities
Started and Co-founded Adept AI where the vision was to build useful and intelligent models that can take actions with tools in your browser on behalf of users.
Google Brain
Staff Research Scientist
Apr 2021 - Nov 2021
Responsibilities
Core contributor of the “Attention Is All You Need” paper (https://arxiv.org/abs/1706.03762) that invented the Transformer model. I also led scaling transformer models and extended it to other AI problems like Image Generation(https://arxiv.org/abs/1802.05751) and Computer Vision(https://arxiv.org/abs/1906.05909). Transformer based models can write news articles, poetry, emails, translate one language into another, and even answer questions. It has also been used in code generation, protein folding, speech recognition and many other AI models like ChatGPT, GPT3, AlphaStar, AlphaFold, Co-Pilot, Dall-e etc.
Launched some of the first successful deep learning based AI models for Question Answering and Text similarity for Google Search.
Google Brain
Senior Research Scientist
Feb 2020 - Apr 2021
Position
Senior Research Scientist
Google Brain
Senior Research Engineer
Nov 2018 - Feb 2020
Position
Senior Research Engineer
Google Brain
Research Software Engineer
Oct 2017 - Nov 2018
Responsibilities
Deep Learning research for language understanding and vision (images, videos)
Research in Deep Learning for Machine Translation, Images and other generation methods.
Google Research
Software Engineer III
Oct 2016 - Oct 2017
Responsibilities
Research in Deep Learning for Machine Translation, Images and other generation methods.
Google Research
Software Engineer
Jul 2015 - Oct 2016
Responsibilities
Research in Deep Learning for Machine Translation, Images and other generation methods.
Previously worked in Natural language understanding for sentence similarity and question answering systems using end to end deep learning methods.
Computational Social Science Lab , USC
Research Assistant
Jan 2014 - Dec 2015
Responsibilities
https://cssl.usc.edu
▪ Social media analysis to determine how moral content in social media text can predict real-world behavioral phenomena like group dynamics, decision making and culture
▪ Analyzed twitter data during government shutdown (Nov’13) to predict the social distance between users, their rate of activity and the influence each user has on the twitter network. Visualized network structure to show difference in moral values.
▪ Developed a NLP suite to implement algorithms like z-label LDA, clustering, classifiers and social media data extractor.
Amazon
Software Development Engineer Intern
Jun 2014 - Aug 2014
Responsibilities
▪ Part of the Global Action Trace - Distributed Computing team under the E-commerce Platform Services.
▪ Tracking dependency changes for any service in Amazon over time to report changes in call patterns like new or removed dependencies. Tracking volume of calls between services, to report causes of discrepancy or latency issues in the call chains.
▪ Processing was done in a distributed environment to make it scalable to manage millions of service calls made in an hour.
University of Southern California
Database Specialist
Aug 2013 - Jan 2014
Responsibilities
▪ Performance tuning of database using query optimization techniques. Processing student data to generate survey reports and cleaning up redundant data using normalization.
PubMatic
Software Engineer
Jul 2012 - Jul 2013
Responsibilities
▪ Part of the AdServer team responsible for Online AdServing for publishers in real time across display, mobile and video. Included the end to end ad serving process, requesting bids, sending the optimal ad and gathering stats all in realtime.
▪ Worked on designing and implementing video ad serving for publishers supporting video inventory.
▪ Used Hadoop (Map Reduce) to aggregate pixel and audience data stats and writing it to MySQL. Also, worked on predicting the mobile ad price from Ad networks using different parameters like device, os, time, location, etc.
Education
University of Southern California
Master's Degree
2013 - 2015
Field of Study
Master's Degree - Computer Science
Pune Institute of Computer Technology
Bachelor of Engineering (BE)
2008 - 2012
Field of Study
Bachelor of Engineering (BE) - Information Technology
Key Career Achievements
Transformer Architect
Co-authored the seminal 'Attention Is All You Need' paper, introducing the Transformer architecture which underpins modern AI systems like ChatGPT, GPT-3, and AlphaFold.
AI Entrepreneurship
Co-founded two significant AI startups, Adept AI (as CTO and Co-Founder) and Essential AI (Co-Founder, securing $8.0M in initial funding).
Deep Learning Pioneer
Recognized as the youngest and only non-PhD researcher on the Google Brain team at the time, leading key initiatives in scaling and extending Transformer models.
Google Search Innovation
Launched highly successful deep learning-based AI models for Question Answering and Text Similarity systems for Google Search.
Core Competencies
Expertise in foundational AI research, deep learning model architecture, and developing scalable enterprise AI solutions.
AI & Machine Learning
Deep LearningMachine LearningNatural Language ProcessingNatural Language GenerationAlgorithms