About Guangyu Chen
Guangyu Chen (Nathan) is a machine learning researcher at Kimi, Moonshot AI, specializing in model architecture, efficient attention mechanisms, and continual learning. Despite being a high school student from Shenzhen, China, he has made significant contributions to AI research, most notably as co-first author of the "Attention Residuals" paper, which proposes replacing the fixed residual connections in Transformers with learned, input-dependent attention over depth. The paper drew widespread attention, including praise from Elon Musk and Andrej Karpathy. Chen's journey into ML began with contributions to the Flash Linear Attention open-source project, which led to his invitation to join the Kimi team.
Education
Senior Year
Currently a high school senior at an international school in Shenzhen, Guangdong, China. His interest in machine learning began through self-directed learning, competitive programming on Codeforces, and contributing to open-source projects.
Skills
Core technical competencies in machine learning research, model architecture, and systems optimization.
Efficient Attention
Model Architecture
Linear Attention
Residual Connections
Scaling Laws
Continual Learning
Triton Kernels
CUDA Optimization
GPU Compute
PyTorch
Distributed Training
Codeforces
Algorithms
Flash Linear Attention
Open Source Development
Python
Domains
Efficient Attention Mechanisms, Transformer Architecture, Model Scaling
Continual Learning, Hardware-Aligned ML Algorithms, Interpretability
Tags
Self-taught, Open Source Advocate, Prodigious, Driven, Collaborative
Novel Architectures, Scaling Efficiency, Depth-wise Aggregation, Linear Attention
Experience
ML Researcher
Working on ML research at Kimi, focusing on model architecture and efficient attention mechanisms. Co-first author of the Attention Residuals paper, which proposes replacing fixed residual connections with softmax attention over preceding layer outputs, achieving approximately 25% compute efficiency improvement on the Kimi Linear architecture (48B total / 3B activated parameters).
Research Intern
Completed a seven-week internship at a Silicon Valley AI startup. Managed a project involving 144 H100 GPUs and engaged directly with leadership on operational matters including recruitment and financing discussions.
Research Contributor
Worked on applied interpretability research, exploring how neural networks process and represent information internally.
Notable Work
Attention Residuals — Co-first author. Proposes AttnRes, replacing fixed residual accumulation with softmax attention over preceding layer outputs. Validated on the Kimi Linear 48B architecture with 1.4T training tokens, achieving an ~25% compute efficiency improvement with <2% inference latency overhead. Praised by Elon Musk and Andrej Karpathy.
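The core idea above (letting each layer attend over the outputs of all preceding layers instead of summing them with a fixed residual) can be illustrated with a minimal PyTorch sketch. This is an illustration of the general depth-wise attention pattern, not the paper's implementation; the function and parameter names (`attn_residual`, `query_proj`) are hypothetical.

```python
import torch


def attn_residual(history, query_proj):
    """Aggregate preceding layer outputs via softmax attention over depth.

    history: list of per-layer outputs, each of shape (batch, dim).
    query_proj: a module producing a query from the latest layer output.
    Returns an input-dependent mixture of shape (batch, dim), replacing the
    fixed sum x + f(x) of a standard residual connection.
    """
    H = torch.stack(history, dim=1)                # (batch, depth, dim)
    q = query_proj(H[:, -1])                       # query from newest output
    scores = torch.einsum("bd,bld->bl", q, H) / H.size(-1) ** 0.5
    weights = torch.softmax(scores, dim=-1)        # learned weights over depth
    return torch.einsum("bl,bld->bd", weights, H)  # weighted depth aggregate
```

Because the softmax weights depend on the current hidden state, each token can emphasize different depths, whereas a fixed residual always weights every preceding layer equally.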
Core contributor to Flash Linear Attention (FLA), an open-source project (4.7K+ GitHub stars) providing efficient PyTorch and Triton implementations of state-of-the-art linear attention models.
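For context on what linear attention computes, here is a minimal non-causal sketch in plain PyTorch. It is not FLA's kernel code (FLA uses fused Triton kernels and causal/chunked variants); the feature map `elu(x) + 1` is one common choice, assumed here for illustration.

```python
import torch
import torch.nn.functional as F


def linear_attention(q, k, v):
    """Non-causal linear attention: O(seq * dim^2) instead of O(seq^2 * dim).

    q, k, v: tensors of shape (batch, seq, dim).
    A positive feature map replaces the softmax, so the key-value product
    can be summed once and reused for every query position.
    """
    phi = lambda x: F.elu(x) + 1.0                 # positive feature map
    q, k = phi(q), phi(k)
    kv = torch.einsum("bsd,bse->bde", k, v)        # sum_s phi(k_s) v_s^T
    z = k.sum(dim=1)                               # normalizer sum_s phi(k_s)
    out = torch.einsum("bsd,bde->bse", q, kv)
    denom = torch.einsum("bsd,bd->bs", q, z).unsqueeze(-1)
    return out / denom
```

The key property is that the per-sequence state `kv` has fixed size (dim x dim), which is what makes hardware-efficient Triton implementations like FLA's attractive for long contexts.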
Profile Summary
Efficient Attention, Model Architecture, Linear Attention, Continual Learning, Open Source
