News

Sparse autoencoders (SAEs) use the concept of an autoencoder with a slight modification: during the encoding phase, the SAE is forced to activate only a small number of the neurons in the intermediate ...
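As a rough illustration of that constraint, here is a minimal sketch of a sparse autoencoder in PyTorch using a top-k activation rule, one common way to force only a few hidden neurons to fire (an L1 penalty on the hidden code is another). The dimensions and the top-k choice are illustrative assumptions, not any particular implementation.

```python
# Minimal sparse autoencoder sketch. Assumption: top-k sparsity in the
# hidden layer; all sizes below are illustrative.
import torch
import torch.nn as nn


class SparseAutoencoder(nn.Module):
    def __init__(self, d_input: int, d_hidden: int, k: int):
        super().__init__()
        self.encoder = nn.Linear(d_input, d_hidden)
        self.decoder = nn.Linear(d_hidden, d_input)
        self.k = k  # number of hidden units allowed to stay active

    def forward(self, x: torch.Tensor):
        # Encode, then zero out all but the k largest activations, so each
        # input is represented by only a small number of active neurons.
        h = torch.relu(self.encoder(x))
        topk = torch.topk(h, self.k, dim=-1)
        h_sparse = torch.zeros_like(h).scatter_(-1, topk.indices, topk.values)
        return self.decoder(h_sparse), h_sparse


# Reconstruction loss drives the decoder to rebuild the input from the
# sparse code; the top-k mask supplies the sparsity constraint directly.
model = SparseAutoencoder(d_input=512, d_hidden=4096, k=32)
x = torch.randn(8, 512)  # stand-in batch of activation vectors
recon, code = model(x)
loss = torch.mean((recon - x) ** 2)
loss.backward()
```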
MicroCloud Hologram Inc. (NASDAQ: HOLO) ("HOLO" or the "Company"), a technology service provider, announced the deep optimization of stacked sparse autoencoders through the DeepSeek open ...
So far, they can’t interpret all of GPT-4’s behaviors: “Currently, passing GPT-4’s activations through the sparse autoencoder results in a performance equivalent to a model trained with ...
For example, a team led by Samuel Marks, now at Anthropic, used sparse autoencoders to find features that showed a particular model was associating certain professions with a specific gender.
A sparse autoencoder is, essentially, a second, smaller neural network that is trained on the activity of an LLM, looking for distinct patterns in activity when "sparse" (i.e., very small ...
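The hedged sketch below illustrates that "second, smaller network" idea: it hooks one layer of a toy stand-in model, collects that layer's activations, and fits the SparseAutoencoder class from the sketch above to them. The toy model, layer choice, and hyperparameters are all assumptions for illustration, not any real LLM's API.

```python
# Collect a layer's activations with a forward hook, then train the SAE
# on them. The tiny nn.Sequential is a toy stand-in for an LLM; reuses
# the SparseAutoencoder class from the previous sketch.
import torch
import torch.nn as nn

toy_llm = nn.Sequential(nn.Linear(512, 512), nn.GELU(), nn.Linear(512, 512))

captured = []

def save_activations(module, inputs, output):
    captured.append(output.detach())  # record this layer's activity

handle = toy_llm[1].register_forward_hook(save_activations)
with torch.no_grad():
    toy_llm(torch.randn(256, 512))  # stand-in for running text through an LLM
handle.remove()

acts = torch.cat(captured)  # (n_examples, 512) activation dataset
sae = SparseAutoencoder(d_input=512, d_hidden=4096, k=32)
opt = torch.optim.Adam(sae.parameters(), lr=1e-3)
for _ in range(100):  # short illustrative training loop
    recon, _ = sae(acts)
    loss = torch.mean((recon - acts) ** 2)
    opt.zero_grad()
    loss.backward()
    opt.step()
```

Once trained, individual hidden units of the SAE can be inspected as candidate "features" of the model's activity, which is the use described in the excerpts above.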