Transformer

Self-Attention

Self-Attention を中核とするニューラル構造。 LLM・VLM・音声・蛋白質構造予測まで横断的に使われる。

Transformer は、 2017 年の論文「Attention is All You Need」で提案されたニューラル構造。 Self-Attention によって系列内の任意の位置同士の関係を直接計算できる。 RNN / CNN を置き換え、 LLM (GPT / Claude / Llama) / VLM (LLaVA / Flamingo) / 音声 (Whisper) / 蛋白質構造予測 (AlphaFold 2) まで広く使われる。