[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

Notice

Recent Posts

Recent Comments

Link

« 2025/07 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Tags more

Archives

Today

Total

관리 메뉴

Research Notes

[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks 본문

Paper Review

[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

jiachoi 2022. 8. 29. 15:46

Title: Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Authors: Juho Lee, Yoonho Lee, Jungtaek Kim, Adam R. Kosiorek, Seungjin Choi, Yee Whye Teh

0. Abstract

다양한 ML tasks는 set의 순서에 의존하지 않는 *permutation invariant한 특성을 반영하지 못함
기존 모델의 이러한 한계로, Attention-based neural network model인 Set Transformer를 제안함
Set Transformer는 input set 내 요소들의 interaction을 반영할 수 있도록 설계됨. 또한 attention mechanisms에 기반하여 encoder-decoder 구조로 이루어짐
Set Transformer는 sparse Gaussian process literature를 통해 computation time을 절감함

1. Introduction

Set-structured data는 set of instances이 input이며, entire set에 대해 한 label이 존재함. 이러한 set-input 형태 데이터를 처리하기 위한 모델은 아래의 두 가지를 만족해야 함

It should be permutation invariant - 모델의 output은 input set의 순서에 상관 없이 동일해야 함
It should be able to process input sets of any size. - input의 shape이 달라도 이를 처리할 수 있어야 함. 대부분의 neural network-based models는 이를 만족하지 못함

Set-structured data를 처리하기 위해 "set pooling methods"라는 것이 제안됨. 하지만, output의 형태가 black-box라는 단점이 존재함. "amortized clustering"도 parametric mapping을 통해 cluster와 set을 매핑하여, set-structured data를 처리하고자 함. 하지만, quantization이 contents of the set을 반영할 수 없었음

따라서, Set Transformer가 제안됨. Set Transformer의 novelty는 아래와 같음

Self-attention mechanism을 사용해, 모든 element를 input-set으로 처리함. 이는 set 내의 elements 간의 interaction을 반영하여 encoding 할 수 있음
computation time을 줄임
feature를 aggregate할 때, self-attention mechanism을 사용하여 problem이 multiple outputs를 요구할때에도 사용이 가능함

2. Background

Pooling Architecture for Sets

Problems involving a set of objects have the permutation invariance property: the target value for a given set is the same regardless of the order of objects in the set.
A simple example of a permutation invariant model → set element에서 추출된 embedding에 대해 pooling을 수행하는 network

모든 permutation invariant function은 (1)처럼 표현. pool은 sum operator, ρ, φ는 continuous function
(1)은 encoder-decoder 파트로 구분 가능
- encoder(φ): element of a set of n items
- decoder(ρ): which aggregates these encoded features and produces output

Attention

Att(Query, Key, Value). W(Weight, activation function)

n개의 query vector(set with n elements)
dot product Q*K(T)는 query의 each pair들끼리 얼마나 유사한지 측정함
**Attention에 대한 설명은 아래 링크 참조

3. Set Transformer

Set Transformer란?

sets of data를 처리하기 위한 an attention-based neural network
Set Transformer는 encoder-decoder architecture 구조임 (각각의 encoder-decoder의 layer가 activation을 생성)
또한, fixed pooling operation 대신, aggregating function pool(·) 이 problem에 맞게 parameterized 될 수 있음

Permutation Equivariant (Induced) Set Attention Blocks

self-attention → compute interactions among instances during the encoding process
multi-head attention mechanism

Pooling by Multihead Attention

Overall Architecture

Analysis

Since the blocks used to construct the encoder (i.e., SAB, ISAB) are permutation equivariant, the mapping of the encoder X → Z is permutation equivariant as well. Combined with the fact that the PMA in the decoder is a permutation invariant transformation, we have the following:

Proposition 1. The Set Transformer is permutation invariant.

Being able to approximate any function is a desirable property, especially for black-box models such as deep neural networks. Building on previous results about the universal approximation of permutation invariant functions, we prove the universality of Set Transformers:

Proposition 2. The Set Transformer is a universal approximator of permutation invariant function.

* Permutation Invariant 개념 (https://jiachoi.tistory.com/15)

[ML] Permutation invariant의 개념

Permutation Invariant Adjacency matrix의 순서가 바뀌더라도 그 output이 바뀌지 않는 함수의 특성 A: Adjacency matrix, P: 행/열의 순서를 바꾸는 permutation matrix 아래의 식을 통해, adjacency matrix..

jiachoi.tistory.com

** Attention 논문 리뷰(https://jiachoi.tistory.com/4?category=1054535)

[Paper Review] Transformer: Attention is All you Need

< Transformer: Attention is All you Need > Transformer는 구글이 발표한 논문인 "Attention is All you Need"에서 나온 모델이며, Encoder-Decoder를 따르면서도 Attention만으로 구현한 모델이며 자연어..

jiachoi.tistory.com

*** Paper URL (https://arxiv.org/abs/1810.00825)

Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

Many machine learning tasks such as multiple instance learning, 3D shape recognition, and few-shot image classification are defined on sets of instances. Since solutions to such problems do not depend on the order of elements of the set, models used to add

arxiv.org

저작자표시 (새창열림)

'Paper Review' 카테고리의 다른 글

[Paper Review] Manufacturing process similarity measurement model and application based on process constituent elements (0)	2022.09.15
[Paper Review] Reciptor: An Effective Pretrained Model for Recipe Representation Learning (0)	2022.08.30
[Paper Review] Recipe representation learning with networks (0)	2022.08.30
[Paper Review] Transformer Review (with Pytorch) (0)	2022.08.25
[Paper Review] Transformer: Attention is All you Need (0)	2022.03.31

'Paper Review' Related Articles

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

Research Notes

Research Notes

[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks 본문

[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

Title: Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Authors: Juho Lee, Yoonho Lee, Jungtaek Kim, Adam R. Kosiorek, Seungjin Choi, Yee Whye Teh

0. Abstract

1. Introduction

2. Background

3. Set Transformer

Set Transformer란?

Permutation Equivariant (Induced) Set Attention Blocks

Pooling by Multihead Attention

Overall Architecture

Analysis

'Paper Review' 카테고리의 다른 글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역

Research Notes

[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks 본문

[Paper Review] Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks

Title: Set Transformer: A Framework for Attention-based Permutation-Invariant Neural NetworksAuthors: Juho Lee, Yoonho Lee, Jungtaek Kim, Adam R. Kosiorek, Seungjin Choi, Yee Whye Teh

0. Abstract

1. Introduction

2. Background

3. Set Transformer

Set Transformer란?

Permutation Equivariant (Induced) Set Attention Blocks

Pooling by Multihead Attention

Overall Architecture

Analysis

'Paper Review' 카테고리의 다른 글

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역

Title: Set Transformer: A Framework for Attention-based Permutation-Invariant Neural Networks
Authors: Juho Lee, Yoonho Lee, Jungtaek Kim, Adam R. Kosiorek, Seungjin Choi, Yee Whye Teh