Skip to content

GitLab

Explore

Sign in

Primary navigation

Project

SwissArmyTransformer
- Activity
- Members
- Labels
- Issues
- Issue boards
- Milestones
- Iterations
- Wiki
- Environments
- Terraform modules
- Incidents
- Service Desk

Snippets Groups Projects

34990eea

Commit 34990eea authored 3 years ago by Ming Ding

Downloads
- Patches
- Plain Diff

sparse 2d and cache qkv

parent 8abd84d6

No related branches found

No related tags found

No related merge requests found

Changes 4

Expand all Hide whitespace changes

Inline Side-by-side

Showing

mpu/local_attention_function.py 149 additions, 0 deletions

mpu/local_attention_function.py
mpu/sparse_transformer.py 164 additions, 237 deletions

mpu/sparse_transformer.py
mpu/utils.py 4 additions, 0 deletions

mpu/utils.py
test_sparse_attention.py 169 additions, 0 deletions

test_sparse_attention.py

with 486 additions and 237 deletions

Loading

0% Loading or .

You are about to add 0 people to the discussion. Proceed with caution.

Finish editing this message first!

Please register or sign in to comment

Strive to be the person your dogs believe you are