SwissArmyTransformer
Graph
tokenization
[Commit graph; date axis runs newest first, from 21 Apr back to 16 Jun]
Merge pull request #85 from THUDM/optional_arch_args_save
fix args
try & support cross_attn_hidden_size
support file with non-ascii characters
Merge branch 'v0.3' into main
fix deepspeed mpu simpleinit error and support 0.9
Merge pull request #83 from THUDM/registry
dynamic loading
registry
registry
Merge pull request #80 from zhangfanTJU/fix-chatmodel-order
Update chat_model.py
Update README.md
fix-chatmodel-order
support batch generation for chatglm
Merge pull request #79 from THUDM/v0.3
Merge branch 'main' of github.com:THUDM/SwissArmyTransformer into v0.3
v0.3
v0.3
Merge pull request #75 from xloem/cross-attention
change the pkg name back .
change from_pretrained args-name order
Merge branch 'main' of github.com:THUDM/SwissArmyTransformer into v0.3
v.0.3.1 refactor SwissArmyTransformer as sat
v0.3.0
list info & model-only mode
fix chatglm pad mask and position bug
chatglm batch generation
update chatglm by official repo
tmpsave
fix by fake some args
fix_iterable_va…
fix_iterable_val_lastbatch
fix chatglm finetune pad
update chatglm-6b inference
update chatglm-6b finetune
finetune chatglm-6b
update chatglm and eva2 inference
eva2 model
chatglm-6b chat example
chatglm-6b
gpt2 model (#77)
gpt2 model
gpt
gpt
Update tokenizer
Support 2D position encoding for GLM130B model
Re-set cross attention forward parameter for T5