SwissArmyTransformer (mirrored_repos / MachineLearning / thukeg): repository graph at commit 91afa917ed43cee124a609b469b4f6c005d1b5c7
Branches (20): GLM-130B, MoE, adamop, adapt_env, args, beamsearch, bert, bert_large, bert_new, cait, chatglm-rotary, clever_dataset, clip, cogvideo, config, cp_support, cross, dev, dev_fixnan, develop
Tags (1): v0.1.10
[Commit graph: date axis omitted. Commit messages, newest first:]
Merge branch 'main' of github.com:THUDM/SwissArmyTransformer into v0.3
v.0.3.1 refactor SwissArmyTransformer as sat
v0.3.0
list info & model-only mode
fix chatglm pad mask and position bug
chatglm batch generation
update chatglm by official repo
tmpsave
fix by fake some args
fix_iterable_val_lastbatch
fix chatglm finetune pad
update chatglm-6b inference
update chatglm-6b finetune
finetune chatglm-6b
update chatglm and eva2 inference
eva2 model
chatglm-6b chat example
chatglm-6b
gpt2 model (#77)
gpt2 model
gpt
gpt
Update tokenizer
Support 2D position encoding for GLM130B model
Re-set cross attention forward parameter for T5
Implement WordPiece tokenizer
Update ice_tokenizer.py
new version
new_finetune
new_finetune
2
finetune
finetune
fix master_addr cannot parse
fb
fb
solve auto-expand
Merge branch 'main' of https://github.com/THUDM/SwissArmyTransformer
Merge pull request #68 from THUDM/GLM-130B
v0.2.12 add dpr & webdataset
Merge branch 'main' of https://github.com/THUDM/SwissArmyTransformer
wrap webdataset
add iterable (wds) support
check LOCAL_RANK & args.device
Update glm130B_model.py
GLM-130B
GLM-130B
Merge pull request #65 from THUDM/dpr
dpr and gpt-neo
dpr
dpr
Merge branch 'main' of https://github.com/THUDM/SwissArmyTransformer
format save args logs