Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
SwissArmyTransformer
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Iterations
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package registry
Container Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
mirrored_repos
MachineLearning
thukeg
SwissArmyTransformer
Repository graph
Repository graph
You can move around the graph by using the arrow keys.
049b6ffcdd6b766f83783bf3746f65693547a7ff
Select Git revision
Branches
20
GLM-130B
MoE
adamop
adapt_env
args
beamsearch
bert
bert_large
bert_new
cait
chatglm-rotary
clever_dataset
clip
cogvideo
config
cp_support
cross
dev
dev_fixnan
develop
Tags
1
v0.1.10
21 results
Begin with the selected commit
Created with Raphaël 2.2.0
30
Mar
29
28
27
22
10
Jan
25
Dec
17
Nov
1
13
Sep
2
28
Aug
22
19
18
16
5
30
Jul
24
22
21
20
19
18
17
16
15
14
11
10
9
7
2
30
Jun
27
26
25
22
21
20
18
17
16
15
14
12
10
9
8
7
6
5
4
3
2
27
May
26
25
23
21
20
19
18
17
15
14
13
12
11
5
4
28
Apr
26
25
22
21
20
19
18
16
14
13
12
10
9
8
28
Mar
27
24
16
23
Feb
19
18
16
14
27
Jan
24
20
19
18
17
15
14
13
12
11
9
7
6
5
31
Dec
30
29
28
23
22
21
20
19
18
17
16
15
13
12
11
10
9
5
4
3
2
1
30
Nov
26
25
23
21
19
14
9
8
7
6
5
3
2
31
Oct
30
29
28
27
26
25
24
23
22
21
20
19
18
14
10
9
8
7
6
5
1
23
Sep
22
16
8
30
Aug
20
18
10
22
Jun
18
17
16
eva2 model
chatglm-6b chat example
chatglm-6b
gpt2 model (#77)
gpt2 model
gpt
gpt
Update tokenizer
Support 2D position encoding for GLM130B model
Re-set cross attention forward parameter for T5
Implement WordPiece tokenizer
Update ice_tokenizer.py
new version
new_finetune
new_finetune
2
finetune
finetune
fix master_addr cannot parse
fb
fb
solve auto-expand
Merge branch 'main' of https://github.com/THUDM/SwissArmyTransformer
Merge pull request #68 from THUDM/GLM-130B
v0.2.12 add dpr & webdataset
Merge branch 'main' of https://github.com/THUDM/SwissArmyTransformer
wrap webdataset
add iterable (wds) support
check LOCAL_RANK & args.device
Update glm130B_model.py
GLM-130B
GLM-130B
Merge pull request #65 from THUDM/dpr
dpr and gpt-neo
dpr
dpr
Merge branch 'main' of https://github.com/THUDM/SwissArmyTransformer
format save args logs
Fix interactive generation with model parallel & update glm-130b model (#64)
BinaryDataset for readonly system
cogvideo
cogvideo
Merge branch 'typofix'
fix some problems
add clip large (2.10)
fix a minor bug of CaiT
Merge pull request #63 from hanyullai/main
fix rotary bug
Merge pull request #62 from THUDM/typofix
fix clip typo
typofix
typofix
Merge pull request #61 from THUDM/args
fix a minor bug
args
args
specify exception
pass in teacher
Loading