Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
SwissArmyTransformer
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Iterations
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Package Registry
Container Registry
Model registry
Operate
Environments
Terraform modules
Monitor
Incidents
Service Desk
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Code review analytics
Issue analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Terms and privacy
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
mirrored_repos
MachineLearning
thukeg
SwissArmyTransformer
Graph
18be5e6a9692d1b4d11a85b39f857ede890cd1e6
Select Git revision
Branches
20
GLM-130B
MoE
adamop
adapt_env
args
beamsearch
bert
bert_large
bert_new
cait
chatglm-rotary
clever_dataset
clip
cogvideo
config
cp_support
cross
dev
dev_fixnan
develop
Tags
1
v0.1.10
21 results
You can move around the graph by using the arrow keys.
Begin with the selected commit
Created with Raphaël 2.2.0
16
Jun
15
14
12
10
9
8
7
6
5
4
3
2
27
May
26
25
23
21
20
19
18
17
15
14
13
12
11
5
4
28
Apr
26
25
22
21
20
19
18
16
14
13
12
10
9
8
28
Mar
27
24
16
23
Feb
19
18
16
14
27
Jan
24
20
19
18
17
15
14
13
12
11
9
7
6
5
31
Dec
30
29
28
23
22
21
20
19
18
17
16
15
13
12
11
10
9
5
4
3
2
1
30
Nov
26
25
23
21
19
14
9
8
7
6
5
3
2
31
Oct
30
29
28
27
26
25
24
23
22
21
20
19
18
14
10
9
8
7
6
5
1
23
Sep
22
16
8
30
Aug
20
18
10
22
Jun
18
17
16
make apex optional
Merge pull request #47 from pierrefdz/main
updated url for cogview
v0.2.1
Merge pull request #45 from THUDM/main_from
Merge remote-tracking branch 'origin/main' into main_from
main_from
main_from
add cogview2
Merge pull request #44 from THUDM/bert_large
support roberta
bert_large
bert_large
support bert large
Merge pull request #43 from THUDM/main_from
v0.2
update readme
save ckpt create model_config.json
layernorm-order and tokenizer-type args
Merge branch 'main_from' of github.com:THUDM/SwissArmyTransformer into main_from
make training_main tokenizer-free, load hf
merge main and resolve conflict
update examples to new layernorm args
add pre/post/sandwich options
Merge branch 'main_from' of github.com:THUDM/SwissArmyTransformer into main_from
move init distributed and seed to get_args
adapt yolos to new version
adapt clip to new version
adapt cait to new version
adapt deit to new version
adapt vit to new version
add model type args
adapt bert to new version
change log
Merge branch 'main_from' of github.com:THUDM/SwissArmyTransformer into main_from
fix lock and update_args
tmp_from_pretrain
update model name and url
merge
new param format
reformat args & add default zero-stage
update from_pretrained for more models
move transformer.py to model and out, make ops folder
split hooks out
Loading