Zhan Su
zhan1993
AI & ML interests
None yet
Organizations
None yet
models
76
zhan1993/library-mistral7B_flan_5ep_higher_lr
Updated
zhan1993/BeaverTails_filtered_train_experts
Updated
zhan1993/shared_experts_trained_from_lora_soup_llama
Updated
•
4
zhan1993/phi-3-10-clusters-Spectral-merge
Text Generation
•
4B
•
Updated
•
6
zhan1993/gpt-neo-125m_merged_lora_merge
Text Generation
•
0.1B
•
Updated
•
5
zhan1993/mathqa_trained_from_lorasoup
Updated
•
4
zhan1993/code_trained_from_lorasoup
Updated
•
2
zhan1993/gptneo_1B_flan_10_experts-epoch_2
Updated
zhan1993/library-phi_2-v3-10-flan-clusters
Updated
zhan1993/trained_gpt125m_experts_colab
Updated
datasets
35
zhan1993/task_adapter_dataset
Viewer
•
Updated
•
3.13k
•
20
zhan1993/BeaverTails_filtered_safe
Viewer
•
Updated
•
16k
•
32
zhan1993/coconot_experts_train
Viewer
•
Updated
•
12.5k
•
8
zhan1993/BeaverTails_filtered_train
Viewer
•
Updated
•
185k
•
24
zhan1993/BeaverTails_filtered_test
Viewer
•
Updated
•
18.7k
•
18
zhan1993/gsm-8k-perturb
Viewer
•
Updated
•
1.32k
•
7
zhan1993/mathqa_lora_soup_repo
Viewer
•
Updated
•
395k
•
16
zhan1993/coconot_original_train_routing
Viewer
•
Updated
•
500
•
14
zhan1993/coconot_contrast_eval
Viewer
•
Updated
•
379
•
21
zhan1993/coconot_original_eval
Viewer
•
Updated
•
1k
•
10