sert121/orpo_gpt2_hh_helpful-merged
Text Generation
•
0.1B
•
Updated
sert121/orpo_gpt2_hh_helpful
Updated
•
3
sert121/orpo_gpt2_hh_full-merged
Text Generation
•
0.1B
•
Updated
sert121/gpt2_orpo_hh_helpful-merged
Text Generation
•
0.1B
•
Updated
•
2
sert121/pythia2.8_orpo_hh_helfpul-merged
Text Generation
•
3B
•
Updated
•
2
sert121/orpo_run_yash_hh_helpful_gpt2
Updated
sert121/orpo_run_yash_pythia2.8
Updated
sert121/orpo_run_yash_finance_llama_hh_rlhf
Updated
sert121/orpo_run_yash_medicine_llama_hh_rlhf
Updated
sert121/orpo_run_yash_beta_hh_full_gpt2
Updated
sert121/orpo_run_yash_beta_hh_full_pythia
Updated
•
1
sert121/rm_anthropic_hh_gpt2
Text Generation
•
0.1B
•
Updated
sert121/reward_lm_gpt2_hh
Updated
sert121/orpo_run_yash_defog-llama-3-sqlcoder-8b_preference_data_v11_cleaned_filtered
Updated
•
1
sert121/orpo_run_yash_defog-sqlcoder-8b-slerp_preference_data_v11_cleaned_filtered
Updated
•
1
sert121/orpo_run_yash_defog-sqlcoder-8b-dare-ties_preference_data_v11_cleaned_filtered
Updated
•
1
sert121/orpo_run_yash_epochs_10_preference_data_v10_for_orpo
Updated
sert121/orpo_run_yash_epochs_15_preference_data_v10_for_orpo
Updated
sert121/orpo_run_yash_epochs_20_preference_data_v10_for_orpo
Updated
sert121/orpo_run_yash_epochs_25_preference_data_v10_for_orpo
Updated
•
4
sert121/orpo_run_yash_beta_0.5_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_beta_0.1_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_beta_0.3_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_beta_0.4_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_beta_0.2_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_defog-sqlcoder-8b-slerp_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_defog-sqlcoder-8b-dare-ties_preference_data_v10_for_orpo
Updated
•
1
sert121/orpo_run_yash_lora_32_preference_data_v10_for_orpo
Updated