KyuheeKim's picture

5

KyuheeKim

koreankiwi99

·

AI & ML interests

None yet

Organizations

Collections 5

View 5 collections

models 86

koreankiwi99/llama-3.1-8b-paraphrase-qlora-10000

Updated Oct 28, 2025

koreankiwi99/0_predpo_lower_beta_balanced_lower_beta_mnlp_aggregate

0.6B • Updated Jun 7, 2025 • 4

koreankiwi99/1_predpo_tuned_balanced_lower_beta_mnlp_aggregate

0.6B • Updated Jun 7, 2025 • 2

koreankiwi99/2_predpo_base_balanced_plus_lower_beta_mnlp_aggregate

0.6B • Updated Jun 7, 2025 • 1

koreankiwi99/3_predpo_base_curriculum_lower_beta_mnlp_aggregate

0.6B • Updated Jun 7, 2025 • 1

koreankiwi99/4_dpo_curriculum_lower_beta_mnlp_aggregate

0.6B • Updated Jun 7, 2025

koreankiwi99/5_dpo_balanced_plus_lower_beta_mnlp_aggregate

0.6B • Updated Jun 7, 2025

koreankiwi99/dpo_model_predpo_config_mnlp_aggregate

0.6B • Updated Jun 7, 2025 • 1

koreankiwi99/sft_model_sft_base_mnlp_stem_curriculum

0.6B • Updated Jun 7, 2025 • 1

koreankiwi99/sft_model_sft_base_mnlp_stem_balanced_plus

0.6B • Updated Jun 6, 2025 • 2

datasets 18

koreankiwi99/Nunchi-Bench

Preview • Updated Jul 23, 2025 • 24 • 1

koreankiwi99/MNLP_M3_dpo_dataset

Viewer • Updated Jun 10, 2025 • 135k • 4

koreankiwi99/helpsteer3-dpo-general

Viewer • Updated Jun 10, 2025 • 915 • 3

koreankiwi99/helpsteer3-dpo-stem

Viewer • Updated Jun 10, 2025 • 243 • 4

koreankiwi99/helpsteer3-dpo-code

Viewer • Updated Jun 10, 2025 • 432 • 4

koreankiwi99/mtbench-dpo-turn1-gpt4_pair

Viewer • Updated Jun 10, 2025 • 882 • 5

koreankiwi99/mtbench-dpo-turn1-human

Viewer • Updated Jun 10, 2025 • 1.28k • 4

koreankiwi99/hh-dpo-eval

Viewer • Updated Jun 8, 2025 • 8.53k • 5

koreankiwi99/mnlp_stem_curriculum

Viewer • Updated Jun 6, 2025 • 31.8k • 5

koreankiwi99/mnlp_stem_balanced_plus

Viewer • Updated Jun 6, 2025 • 40.8k • 5

View 18 datasets