AI Plans

company

https://aiplans.org

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

dsouzaJithesh updated a collection about 2 months ago

TinyLlama RLHF Models

dsouzaJithesh updated a collection about 2 months ago

TinyLlama RLHF Models

dsouzaJithesh updated a collection about 2 months ago

TinyLlama RLHF Models

View all activity

AIPlans 's models 47

AIPlans/TinyLlama-1.1B-ORPO-PKU-SafeRLHF

Text Generation • 1B • Updated May 15 • 2

AIPlans/TinyLlama-1.1B-IPO-PKU-SafeRLHF

Text Generation • 1B • Updated May 14 • 17

AIPlans/TinyLlama-1.1B-KTO-SafeRLHF

1B • Updated May 14 • 1

AIPlans/TinyLlama-1.1B-RM-SafeRLHF

AIPlans/tinyllama-1.1b-dpo-pku-saferlhf_2

Text Generation • 1B • Updated May 11 • 18

AIPlans/tinyllama-1.1b-dpo-pku-saferlhf

Text Generation • 1B • Updated May 11 • 19

AIPlans/Qwen2.5-1.5B-KTO-PKU-SafeRLHF

2B • Updated May 5 • 1

AIPlans/Qwen3-0.6B-GRPO-CrossCoder-Only

Updated Apr 19 • 2

AIPlans/Qwen3-0.6B-ORPO-CrossCoder-Only

Updated Apr 18 • 1

AIPlans/Qwen3-0.6B-IPO-CrossCoder-Only

Updated Apr 11 • 3

AIPlans/Qwen3-0.6B-KTO-CrossCoder-Only

Updated Apr 11 • 4

AIPlans/Qwen3-0.6B-PPO-CrossCoder-Only

Updated Apr 10 • 1

AIPlans/Qwen3-0.6B-PPO

Text Generation • 0.6B • Updated Mar 27 • 20 • • 2

AIPlans/Qwen3-0.6B-KTO1

Text Generation • 0.8B • Updated Mar 10 • 4

AIPlans/Qwen3-0.6B-ORPO-Crosscoder-MixedDataset

AIPlans/Qwen3-0.6B-GRPO-Crosscoder-MixedDataset

AIPlans/Qwen3-0.6B-KTO-Crosscoder-MixedDataset

AIPlans/Qwen3-0.6B-IPO-Crosscoder-MixedDataset

AIPlans/Crosscoder_GRPO

AIPlans/Qwen3-0.6B-ReMax

Reinforcement Learning • 0.6B • Updated Dec 22, 2025 • 4 • 2

AIPlans/Qwen3-0.6B-GRPO-RM_NVIDIA

Text Generation • 0.6B • Updated Dec 20, 2025 • 2

AIPlans/Qwen3-0.6B-GRPO_Epoch2

Text Generation • 0.6B • Updated Dec 18, 2025 • 2

AIPlans/Qwen3-0.6B-GRPO_Epoch1

Text Generation • 0.6B • Updated Dec 18, 2025 • 2

AIPlans/Qwen3-0.6B-GRPO

Updated Dec 15, 2025

AIPlans/Qwen3-0.6B-IPO

Reinforcement Learning • 0.6B • Updated Dec 12, 2025 • 12 • 1

AIPlans/qwen3-0.6b-base-PPO-hs2

Updated Dec 11, 2025

AIPlans/Qwen3-0.6B-DPO_Epoch_1

Text Generation • 0.6B • Updated Dec 8, 2025 • 2

AIPlans/Qwen3-0.6B-PPO1

Updated Dec 5, 2025

AIPlans/Qwen3-0.6B-SFT-hs2

Text Generation • 0.6B • Updated Dec 4, 2025 • 97 •

AIPlans/Qwen3-0.6B-RM-hs2

Text Classification • 0.6B • Updated Dec 1, 2025 • 4 • 1