Equation Database
nlp
- Binary Classifier Optimization BCO
- Contrastive Preference Optimization CPO
- Denoising Diffusion Policy Optimization DDPO
- Generalized Knowledge Distillation GKD
- Group Relative Policy Optimization GRPO
- Kahneman-Tversky Optimization KTO
- Low-Rank Adaptation LoRA
- Odds Ratio Preference Optimization ORPO
- Reinforcement Learning from Human Feedback RLHF
ai
- Binary Classifier Optimization BCO
- Contrastive Preference Optimization CPO
- Denoising Diffusion Policy Optimization DDPO
- Generalized Knowledge Distillation GKD
- Group Relative Policy Optimization GRPO
- Kahneman-Tversky Optimization KTO
- Low-Rank Adaptation LoRA
- Odds Ratio Preference Optimization ORPO
- Proximal Policy Optimization PPO
- Reinforcement Learning from Human Feedback RLHF