Equation Database
nlp
- BLEU Bilingual Evaluation Understudy
- Binary Cross Entropy Optimization BCO
- Conditional Random Field CRF
- Contrastive Preference Optimization CPO
- Denoising Diffusion Policy Optimization DDPO
- Direct Policy Optimization DPO
- Direct Preference Optimization DPO
- Generalized Knowledge Distillation GKD
- Hidden Markov Model
- KTO Kahneman-Tversky Optimisation Equation
ai
- Binary Cross Entropy Optimization BCO
- Contrastive Preference Optimization CPO
- Denoising Diffusion Policy Optimization DDPO
- Generalized Knowledge Distillation GKD
- KTO Kahneman-Tversky Optimisation Equation
- Low-Rank Adaptation LoRA
- Odds Ratio Preference Optimization ORPO
- RLHF Reinforcement Learning from Human Feedback