• In this blog, we summarize the LaTeX code for the most fundamental equations of multi-task learning (MTL) and transfer learning (TL). Multi-task learning aims to optimize N related tasks simultaneously and to reach a good overall trade-off among them. Typical network structures include shared-bottom models, Cross-Stitch Networks, Multi-gate Mixture-of-Experts (MMoE), Progressive Layered Extraction (PLE), and the Entire Space Multi-Task Model (ESMM). Unlike multi-task learning, which trains all tasks jointly, transfer learning reuses knowledge learned on a source task to improve performance on a target task. In the following sections, we discuss the MTL equations in more detail, so you can use this post as a quick reference.
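As a rough illustration of the multi-task objective described above, the trade-off among N tasks is often written as a weighted sum of per-task losses. This is only a sketch: the symbols $\theta_{sh}$ (shared parameters), $\theta_i$ (parameters specific to task $i$), $w_i$ (task weights), and $\mathcal{L}_i$ (loss of task $i$) are notation assumed here for illustration, and individual models may define the objective differently.

```latex
% Weighted-sum multi-task objective (assumed notation):
%   \theta_{sh}   : parameters shared by all tasks
%   \theta_i      : parameters specific to task i
%   w_i           : non-negative weight of task i
%   \mathcal{L}_i : loss of task i
\min_{\theta_{sh},\, \theta_1, \dots, \theta_N}
  \sum_{i=1}^{N} w_i \, \mathcal{L}_i(\theta_{sh}, \theta_i)
```

Setting all $w_i = 1$ gives the plain sum of task losses; unequal weights are one simple way to express that some tasks matter more than others.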