Anchor-based Robust Finetuning of Vision-Language Models