Alexandra Abbas

Cohort 5
  
Position
Biography

Alexandra is a software engineer specialising in machine learning, with extensive experience in managing engineering teams. She recently served as a technical lead at Wise and has a strong background in data engineering and provisioning infrastructure for machine learning workloads.

Research interests
Alexandra's research interests focus on understanding and mitigating harmful behaviours such as sycophancy and deception through novel techniques. She is also interested in expanding the open source toolkit for frontier model releases.
Focus during his fellowship
Alexandra is focusing on reducing sycophancy in frontier AI models by experimenting with techniques like fine tuning, activation steering and latent adversarial training.
HomepageGithubLinkedInScholarX
Download press photos