LLM safety (UdS, Winter 25-26)
Seminar, Saarland University, 2025
Seminar, Saarland University, 2025
Seminar, Saarland University, 2025
Seminar, Saarland University, 2026
Seminar, Saarland University, 2026
Published:
Interactive tool for visualizing attribution patterns in language models with support for multiple attribution methods.
Published:
Python toolkit for detecting and analyzing memorization patterns in neural language models with support for various detection methods.
Download here