MATH-PT: A Math Reasoning Benchmark for European and Brazilian Portuguese
A benchmark of 1,729 native Portuguese math problems (European and Brazilian variants) for evaluating mathematical reasoning in modern language models.
PROPOR 2026
A workshop paper proposing orthogonal gradient projection for continual LLM unlearning in recursive self-improvement settings.
ICLR 2026 Workshop on AI with Recursive Self-Improvement