Jannik Brinkmann

According to our database1, Jannik Brinkmann authored at least 10 papers between 2023 and 2025.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2025
Large Language Models Share Representations of Latent Grammatical Concepts Across Typologically Diverse Languages.
CoRR, January, 2025

NSA: Neuro-symbolic ARC Challenge.
CoRR, January, 2025

2024
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability.
CoRR, 2024

NNsight and NDIF: Democratizing Access to Foundation Model Internals.
CoRR, 2024

Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models.
Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Unsupervised Extraction of Test Scenarios from Time-Series Sensor Data using Trace Graphs.
Proceedings of the 57th Hawaii International Conference on System Sciences, 2024

GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems.
Proceedings of the 23rd International Conference on Autonomous Agents and Multiagent Systems, 2024

A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task.
Proceedings of the Findings of the Association for Computational Linguistics, 2024

2023
A Multidimensional Analysis of Social Biases in Vision Transformers.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Bias Mitigation for Large Language Models using Adversarial Learning.
Proceedings of the 1st Workshop on Fairness and Bias in AI co-located with 26th European Conference on Artificial Intelligence (ECAI 2023), 2023


  Loading...