Arda Senocak

Orcid: 0000-0001-9141-3270

According to our database1, Arda Senocak authored at least 20 papers between 2018 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning.
IEEE Signal Process. Lett., 2024

AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models.
CoRR, 2024

Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment.
CoRR, 2024

ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions.
CoRR, 2024

Can CLIP Help Sound Source Localization?
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Speech Guided Masked Image Modeling for Visually Grounded Speech.
Proceedings of the IEEE International Conference on Acoustics, 2024

From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

FlexiAST: Flexibility is What AST Needs.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Sound Source Localization is All about Cross-Modal Alignment.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples.
Proceedings of the IEEE International Conference on Acoustics, 2023

MarginNCE: Robust Sound Localization with a Negative Margin.
Proceedings of the IEEE International Conference on Acoustics, 2023

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Audio-Visual Fusion Layers for Event Type Aware Video Recognition.
CoRR, 2022

Less Can Be More: Sound Source Localization With a Classification Model.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Learning Sound Localization Better from Semantically Similar Samples.
Proceedings of the IEEE International Conference on Acoustics, 2022

2021
Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications.
IEEE Trans. Pattern Anal. Mach. Intell., 2021

2018
Part-Based Player Identification Using Deep Convolutional Representation and Multi-Scale Pooling.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

On Learning Association of Sound Source and Visual Scenes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Learning to Localize Sound Source in Visual Scenes.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018


  Loading...