Arda Senocak

Orcid: 0000-0001-9141-3270

According to our database¹, Arda Senocak authored at least 21 papers between 2018 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Bibliography

2024

Audio Mamba: Bidirectional State Space Model for Audio Representation Learning.

IEEE Signal Process. Lett., 2024

Sound2Vision: Generating Diverse Visuals from Audio through Cross-Modal Latent Alignment.

CoRR, 2024

AVHBench: A Cross-Modal Hallucination Benchmark for Audio-Visual Large Language Models.

CoRR, 2024

Aligning Sight and Sound: Advanced Sound Source Localization Through Audio-Visual Alignment.

CoRR, 2024

ElasticAST: An Audio Spectrogram Transformer for All Length and Resolutions.

CoRR, 2024

Can CLIP Help Sound Source Localization?

Sooyoung Park, Arda Senocak, Joon Son Chung

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Speech Guided Masked Image Modeling for Visually Grounded Speech.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2024

2023

Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

FlexiAST: Flexibility is What AST Needs.

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Sound Source Localization is All about Cross-Modal Alignment.

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

MarginNCE: Robust Sound Localization with a Negative Margin.

Sooyoung Park, Arda Senocak, Joon Son Chung

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2023

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment.

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Audio-Visual Fusion Layers for Event Type Aware Video Recognition.

CoRR, 2022

Less Can Be More: Sound Source Localization With a Classification Model.

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Learning Sound Localization Better from Semantically Similar Samples.

Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, 2022

2021

Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications.

IEEE Trans. Pattern Anal. Mach. Intell., 2021

2018

Part-Based Player Identification Using Deep Convolutional Representation and Multi-Scale Pooling.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

On Learning Association of Sound Source and Visual Scenes.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Learning to Localize Sound Source in Visual Scenes.

Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
