2025

Steering Guidance for Personalized Text-to-Image Diffusion Models.

[DOI]

,

,

,

CoRR, August, 2025

From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation.

[DOI]

,

,

,

,

,

CoRR, July, 2025

Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies.

[DOI]

,

,

,

,

CoRR, July, 2025

ConsNoTrainLoRA: Data-driven Weight Initialization of Low-rank Adapters using Constraints.

[DOI]

,

,

,

,

,

CoRR, July, 2025

MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans.

[DOI]

Shubhankar Borse

,

,

,

,

,

Risheek Garrepalli

,

,

,

CoRR, June, 2025

Tripartite Weight-Space Ensemble for Few-Shot Class-Incremental Learning.

[DOI]

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

2024

Hollowed Net for On-Device Personalization of Text-to-Image Diffusion Models.

[DOI]

,

,

,

Matthias Reisser

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024, 2024

Balanced Learning for Multi-Domain Long-Tailed Speaker Recognition.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2024

Feature Diversification and Adaptation for Federated Domain Generalization.

[DOI]

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

FedHide: Federated Learning by Hiding in the Neighbors.

[DOI]

,

Proceedings of the Computer Vision - ECCV 2024, 2024

2023

Multi-Scale Temporal Feature Fusion for Few-Shot Action Recognition.

[DOI]

,

Proceedings of the IEEE International Conference on Image Processing, 2023

Label Shift Adapter for Test-Time Adaptation under Covariate and Label Shifts.

[DOI]

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Few-Shot Common Action Localization via Cross-Attentional Fusion of Context and Temporal Dynamics.

[DOI]

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Neural Transformation Network to Generate Diverse Views for Contrastive Learning.

[DOI]

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Progressive Random Convolutions for Single Domain Generalization.

[DOI]

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Leaky Gated Cross-Attention for Weakly Supervised Multi-Modal Temporal Action Localization.

[DOI]

,

,

Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Domain Agnostic Few-shot Learning for Speaker Verification.

[DOI]

,

,

,

,

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

ConFeSS: A Framework for Single Source Cross-Domain Few-Shot Learning.

[DOI]

,

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

Improving Test-Time Adaptation Via Shift-Agnostic Weight Regularization and Nearest Source Prototypes.

[DOI]

,

,

,

Proceedings of the Computer Vision - ECCV 2022, 2022

Multi-Head Modularization to Leverage Generalization Capability in Multi-Modal Networks.

[DOI]

,

,

,

Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence, 2022

2021

Distribution Estimation to Automate Transformation Policies for Self-Supervision.

[DOI]

,

,

,

,

CoRR, 2021

Federated Learning of User Verification Models Without Sharing Embeddings.

[DOI]

Hossein Hosseini

,

,

,

Christos Louizos

,

,

Proceedings of the 38th International Conference on Machine Learning, 2021

Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization.

[DOI]

,

,

,

Proceedings of the 9th International Conference on Learning Representations, 2021

Efficient Action Recognition via Dynamic Knowledge Propagation.

[DOI]

,

,

,

,

Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

Prototype-Based Personalized Pruning.

[DOI]

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

Subspectral Normalization for Neural Audio Data Processing.

[DOI]

,

,

,

,

,

Proceedings of the IEEE International Conference on Acoustics, 2021

2020

Federated Learning of User Authentication Models.

[DOI]

Hossein Hosseini

,

,

,

Christos Louizos

,

,

CoRR, 2020

End-to-End Lane Marker Detection via Row-wise Classification.

[DOI]

,

,

,

,

,

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

An End-to-End Text-Independent Speaker Verification Framework with a Keyword Adversarial Network.

[DOI]

,

,

,

,

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Weakly Labeled Sound Event Detection using Tri-training and Adversarial Learning.

[DOI]

,

,

,

,

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Acoustic Scene Classification Based on a Large-margin Factorized CNN.

[DOI]

,

,

,

,

Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

2017

Speaker Clustering by Iteratively Finding Discriminative Feature Space and Cluster Labels.

[DOI]

,

,

Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

2012

Loss-Scaled Large-Margin Gaussian Mixture Models for Speech Emotion Classification.

[DOI]

,

IEEE Trans. Speech Audio Process., 2012

Phoneme Classification using Constrained Variational Gaussian Process Dynamical System.

[DOI]

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

Joint Kernel Learning for Supervised Image Segmentation.

[DOI]

,

,

,

,

Proceedings of the Computer Vision - ACCV 2012, 2012

2011

Large Margin Discriminative Semi-Markov Model for Phonetic Recognition.

[DOI]

,

,

IEEE Trans. Speech Audio Process., 2011

Learning a discriminative visual codebook using homonym scheme.

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2011

2010

Wearable sensor activity analysis using semi-Markov models with a grammar.

[DOI]

,

,

,

,

,

Matthew W. Robards

,

Alexander J. Smola

,

,

Pervasive Mob. Comput., 2010

Parametric emotional singing voice synthesis.

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2010

Largemargin training of semi-Markov model for phonetic recognition.

[DOI]

,

,

Proceedings of the IEEE International Conference on Acoustics, 2010

2009

Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion model.

[DOI]

,

Proceedings of the IEEE International Conference on Acoustics, 2009

2004

Hybrid utterance verification based on n-best models and model derived from kulback-leibler divergence.

[DOI]

,

,

,

Proceedings of the 8th International Conference on Spoken Language Processing, 2004