Montevideo, Uruguay · Audio AI · Machine Listening · MIR

Gabriel Bibbó

Audio AI Research Engineer

View projects · LinkedIn · Email me

Projects

Research systems, datasets, demos, and music technology tools.

Peer-reviewed work first, followed by independent systems and personal technical projects.

Audio AI · VAD · Model evaluation

2025-2026

Audio-Language Models for Voice Activity Detection

This research evaluates how audio-language models detect speech when the audio is short, noisy, reverberant, or filtered. The project compares Qwen2-Audio-7B, Qwen2-Audio-7B with LoRA, Qwen3-Omni-30B, and Silero VAD on the same degraded test bank. The best result came from Qwen2-Audio-7B with LoRA and OPRO-Template: 93.3% balanced accuracy on 21,340 degraded clips.
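As background on the metric: balanced accuracy averages the recall on speech clips and the recall on non-speech clips, so a model cannot score well simply by predicting the majority class. The snippet below is a minimal sketch of that definition, not the project's evaluation code; the function name and the toy labels are illustrative assumptions.

```python
def balanced_accuracy(y_true, y_pred):
    """Mean of per-class recall for binary labels: 1 = speech, 0 = non-speech."""
    pairs = list(zip(y_true, y_pred))
    positives = sum(1 for t, _ in pairs if t == 1)
    negatives = len(pairs) - positives
    if positives == 0 or negatives == 0:
        raise ValueError("both classes must be present to compute balanced accuracy")

    true_pos = sum(1 for t, p in pairs if t == 1 and p == 1)
    true_neg = sum(1 for t, p in pairs if t == 0 and p == 0)

    recall_speech = true_pos / positives
    recall_non_speech = true_neg / negatives
    return 0.5 * (recall_speech + recall_non_speech)


# Toy example (hypothetical labels): three speech clips, one non-speech clip.
toy_truth = [1, 1, 1, 0]
toy_preds = [1, 1, 0, 0]
toy_score = balanced_accuracy(toy_truth, toy_preds)
```

In the toy example the speech recall is 2/3 and the non-speech recall is 1, so the score is their mean, 5/6.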

VAD · Qwen · LoRA · OPRO · Silero · PyTorch

Code

Privacy-preserving dataset · Domestic audio

2024

Sounds of Home Dataset

Sounds of Home is a residential audio dataset for sound event detection. It contains 1,344 one-hour recordings from 8 homes in Belgium, captured with AudioMoth recorders placed in living rooms and kitchens. Speech was removed before release, and PANNs predictions were provided for the audio frames.

SED · Privacy · AudioMoth · PANNs · Datasets

Official site · Paper

Privacy · Speech removal · WASPAA

2025

Speech Removal Framework

A framework for removing speech from audio recordings before they are shared or published. The system supports privacy-preserving release workflows for audio datasets while retaining non-speech acoustic information for downstream sound event detection research.

Speech removal · Privacy · WASPAA

Demo · DOI

MIR · Harmonic mixing · MSc thesis

2021-2022

Harmonic EDM Mixing Compatibility

This music analysis system estimates how well two EDM tracks mix harmonically. It analyzes tracks, computes chroma features, converts them into Tonal Interval Vectors, compares harmonic compatibility, and suggests pitch shifts that can improve a mix. The work began as an MSc thesis and later became an ICWE 2022 publication.
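The chroma-to-TIV step and the pitch-shift search can be illustrated with a small sketch. It is not the thesis measure: it treats a Tonal Interval Vector as the first six DFT coefficients of a normalized 12-bin chroma vector, uses placeholder uniform weights rather than the published perceptual weights, and scores compatibility with a plain Euclidean distance. All function names are hypothetical.

```python
import cmath
import math


def tiv(chroma, weights=(1.0,) * 6):
    """DFT coefficients k = 1..6 of a normalized 12-bin chroma vector.

    `weights` defaults to uniform placeholders (an assumption for this sketch);
    a real TIV applies perceptually derived weights per coefficient.
    """
    total = sum(chroma)
    if len(chroma) != 12 or total <= 0:
        raise ValueError("expected a non-empty 12-bin chroma vector")
    norm = [c / total for c in chroma]
    return [
        w * sum(c * cmath.exp(-2j * math.pi * k * n / 12) for n, c in enumerate(norm))
        for k, w in zip(range(1, 7), weights)
    ]


def tiv_distance(a, b):
    """Euclidean distance between two TIVs; smaller means closer harmonically."""
    return math.sqrt(sum(abs(x - y) ** 2 for x, y in zip(a, b)))


def best_pitch_shift(chroma_a, chroma_b):
    """Try all 12 upward transpositions of track B and keep the closest to A."""
    chroma_b = list(chroma_b)
    reference = tiv(chroma_a)
    candidates = []
    for shift in range(12):
        # Rotate right: pitch class i of B moves to (i + shift) mod 12.
        shifted = chroma_b[-shift:] + chroma_b[:-shift]
        candidates.append((tiv_distance(reference, tiv(shifted)), shift))
    distance, shift = min(candidates)
    return shift, distance


# Toy chroma vectors: a C major triad and a D major triad.
c_major = [1, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0]
d_major = [0, 0, 1, 0, 0, 0, 1, 0, 0, 1, 0, 0]
shift, distance = best_pitch_shift(c_major, d_major)
```

For the toy triads the search returns a shift of 10 semitones up, which is the same pitch class move as 2 semitones down, bringing D major onto C major.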

MIR · EDM · Chroma · TIV · Essentia · librosa

Code · Paper

Speech enhancement · Backend platform

2026

ASR Enhancement Platform

The ASR Enhancement Platform compares two speech recognition paths on the same audio: raw transcription and enhance-and-transcribe. The backend stores jobs, audio files, transcripts, and provider payloads so each result can be inspected later. It uses FastAPI, Celery, PostgreSQL, Redis, MinIO, Docker Compose, metrics, tracing, Grafana, and CI.

ASR · FastAPI · Celery · Docker · PostgreSQL · Redis

Code

MIR · DJ library organization

2026

Traktor ML

Traktor ML turns a local Techno and Tech House library into Traktor-ready playlists. The pipeline extracts MERT embeddings, separates stems with Demucs, reads BPM and key metadata with Essentia, clusters similar tracks, orders them for smoother transitions, and exports M3U playlists. The current V4 run processed 239 tracks and exported 14 playlists. The private audio collection is not included in the repo.
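One simple way to order a cluster for smoother transitions is a greedy nearest-neighbor walk over track embeddings. The sketch below shows that idea only; it is an assumption for illustration, not the ordering logic in the repo, and the function names and two-dimensional toy vectors are hypothetical stand-ins for real MERT embeddings.

```python
import math


def cosine_distance(u, v):
    """1 minus cosine similarity; 0 means the vectors point the same way."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def greedy_order(embeddings, start=0):
    """Start at one track, then repeatedly hop to the closest unvisited track."""
    remaining = set(range(len(embeddings))) - {start}
    order = [start]
    while remaining:
        last = embeddings[order[-1]]
        # Ties are broken by track index so the result is deterministic.
        nxt = min(remaining, key=lambda i: (cosine_distance(last, embeddings[i]), i))
        order.append(nxt)
        remaining.remove(nxt)
    return order


# Toy embeddings (hypothetical): four tracks in a 2-D space.
toy_embeddings = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1], [0.5, 0.5]]
playlist_order = greedy_order(toy_embeddings)
```

With the toy vectors the walk visits tracks 0, 2, 3, 1, moving gradually from one corner of the space to the other. A greedy walk is cheap but not globally optimal; it can leave an awkward jump at the end of the playlist.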

MERT · Demucs · Essentia · HDBSCAN · UMAP · Streamlit

Code

ALPACA

2026

Python-based algorithmic trading platform with market data ingestion, risk controls, backtesting, and real-time monitoring.

Python · Backtesting · Monitoring

Code

Raspberry Pi Sound Event Recognition Demo

2023

Raspberry Pi demo for real-time sound event recognition. The system runs pre-trained neural networks on a low-cost edge device, exposes a web interface, and can send email notifications when selected AudioSet events are detected.

Raspberry Pi · Edge AI · AudioSet

Code · Video

3H-ATO

2020

Mechanical tool designed during the pandemic to avoid touching shared surfaces directly. It is a physical prototyping project, not an AI project.

Product design · Prototyping

Video

Automatic IoT Soap Dispenser

2020

IoT handwashing device for industrial environments. The device used stainless steel, WiFi, cloud connectivity, IR/RFID sensors, and a 3-litre tank.

IoT · Sensors · Industrial hygiene

UyVoy Mobile App

2020

Mobile app project for booking appointments and reducing crowding during the pandemic. My role was Project Manager.

Mobile app · Civic tech · Project management

Research / Publications

Publications and works, ordered by year.

2025

Privacy for Audio AI: Risks, Challenges, and Emerging Solutions in the Era of Audio AI [Panel discussion]

Thomas Deacon; Jennifer Williams; Jason R. C. Nurse; Christopher Hicks; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

2025 AES International Conference on Artificial Intelligence and Machine Learning for Audio

Identifier · AES program

Speech Removal Framework for Privacy-preserving Audio Recordings

Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Tahoe City, CA, October 2025

DOI · Demo

Room Acoustics and Microphone Characteristics Show Systematic Impact on Sound Event Recognition

Gabriel Bibbó; Craig Cieciura; Mark D. Plumbley

Proceedings of the 54th International Congress and Exposition on Noise Control Engineering, São Paulo, Brazil, August 2025

ISBN

Integrating IP broadcasting with audio tags: Workflow and challenges

Rhys Burchett-Vass; Arshdeep Singh; Gabriel Bibbó; Mark D. Plumbley

2025 AES International Conference on Artificial Intelligence and Machine Learning for Audio

Open research · Preprint

Soundscape Experience Mapping: A Deep Listening Approach for Eliciting Older Adults' Perceptions of Indoor Soundscapes

Thomas Deacon; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

Forum Acusticum / Euronoise 2025, Málaga, Spain, June 2025

Link

Personalized Live Sound Recognition Using Efficient PANNs [Show and Tell]

Arshdeep Singh; Gabriel Bibbó; Thomas Deacon; Haohe Liu; Mark D. Plumbley

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, April 2025

Link

2024

Environmental sound classification on an embedded hardware platform

Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

INTER-NOISE and NOISE-CON Congress and Conference Proceedings, Nantes, France, August 2024

DOI

The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection

Gabriel Bibbó; Thomas Deacon; Arshdeep Singh; Mark D. Plumbley

8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024), Kos Island, Greece, September 2024

DOI · Dataset site

Soundscape Personalisation at Work: Designing AI-Enabled Sound Technologies for the Workplace

Thomas Deacon; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

International Conference on Sound and Music Computing (SMC 2024), Porto, Portugal, July 2024

Paper

2023

Recognise and Notify Sound Events Using a Raspberry PI Based Standalone Device [Demo]

Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2023), New York, USA, October 2023

DOI · Video

2022

A New Compatibility Measure for Harmonic EDM Mixing

Gabriel Bibbó

International Conference on Web Engineering (ICWE 2022), Bari, Italy, July 2022

DOI

2021

Towards a New Compatibility Measure for Harmonic EDM Mixing

Gabriel Bibbó

Master's thesis, Universitat Pompeu Fabra, Barcelona, Spain, 2021

Repository

2017

Autonomous Mobile Robots Communicated by Software Defined Radio

Gabriel Bibbó

Bachelor's thesis, Universidad de la República, Montevideo, Uruguay, 2017

Publication

Experience

Employment experience.

Dec. 2025-Present

Visiting Researcher (collaboration)

University of Surrey, Remote

Preparing an IEEE/ACM TASLP article with Mark D. Plumbley and Simone Spagnol (Università Iuav di Venezia) on VAD with the Qwen-Audio family under psychoacoustic degradations, using PEFT/LoRA, OPRO prompt optimization, 4-bit NF4 quantization, and evaluation against a frozen Qwen3-Omni baseline.
Co-authoring IEEE Signal Processing Magazine article with Arshdeep Singh (King’s College London) on privacy-preserving audio and machine listening.

Nov. 2022-Nov. 2025

Research Engineer in Sound Sensing

University of Surrey, Guildford, UK

Developed end-to-end audio ML systems for real-world smart environments, covering data preparation, model evaluation, prototype deployment, open-source releases, demos, datasets and technical documentation for assisted living, smart buildings and urban sound monitoring.
Built privacy-preserving SED pipelines for sensitive in-home recordings, including a 197 GB residential audio dataset, speech-removal workflows and reproducible evaluation resources.
Designed Slurm-based VAD pipelines benchmarking 8 models under controlled acoustic degradations, with robustness analysis and statistical comparison across model families.
Deployed real-time CNN inference on Raspberry Pi, including quantization, thermal profiling, power-aware evaluation and edge sound-sensing documentation.
Published and presented research at IEEE WASPAA, CHiME Workshop, ICWE, Inter-Noise, SMC, UKAI, UKIS and AES. Supervised undergraduate and master’s projects.

Mar. 2022-Nov. 2022

Technical Support Engineer - Google Workspace

Webhelp, Barcelona, Spain

Tier 3 support for Google Workspace enterprise customers across APIs, OAuth, SAML/SSO, IAM, user provisioning, data migration, DNS/domain configuration, and security/compliance settings.

Nov. 2021-Mar. 2022

IT Auditor

KPMG, Barcelona, Spain

Provided audit support to telecommunications companies and IT departments.

Apr. 2016-Dec. 2019

R&D Engineer

Ikatu, Montevideo, Uruguay

Designed and shipped embedded C/C++ audio and IoT firmware for Bang & Olufsen home automation products: low-level drivers, hardware integration, audio I/O, and Internet connectivity.
Owned product lifecycle work across requirements, architecture, implementation, testing, validation, and customer-facing documentation.
Trained and onboarded incoming programmers on embedded development practices.

Formal Education

Formal studies.

2020-2021

MSc Sound and Music Computing

Universitat Pompeu Fabra, Barcelona, Spain

Master's thesis on harmonic compatibility for EDM mixing. Final thesis grade: 9/10.

2010-2017

BSc Electrical Engineering

Universidad de la República, Montevideo, Uruguay

Bachelor's thesis on autonomous mobile robots communicating via software-defined radio.

Technical Stack

Tools and methods used across research, software, audio, and deployment work.

Stack

Python · C/C++ · PyTorch · Hugging Face · PEFT · TorchAudio · librosa · Essentia · scikit-learn · pandas · NumPy · SciPy · Flask · Streamlit · Hugging Face Spaces · Docker · Git · Linux CLI · Bash · Slurm · Redis Streams · Prometheus · Grafana · SQLite · MATLAB · Claude Code / VS Code

ML

CNNs · Transformers · Audio-Language Models · LoRA Fine-tuning · 4-bit Quantization · Supervised and Self-supervised Learning · Evaluation Pipelines · Statistical Testing · Edge Deployment

Audio

Sound Event Detection · Voice Activity Detection · Music Information Retrieval · Digital Signal Processing · Real-Time Audio · DAWs · Ableton · DJing · Electronic Music Production

Practice

Reproducible ML pipelines · Dataset Curation · Open-Source Development · MLOps practices · AI-assisted Development · Technical Writing · Interdisciplinary Collaboration

Additional Information

Languages, certifications, memberships, and grants.

Languages

Spanish: Native · English: C1 · Portuguese: A2

Certifications

PRINCE2 Foundation · Deep Learning Specialization · Machine Learning (Stanford / Coursera)

Music and memberships

Music school: Virgilio Scarabelli Alberti · IEEE Signal Processing Society member

Research grants

Participant in EPSRC AI for Sound

Contact

Get in touch.

Italian citizen with EU work authorization. Open to remote roles in LATAM/Europe and selected relocation opportunities within the EU.

Email: gabobibbo@gmail.com

Email me · LinkedIn

GitHub · Scholar · ORCID