Montevideo, Uruguay · Audio AI · Machine Listening · MIR

Audio AI Research Engineer bridging machine listening research, experimental software, and deployable prototypes.

I research, implement, evaluate, and deploy audio machine learning systems. My work connects sound event detection, voice activity detection, privacy-preserving audio, embedded ML, and music information retrieval.

View projects LinkedIn Email me

Projects

Research systems, datasets, demos, and music technology tools.

A mix of peer-reviewed research work, deployable prototypes, and independent technical projects.

Audio AI · VAD · Model evaluation

Audio-Language Models for Voice Activity Detection

I tested how audio-language models detect speech when the audio is short, noisy, reverberant, or filtered. The project compares Qwen2-Audio-7B, Qwen2-Audio-7B with LoRA, Qwen3-Omni-30B, and Silero VAD on the same degraded test bank. The best result came from Qwen2-Audio-7B with LoRA and OPRO-Template: 93.3% balanced accuracy on 21,340 degraded clips.

VADQwenLoRAOPROSileroPyTorch

Code

Speech enhancement · Backend platform

ASR Enhancement Platform

I built a backend platform to compare two ASR paths on the same audio: raw transcription and enhance-and-transcribe. The system stores jobs, audio files, transcripts, and provider payloads so each result can be inspected later. It uses FastAPI, Celery, PostgreSQL, Redis, MinIO, Docker Compose, metrics, tracing, Grafana, and CI.

ASRFastAPICeleryDockerPostgreSQLRedis

Code

Privacy-preserving dataset · Domestic audio

Sounds of Home Dataset

I worked on Sounds of Home, a residential audio dataset for sound event detection. The dataset contains 1,344 one-hour recordings from 8 homes in Belgium. AudioMoth recorders were placed in living rooms and kitchens. Speech was removed before release, and PANNs predictions were provided for the audio frames.

SEDPrivacyAudioMothPANNsDatasets

Paper

MIR · Harmonic mixing · MSc thesis

Harmonic EDM Mixing Compatibility

I built a music analysis system for estimating how well two EDM tracks mix harmonically. The system analyzes tracks, computes chroma features, converts them into Tonal Interval Vectors, compares harmonic compatibility, and suggests pitch shifts that can improve a mix. This was my MSc thesis work and later became an ICWE 2022 publication.

MIREDMChromaTIVEssentialibrosa

Code Paper

MIR · DJ library organization

Traktor ML

I built a pipeline that turns a local Techno and Tech House library into Traktor-ready playlists. The system extracts MERT embeddings, separates stems with Demucs, reads BPM and key metadata with Essentia, clusters similar tracks, orders them for smoother transitions, and exports M3U playlists. The current V4 run processed 239 tracks and exported 14 playlists. The private audio collection is not included in the repo.

MERTDemucsEssentiaHDBSCANUMAPStreamlit

Code

Speech Removal Framework

Framework for removing speech from audio recordings before they are shared or published. It belongs to the privacy-preserving audio line of work and is linked to the WASPAA 2025 demo/publication.

Speech removalPrivacyWASPAA

Demo DOI

ALPACA

Python-based algorithmic trading platform with market data ingestion, risk controls, backtesting, and real-time monitoring. Kept as a secondary project because it shows backend and system design outside Audio AI.

PythonBacktestingMonitoring

Code

Raspberry Pi Sound Event Recognition Demo

Raspberry Pi demo for real-time sound event recognition. The system runs pre-trained neural networks on a low-cost edge device, exposes a web interface, and can send email notifications when selected AudioSet events are detected.

Raspberry PiEdge AIAudioSet

Code Video

3H-ATO

Mechanical tool designed during the pandemic to avoid touching shared surfaces directly. It is a physical prototyping project, not an AI project.

Product designPrototyping

Video

Automatic IoT Soap Dispenser

IoT handwashing device for industrial environments. The device used stainless steel, WiFi, cloud connectivity, IR/RFID sensors, and a 3-litre tank.

IoTSensorsIndustrial hygiene

UyVoy Mobile App

Mobile app project for booking appointments and reducing crowding during the pandemic. My role is shown as Project Manager.

Mobile appCivic techProject management

Research / Publications

Publications and works, ordered by year.

2025

Privacy for Audio AI: Risks, Challenges, and Emerging Solutions in the Era of Audio AI [Panel discussion]

Thomas Deacon; Jennifer Williams; Jason R. C. Nurse; Christopher Hicks; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

2025 AES International Conference on Artificial Intelligence and Machine Learning for Audio

Identifier AES program

Speech Removal Framework for Privacy-preserving Audio Recordings

Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Tahoe City, CA, October 2025

DOI Demo

Room Acoustics and Microphone Characteristics Show Systematic Impact on Sound Event Recognition

Gabriel Bibbó; Craig Cieciura; Mark D. Plumbley

Proceedings of the 54th International Congress and Exposition on Noise Control Engineering, São Paulo, Brazil, August 2025

ISBN

Integrating IP broadcasting with audio tags: Workflow and challenges

Rhys Burchett-Vass; Arshdeep Singh; Gabriel Bibbó; Mark D. Plumbley

2025 AES International Conference on Artificial Intelligence and Machine Learning for Audio

Open research Preprint

Soundscape Experience Mapping: A Deep Listening Approach for Eliciting Older Adults' Perceptions of Indoor Soundscapes

Thomas Deacon; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

Forum Acusticum / Euronoise 2025, Málaga, Spain, June 2025

Link

Personalized Live Sound Recognition Using Efficient PANNs [Show and Tell]

Arshdeep Singh; Gabriel Bibbó; Thomas Deacon; Haohe Liu; Mark D. Plumbley

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, April 2025

Link

2024

Environmental sound classification on an embedded hardware platform

Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

INTER-NOISE and NOISE-CON Congress and Conference Proceedings, Nantes, France, August 2024

DOI

The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection

Gabriel Bibbó; Thomas Deacon; Arshdeep Singh; Mark D. Plumbley

8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024), Kos Island, Greece, September 2024

DOI

Soundscape Personalisation at Work: Designing AI-Enabled Sound Technologies for the Workplace

Thomas Deacon; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

International Conference on Sound and Music Computing (SMC 2024), Porto, Portugal, July 2024

Paper

2023

Recognise and Notify Sound Events Using a Raspberry PI Based Standalone Device [Demo]

Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley

IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2023), New York, U.S.A, October 2023

DOI Video

2022

A New Compatibility Measure for Harmonic EDM Mixing

Gabriel Bibbó Frau; Ángel Faraldo

International Conference on Web Engineering (ICWE 2022), Springer, Bari, Italy, July 2022

DOI

2021

Towards a New Compatibility Measure for Harmonic EDM Mixing

Gabriel Bibbó; Angel Faraldo

Dissertation or Thesis, Universitat Pompeu Fabra, October 2021

DOI

2017

Autonomous Mobile Robots Comunicated by Software Defined Radio

Gabriel Bibbó; Mariana Gelós; Martín Randall; Pablo Belzarena; Federico Larroca

Dissertation or Thesis, Universidad de la República, December 2017

Link

Experience

Employment experience.

Dec.2025-Present

Visiting Researcher (collaboration)

University of Surrey, Remote

Preparing IEEE/ACM TASLP article with Mark D. Plumbley and Simone Spagnol (Università Iuav di Venezia) on VAD with Qwen-Audio family under psychoacoustic degradations, using PEFT/LoRA, OPRO prompt optimization, 4-bit NF4 quantization, evaluation against frozen Qwen3-Omni baseline.
Co-authoring IEEE Signal Processing Magazine article with Arshdeep Singh (King’s College London) on privacy-preserving audio and machine listening.

Nov.2022-Nov.2025

Research Engineer in Sound Sensing

University of Surrey, Guildford, UK

Developed end-to-end audio ML systems for real-world smart environments, covering data preparation, model evaluation, prototype deployment, open-source releases, demos, datasets and technical documentation for assisted living, smart buildings and urban sound monitoring.
Built privacy-preserving SED pipelines for sensitive in-home recordings, including a 197 GB residential audio dataset, speech-removal workflows and reproducible evaluation resources.
Designed Slurm-based VAD pipelines benchmarking 8 models under controlled acoustic degradations, with robustness analysis and statistical comparison across model families.
Deployed real-time CNN inference on Raspberry Pi, including quantization, thermal profiling, power-aware evaluation and edge sound-sensing documentation.
Published and presented research at IEEE WASPAA, CHiME Workshop, ICWE, Inter-Noise, SMC, UKAI, UKIS and AES. Supervised undergraduate and master’s projects.

Mar.2022-Nov.2022

Technical Support Engineer - Google Workspace

Webhelp, Barcelona, Spain

Tier 3 support for Google Workspace enterprise customers across APIs, OAuth, SAML/SSO, IAM, user provisioning, data migration, DNS/domain configuration, and security/compliance settings.

Nov.2021-Mar.2022

IT Auditor

KPMG, Barcelona, Spain

Support to telecommunications companies and IT departments in audit services.

Apr.2016-Dec.2019

R&D Engineer

Ikatu, Montevideo, Uruguay

Designed and shipped embedded C/C++ audio and IoT firmware for Bang & Olufsen home automation products: low-level drivers, hardware integration, audio I/O, and Internet connectivity.
Owned product lifecycle work across requirements, architecture, implementation, testing, validation, and customer-facing documentation.
Trained and onboarded incoming programmers on embedded development practices.

Technical Stack

Tools and methods used across research, software, audio, and deployment work.

Stack

PythonC/C++PyTorchHugging FacePEFTTorchAudiolibrosaEssentiascikit-learnpandasNumPySciPyFlaskStreamlitHugging Face SpacesDockerGitLinux CLIBashSlurmRedis StreamsPrometheusGrafanaSQLiteMATLABClaude Code / VS Code

ML

CNNsTransformersAudio-Language ModelsLoRA Fine-tuning4-bit QuantizationSupervised and Self-supervised LearningEvaluation PipelinesStatistical TestingEdge Deployment

Audio

Sound Event DetectionVoice Activity DetectionMusic Information RetrievalDigital Signal ProcessingReal-Time AudioDAWsAbletonDJingElectronic Music Production

Practice

Reproducible ML pipelinesDataset CurationOpen-Source DevelopmentMLOps practicesAI-assisted DevelopmentTechnical WritingInterdisciplinary Collaboration

Contact

Get in touch.

Italian citizen with EU work authorization. Open to remote roles in LATAM/Europe and selected relocation opportunities within the EU.

Email: gabobibbo@gmail.com

Email me LinkedIn

GitHub Scholar ORCID