PATRICK KENNY'S PUBLICATIONS

           Journal Articles

Alam, J., Kenny, P., O'Shaughnessy, D., Regularized Minimum Variance Distortionless Response-Based Cepstral Features for Robust Continuous Speech Recognition, Speech Communication, 73, pp. 28-46, October 2015

Stafylakis, T., Alam, J. and Kenny, P., Text-Dependent Speaker Recognition with Random Digit Strings IEEE Trans. Audio, Speech and Language Proc., 24(7), pp. 1195-1204, July 2016

Stafylakis, T., Kenny, P., Alam, J., and Kockmann, M. Speaker and Channel Factors in Text-Dependent Speaker Recognition IEEE Trans. Audio, Speech and Language Proc., 24(1), pp. 65-78, January 2016

Alam, J., Gupta, V., Kenny, P., and Dumouchel, P., Speech Recognition in Reverberant and Noisy Environments Employing Multiple Feature Extractors and I-Vector Speaker Adaptation, EURASIP Journal on Advances in Signal Processing, pp. 2015-50, 2015

Alam, J., Kenny, P., O'Shaughnessy, D., Robust Feature Extraction Based on an Asymmetric Level-Dependent Auditory Filterbank and a Subband Spectrum Enhancement Technique, Digital Signal Processing, 29, pp. 147-157, June 2014

Senoussaoui, M., Kenny, P., Stafylakis, T., and Dumouchel, P., A Study of the Cosine Distance-Based Mean Shift for Telephone Speech Diarization IEEE Trans. Audio, Speech and Language Proc., 22(1), pp. 217-227, January 2014

Alam, J., Kenny, P., and O'Shaughnessy, D., Low-variance Multitaper Mel-frequency Cepstral Coefficient Features for Speech and Speaker Recognition Systems, Cognitive Computation, Springer, December 2012

Alam, J., Kinnunen, T., Kenny, P., Ouellet, P., and O'Shaughnessy, D., Multitaper MFCC and PLP Features for Speaker Verification Using I-Vectors Speech Communication, 55(2), pp. 237-251, February 2013

Dehak, N., Kenny, P., Dehak, R., Dumouchel, P and Ouellet, P. Front-End Factor Analysis for Speaker Verification IEEE Transactions on Audio, Speech and Language Processing, 19(4), pp. 788-798, May 2011.

Kenny, P., Reynolds, D., and Castaldo, F. Diarization of Telephone Conversations using Factor Analysis IEEE Journal of Selected Topics in Signal Processing, December 2010.

Kenny, P., Ouellet, P., Dehak, N., Gupta, V., and Dumouchel, P. A Study of Inter-Speaker Variability in Speaker Verification IEEE Transactions on Audio, Speech and Language Processing, July 2008.
 
Gupta V., Kenny, P., Ouellet, P., Boulianne, G., and Dumouchel, P. Combining Gaussianized/non-Gaussianized features to improve speaker diarization of telephone conversations IEEE Signal Processing Letters, 14 (12), pp. 1040-1043, Dec. 2007.

Dehak, N., Kenny, P. and P. Dumouchel. Modeling prosodic features with joint factor analysis for speaker verification IEEE Transactions on Audio, Speech and Language Processing, 15 (7), pp. 2095-2103, Sept. 2007.

Yin, S.-C., Rose, R., Kenny, P. and P. Dumouchel.  A Joint factor analysis approach to progressive model adaptation in text independent speaker verification IEEE Transactions on Audio, Speech and Language Processing, 15 (7), pp. 1999-2010, Sept. 2007.

Kenny, P., Boulianne, G., Ouellet, P. and P. Dumouchel. Joint factor analysis versus eigenchannels in speaker recognition IEEE Transactions on Audio, Speech and Language Processing 15 (4), pp. 1435-1447, May 2007. 

Kenny, P., Boulianne, G., Ouellet, P. and P. Dumouchel. Speaker and session variability in GMM-based speaker verification IEEE Transactions on Audio, Speech and Language Processing 15 (4), pp. 1448-1460, May 2007. 

Kenny, P., Boulianne, G. and P. Dumouchel. Eigenvoice Modeling with Sparse Training Data IEEE Transactions on Speech and Audio Processing, 13 (3), pp. 345-359, May 2005.

Kenny, P., Boulianne, G., Ouellet, P. and P. Dumouchel. Speaker Adaptation using an Eigenphone Basis IEEE Transactions on Speech and Audio Processing, 12 (6), pp. 579-589, November 2004.

           Technical Reports

Brummer, N., Swart, A., Jorrin-Prieto, J., Garcia, P., Buera, L. et al., ABC NIST SRE 2016 System Description, San Diego, CA, December 2016 slides poster

 

Stafylakis, T., Kenny, P., Ouellet, P., Perez, J., Kockmann, M. and Dumouchel, P., I-Vector/PLDA Variants for Text-Dependent Speaker Recognition, Montreal, CRIM, June 2013

Kenny, P. Notes on Boltzmann Machines  Montreal, CRIM, May 2011 

Kenny, P. Bayesian Analysis of Speaker Diarization with Eigenvoice Priors  Montreal, CRIM, May 2008  

Kenny, P. Joint factor analysis of speaker and session variability: Theory and algorithms - Technical report CRIM-06/08-13  Montreal, CRIM, 2005

           Conference Papers          

Gautam Bhattacharya, Jahangir Alam, Patrick Kenny and Vishwa Gupta Modelling Speaker and Channel Variability Using Deep Neural Networks for Robust Speaker Verification Proc. IEEE SLT Workshop, San Diego, CA, December 2016

Md. Jahangir Alam, Patrick Kenny, and Vishwa Gupta Tandem Features for Text-Dependent Speaker Verification on the Red Dots Corpus Proc. Interspeech, San Francisco, August 2016

Md. Jahangir Alam, Vishwa Gupta and Patrick Kenny CRIM's Speech Recognition System for the 4th CHIME Challenge Proc. 4th CHIME Challenge, San Francisco, CA 2016

Md. Jahangir Alam, Patrick Kenny, Vishwa Gupta and Themos Stafylakis, Spoofing Detection on the ASVSpoof2015 Challenge Corpus Employing Deep Neural Networks, Proc. Odyssey Speaker and Language Recognition Workshop, Bilbao, Spain, June 2016

Themos Stafylakis, Patrick Kenny, Vishwa Gupta, Md. Jahangir Alam and Marcel Kockmann, Compensation for Phonetic Nuisance Variability in Speaker Recognition Using DNNs, Proc. Odyssey Speaker and Language Recognition Workshop, Bilbao, Spain, June 2016

Gautam Bhattacharya, Md. Jahangir Alam, Themos Stafylakis and Patrick Kenny, Neural Networks for Low-Resource Text-Dependent Speaker Recognition: Preliminary Results, Proc. Odyssey Speaker and Language Recognition Workshop, Bilbao, Spain, June 2016 slides

Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Vishwa Gupta and Marcel Kockmann, Uncertainty Modeling Without Subspace Methods for Text-Dependent Speaker Recognition, Proc. Odyssey Speaker and Language Recognition Workshop, Bilbao, Spain, June 2016 slides

Md. Jahangir Alam, Patrick Kenny and Themos Stafylakis, Combining Amplitude and Phase-Based Features for Speaker Verification with Short Duration Utterances, Proc. Interspeech, Dresden Germany, Sept. 2015 poster

Kong Aik Lee, Anthony Larcher, Guangsen Wang, Patrick Kenny, Niko Brummer, David van Leeuwen, Hagai Aronowitz, Marcel Kockmann, Carlos Vaquero, Bin Ma, Haizhou Li, Themos Stafylakis, Jahangir Alam, Albert Swart and Javier Perez, The RedDots Data Collection for Speaker Recognition, Proc. Interspeech, Dresden Germany, Sept. 2015

Themos Stafylakis, Patrick Kenny, Md. Jahangir Alam and Marcel Kockmann, JFA for Speaker Recognition with Random Digit Strings, Proc. Interspeech, Dresden Germany, Sept. 2015 poster

Patrick Kenny, Themos Stafylakis, Jahangir Alam and Marcel Kockmann, An I-Vector Backend for Speaker Verification, Proc. Interspeech, Dresden Germany, Sept. 2015 slides

Md. Jahangir Alam, Patrick Kenny, Gautam Bhattacharya and Themos Stafylakis, Development of CRIM System for the Automatic Speaker Verification Spoofing and Countermeasures Challenge 2015, Proc. Interspeech, Dresden, Germany, Sept. 2015 poster

Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam and Marcel Kockmann, JFA Modeling with Left-to-Right Structure and a New Backend for Text-Dependent Speaker Recognition Proc. ICASSP, Brisbane, Australia, April 2015, poster

Patrick Kenny, Themos Stafylakis, Md. Jahangir Alam, Pierre Ouellet and Marcel Kockmann, In-Domain versus Out-of-Domain Training for Text-Dependent JFA Proc. INTERSPEECH, Singapore, September 2014.

Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel and Douglas O'Shaughnessy, Noise Spectrum Estimation using Gaussian Mixture Model-based Speech Presence Probability for Robust Speech Recognition, Proc. INTERSPEECH, Singapore, September 2014.

Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel and Douglas O'Shaughnessy, Robust Feature Extractors for Continuous Speech Recognition Proc. EUSIPCO, Lisbon, Portugal, September 2014.

Md. Jahangir Alam, Patrick Kenny, Pierre Dumouchel and Douglas O'Shaughnessy, Robust Speech Recognition Using Warped DFT-Based Cepstral Features in Clean and Multistyle Training Proc. of EUSIPCO, Lisbon, Portugal, 2014.

Kenny, P., Stafylakis, T., Alam, J., Ouellet, P., and Kockmann, M., Joint Factor Analysis For Text-Dependent Speaker Verification Proc. Odyssey Speaker and Language Recognition Workshop, Joensuu, Finland, June 2014 slides

Kenny, P., Gupta, V., Stafylakis, T., Ouellet, P. and Alam, J., Deep Neural Networks for Extracting Baum-Welch Statistics for Speaker Recognition Proc. Odyssey Speaker and Language Recognition Workshop, Joensuu, Finland, June 2014 slides

Alam, J., Kenny, P., Ouellet, P., Stafylakis, T. and Dumouchel, P., Supervised/Unsupervised Voice Activity Detectors for Text-Dependent Speaker Recognition on the RSR2015 Corpus Proc. Odyssey Speaker and Language Recognition Workshop, Joensuu, Finland, June 2014

Martinez Gonzalez, D., Burget, L., Stafylakis, T., Lei, Y., Kenny, P. and Lleida, E., Unscented Transform for iVector-Based Noisy Speaker Recognition Proc. ICASSP 2014, Florence, Italy, May 2014

Kenny, P., Stafylakis, T., Ouellet, P., and Alam, J., JFA-Based Front Ends for Speaker Recognition Proc. ICASSP, May 2014

Gupta, V., Kenny, P., Ouellet, P., and Stafylakis, T., I-Vector Based Speaker Adaptation of Deep Neural Networks for French Broadcast Audio Transcription Proc. ICASSP 2014, Florence, Italy, May 2014

Alam, J., Gupta, V., Kenny, P., Dumouchel, P., Use of Multiple Front-Ends and I-Vector Based Speaker Adaptation for Robust Speech Recognition, Proc. REVERB Challenge, Florence, Italy, May 2014

Alam, J., Attabi, Y., Dumouchel, P., Kenny, P., O'Shaughnessy, D., Amplitude Modulation Features for Emotion Recognition from Speech Proc. Interspeech, Lyon, France, August 2013

Alam, J., Kenny, P., O'Shaughnessy, D., Regularized MVDR Spectrum Estimation-Based Robust Feature Extractors for Speech Recognition Proc. Interspeech, Lyon, France, August 2013

Kinnunen, T., Alam, J., Matejka, P., Kenny, P., Cernocky, J., O'Shaughnessy, D., Frequency Warping and Robust Speaker Verification: A Comparison of Alternative Mel-Scale Representations Proc. Interspeech, Lyon, France, August 2013

Alam, J., Kenny, P., O'Shaughnessy, D., Smoothed Nonlinear Energy Operator Based Amplitude Modulation Features for Robust Speech Recognition Proc. NOLISP, Mons, Belgium, June 2013

Senoussaoui, M., Kenny, P., Dumouchel, P., Dehak, N., New Cosine Similarity Scorings to Implement Gender-Independent Speaker Verification Proc. Interspeech, Lyon, France, August 2013 slides

Stafylakis, T., Kenny, P., Ouellet, P., Perez, J., Kockmann, M., Dumouchel, P., Text-Dependent Speaker Recognition using PLDA with Uncertainty Propagation Proc. Interspeech, Lyon, France, August 2013 poster

Senoussaoui, M., Kenny, P., Dumouchel, P., Stafylakis, T., Efficient Iterative Mean Shift Based Cosine Dissimilarity for Multi-Recording Speaker Clustering Proc. ICASSP, Vancouver, Canada, May 2013

Alam, J., Kenny, P., O'Shaughnessy, D., Speech Recognition using Regularized Minimum Variance Distortionless Response Spectrum Estimation Based Cepstral Features Proc. ICASSP, Vancouver, Canada, May 2013

Alam, J., O'Shaughnessy, D., Kenny, P., A Novel Feature Extractor Employing Regularized MVDR Spectrum Estimator and Subband Spectrum Enhancement Technique Proc. WOSSPA, Algiers, Algeria, May 2013

Stafylakis, T., Kenny, P., Gupta, V., Dumouchel, P., Compensation for inter-frame correlations in speaker diarization and recognition Proc. ICASSP, Vancouver, Canada, May 2013

Attabi, Y., Alam, J., Dumouchel, P., Kenny, P., O'Shaughnessy, D., Multiple Windowed Spectral Features for Emotion Recognition Proc. ICASSP, Vancouver, Canada, May 2013

Kenny, P., Stafylakis, T., Ouellet, P., Alam, J., Dumouchel, P., PLDA for Speaker Verification with Utterances of Arbitrary Duration Proc. ICASSP, Vancouver, Canada, May 2013

Alam, J., Kenny, P., and O'Shaughnessy, D., Robust Speech Recognition under Noisy Environments using Asymmetric Tapers Proc. EUSIPCO, 2012

Alam, J., Kenny, P., and O'Shaughnessy, D., Robust Feature Extraction for Speech Recognition by Enhancing Auditory Spectrum Proc. Interspeech, Portland, Oregon, September 2012

Stafylakis, T., Kenny, P., Senoussaoui, M., and Dumouchel, P., PLDA Using Gaussian Restricted Boltzmann Machines with Application to Speaker Verification Proc. Interspeech, Portland, Oregon September 2012 slides

Stafylakis, T., Katsouros, V., Kenny, P., and Dumouchel, P., Mean Shift Algorithm for Exponential Families with Applications to Speaker Clustering Proc. Odyssey Speaker and Language Recognition Workshop, Singapore, June 2012

Stafylakis, T., Kenny, P., Senoussaoui, M., and Dumouchel, P., Preliminary Investigation of Boltzmann Machine Classifiers for Speaker Recognition Proc. Odyssey Speaker and Language Recognition Workshop, Singapore, June 2012

Alam, J., Kenny, P., and O'Shaughnessy, D., On the Use of Asymmetric-shaped Tapers for Speaker Verification using I-Vectors Proc. Odyssey Speaker and Language Recognition Workshop, Singapore, June 2012 slides

Kenny, P., A Small Footprint i-Vector Extractor Proc. Odyssey Speaker and Language Recognition Workshop, Singapore, June 2012 slides

Senoussaoui, M., Dehak, N., Kenny, P., Dehak, R., and Dumouchel, P., First Attempt at Boltzmann Machines for Speaker Recognition Proc. Odyssey Speaker and Language Recognition Workshop, Singapore, June 2012 slides

Alam, J., Kinnunen, T., Kenny, P., Ouellet, P., and O'Shaughnessy, D., Multi-Taper MFCC Features for Speaker Verification Using I-Vectors Proc. ASRU 2011, Hawaii, December 2011

Alam, J., Ouellet, P., Kenny, P., O'Shaughnessy, D., Comparative Evaluation of Feature Normalization Techniques for Speaker Verification Proc. NOLISP 2011, LNAI 7015, pp. 246-253, Las Palmas, Spain, November 2011

Alam, J., Kenny, P., O'Shaughnessy, D., A Study of Low-Variance Multi-Taper Features for Distributed Speech Recognition Proc. NOLISP 2011, LNAI 7015, pp. 239-245, Las Palmas, Spain, November 2011

Senoussaoui, M., Kenny, P., Brummer, N., de Villiers, E., and Dumouchel, P., Mixture of PLDA Models in I-Vector Space for Gender-Independent Speaker Recognition Proc. Interspeech 2011, Florence, Italy, August 2011 slides

Senoussaoui, M., Kenny, P., Dumouchel, P., and Castaldo, F., Well Calibrated Heavy Tailed Bayesian Speaker Verification for Microphone Speech, Proc ICASSP, Prague, Czech Republic, May 2011

Matejka, P., Glembek, O., Castaldo, F., Alam, J., Plchot, O., Kenny, P., Burget, L., and Cernocky, J., Full Covariance UBM and Heavy-Tailed PLDA in i-vector Speaker Verification, Proc ICASSP, Prague, Czech Republic, May 2011

Glembek, O., Burget, L., Matejka, P., Karafiat, M., and Kenny, P., Simplification and Optimization of i-vector extraction, Proc ICASSP, Prague, Czech Republic, May 2011

Kenny, P., Bayesian Speaker Verification with Heavy-Tailed Priors keynote presentation, Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, June 2010 slides

Dehak, N., Dehak, R., Glass, J., Reynolds, D. and Kenny, P. "Cosine Similarity Scoring without Score Normalization Techniques", Proceedings of Odyssey 2010 - The Speaker and Language Recognition Workshop (Odyssey 2010), pp. 71-75. Brno, Czech Republic, June 28 - July 1, 2010.

Senoussaoui, M., Kenny, P., Dehak, N., and Dumouchel, P., An i-vector Extractor Suitable for Speaker Recognition with both Microphone and Telephone Speech in Proc Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, June 2010

Dehak, N., Dehak, R., Kenny, P., Brummer, N., Ouellet, P and Dumouchel, P., Support Vector Machines versus Fast Scoring in the Low-Dimensional Total Variability Space for Speaker Verification In Proc  Interspeech 2009, Brighton, UK, September 2009

Reynolds, D., Kenny, P., and Castaldo, F., A Study of New Approaches to Speaker Diarization In Proc  Interspeech 2009, Brighton, UK, September 2009

Glembek, O., Burget, L., Dehak, N., Brummer, N., and Kenny, P., Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis In Proc  ICASSP 2009, Taipei, Taiwan, April 2009

Dehak, N., Kenny, P., Dehak, R., Glembek, O., Dumouchel, P., Burget, L., Hubeika, V. and Castaldo, F., Support Vector Machines and Joint Factor Analysis for Speaker Verification In Proc  ICASSP 2009, Taipei, Taiwan, April 2009

Kenny, P, Dehak N, Ouellet, P, Gupta, V, and Dumouchel, P, Development of the Primary CRIM System for the NIST 2008 Speaker Recognition Evaluation In Proc  Interspeech 2008, Brisbane, Australia, Sept 2008


Yin S-C, Rose, R and Kenny, P Adaptive Score Normalization for Progressive Model Adaptation in Text Independent Speaker Verification  In Proc  ICASSP 2008, Las Vegas, Nevada, Mar 2008

Gupta V, Boulianne, G, Kenny, P, Ouellet, P, and Dumouchel, P Speaker Diarization of French Broadcast News In Proc  ICASSP 2008, Las Vegas, Nevada, Mar 2008

Kenny, P, Dehak N, Gupta, V, and Dumouchel, P A New Training Regimen for Factor Analysis of Speaker Variability Mar 2008 

Dehak, N, Dehak, R, Kenny, P and P Dumouchel Comparison between factor analysis and GMM support vector machines for speaker verification  In Proceedings of the IEEE Odyssey Speaker and Language Recognition Workshop 2008 Stellenbosch, South Africa, Jan 2008 

Dehak, R, Dehak, N, Kenny, P and P Dumouchel Kernel Combination for SVM Speaker Verification  In Proceedings of the IEEE Odyssey Speaker and Language Recognition Workshop 2008 Stellenbosch, South Africa, Jan 2008

Kenny, P, Dehak, N, Dehak, R, Gupta, V and P Dumouchel The Role of Speaker Factors in the NIST Extended Data Task  In Proceedings of the IEEE Odyssey Speaker and Language Recognition Workshop 2008 Stellenbosch, South Africa, Jan 2008

Gupta, V, Kenny, P, Ouellet, P, Dehak, R, Boulianne, G and P Dumouchel Multiple Feature Combination to Improve Speaker Diarization of Telephone Conversations  In Proceedings of the IEEE ASRU Workshop 2007 Kyoto, Japan, Dec 2007 

Dehak, N, Kenny, P and P Dumouchel Continuous prosodic features and formant modeling with joint factor analysis for speaker verification  In Proceedings of the International Conference of Interspeech 2007 Antwerp, Belgium, August 2007 

Dehak, R, Dehak, N, Kenny, P and P Dumouchel Linear and non linear kernel GMM support vector machines for speaker verification  In Proceedings of the International Conference of Interspeech 2007 Antwerp, Belgium, August 2007  

Kenny, P, Gupta, V, Boulianne, G, Ouellet, P and P Dumouchel Feature Normalization Using Smoothed Mixture Transformations  In Proceedings of the International Conference on Spoken Language Processing (Interspeech 2006 - ICSLP) Pittsburgh, PA, USA, September 17-21, 2006 

Kenny, P, Boulianne, G, Ouellet, P and P Dumouchel  The Geometry of the Channel Space in GMM-Based Speaker Recognition  In Proceedings of IEEE Odyssey 2006 - The Speaker and Language Recognition Workshop San Juan, Puerto Rico, June 28-30, 2006 

Yin, S-C, Kenny, P and R Rose Experiments in Speaker Adaptation for Factor Analysis Based Speaker Verification  In Proceedings of IEEE Odyssey 2006 - The Speaker and Language Recognition Workshop San Juan, Puerto Rico, June 28-30, 2006 

Kenny, P, Mihoubi, M and P Dumouchel  Improvements in factor analysis based speaker verification  In Proceedings of the 2006 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2006) Toulouse, France, May 14-19, 2006 

Kenny, P, Boulianne, G, Ouellet, P and P Dumouchel Factor Analysis Simplified In Proceedings of the 2005 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP2005), vol 1, pp 637-640 Philadelphia, PA, USA March 18-23, 2005 

Kenny, P, Dumouchel, P Experiments in Speaker Verification using Factor Analysis Likelihood Ratios In Proceedings of Odyssey04 - Speaker and Language Recognition Workshop Toledo, Spain, May 31 - June 3, 2004 

Kenny, P, Dumouchel, P  Disentangling Speaker and Channel Effects in Speaker Verification In Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), vol 1, pp I-37-40 Fairmont Queen Elizabeth Hotel, Montreal, Quebec, Canada, May 17-21, 2004 

Kenny, P, Mihoubi, M., and Dumouchel, P. New MAP Estimators for Speaker Recognition Proceedings of the 8th European Conference on Speech Communication and Technology (Eurospeech 2003), pp 2691-2964 Geneva, Switzerland, September 1-4, 2003

           Presentations

Kenny, P., Bayesian Speaker Verification with Heavy-Tailed Priors Odyssey Speaker and Language Recognition Workshop, Brno, Czech Republic, June 2010

Brummer, N., Glembek, O., Kenny, P, et al., The ABC and CRIM Systems for the NIST 2010 Speaker Recognition Evaluation June 2010