User Tools

Site Tools


publications

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revision Both sides next revision
publications [2020/07/09 04:54]
admin
publications [2021/12/28 11:05]
admin
Line 3: Line 3:
 === Notable achievements:​ === === Notable achievements:​ ===
  
-  * 1986: First implementation of Deep Neural Networks ([[https://​www.researchgate.net/​publication/​338188874_Speech_Recognition_with_Associative_Networks|Speech Recognition with Associative Networks.]]) - these had the form x<​sub>​i</​sub>​ = f(∑<​sup>​i-1</​sup>​ w<​sub>​ij</​sub>​ x<​sub>​j</​sub>​) and so were very deep having ​yet trained well using all possible skip connections.+  * 1986: First implementation of Deep Neural Networks ([[https://​www.researchgate.net/​publication/​338188874_Speech_Recognition_with_Associative_Networks|Speech Recognition with Associative Networks.]]) - these had the form x<​sub>​i</​sub>​ = f(∑<​sup>​i-1</​sup>​ w<​sub>​ij</​sub>​ x<​sub>​j</​sub>​) and so were maximally ​deep yet trained well using all possible skip connections.
   * 1987: First publication of Real Time Recurrent Learning ([[https://​www.academia.edu/​30351853/​The_utility_driven_dynamic_error_propagation_network|The utility driven dynamic error propagation network.]] also [[https://​papers.nips.cc/​paper/​42-static-and-dynamic-error-propagation-networks-with-application-to-speech-coding|Static and dynamic error propagation networks with application to speech coding.]])   * 1987: First publication of Real Time Recurrent Learning ([[https://​www.academia.edu/​30351853/​The_utility_driven_dynamic_error_propagation_network|The utility driven dynamic error propagation network.]] also [[https://​papers.nips.cc/​paper/​42-static-and-dynamic-error-propagation-networks-with-application-to-speech-coding|Static and dynamic error propagation networks with application to speech coding.]])
   * 1991: First state-of-the-art ASR with neural networks ([[http://​mi.eng.cam.ac.uk/​reports/​svr-ftp/​auto-pdf/​robinson_tr82.pdf|Several improvements to a recurrent error propagation network phone recognition system]])   * 1991: First state-of-the-art ASR with neural networks ([[http://​mi.eng.cam.ac.uk/​reports/​svr-ftp/​auto-pdf/​robinson_tr82.pdf|Several improvements to a recurrent error propagation network phone recognition system]])
Line 20: Line 20:
   * [[https://​arxiv.org/​abs/​1502.00512|Scaling Recurrent Neural Network Language Models.]] Will Williams, Niranjani Prasad, David Mrva, Tom Ash and Tony Robinson. ​ In Proc. ICASSP, pages 5391-5395, 2015.   * [[https://​arxiv.org/​abs/​1502.00512|Scaling Recurrent Neural Network Language Models.]] Will Williams, Niranjani Prasad, David Mrva, Tom Ash and Tony Robinson. ​ In Proc. ICASSP, pages 5391-5395, 2015.
   * [[http://​patft1.uspto.gov/​netacgi/​nph-Parser?​patentnumber=6675144|Audio coding systems and methods.]] R. C. F. Tucker, C. W. Saymour and A. J. Robinson. Patent US6675144. January 2004.   * [[http://​patft1.uspto.gov/​netacgi/​nph-Parser?​patentnumber=6675144|Audio coding systems and methods.]] R. C. F. Tucker, C. W. Saymour and A. J. Robinson. Patent US6675144. January 2004.
-  * [[https://www.ee.columbia.edu/​~dpwe/pubs/​sprach99.pdf|Connectionist speech recognition of broadcast news.]] A. J. Robinson, G. D. Cook, D. P. W. Ellis, E. Fosler-Lussier,​ S. J. Renals, and D. A. G. Williams. Speech Communication,​ 37(1), 2002. +  * [[https://tonyrobinson.com/_media/​sprach99.pdf|Connectionist speech recognition of broadcast news.]] A. J. Robinson, G. D. Cook, D. P. W. Ellis, E. Fosler-Lussier,​ S. J. Renals, and D. A. G. Williams. Speech Communication,​ 37(1), 2002. 
-  * [[https://www.sciencedirect.com/science/article/​pii/​S0167639300000388|Adaptive model-based speech enhancement.]] Beth Logan and Tony Robinson. Speech Communication,​ 34(4), July 2001. +  * [[https://tonyrobinson.com/_media/loganrobinson00.pdf|Adaptive model-based speech enhancement.]] Beth Logan and Tony Robinson. Speech Communication,​ 34(4), July 2001. 
-  * [[https://www.sciencedirect.com/science/article/​pii/​S0885230800901566|Improved language modelling though better language model evaluation measures.]] Philip Clarkson and Tony Robinson. Computer Speech and Language, 15(1), January 2001. +  * [[https://tonyrobinson.com/_media/clarksonrobinson01.pdf|Improved language modelling though better language model evaluation measures.]] Philip Clarkson and Tony Robinson. Computer Speech and Language, 15(1), January 2001. 
-  * [[http://citeseerx.ist.psu.edu/viewdoc/download?​doi=10.1.1.63.2928&​type=pdf|Indexing and retrieval of broadcast news.]] Steve Renals, Dave Abberley, David Kirby and Tony Robinson. Speech Communication,​ 32(1):5-20, 2000. +  * [[https://tonyrobinson.com/_media/renalsabberleykirbyrobinson00.pdf|Indexing and retrieval of broadcast news.]] Steve Renals, Dave Abberley, David Kirby and Tony Robinson. Speech Communication,​ 32(1):5-20, 2000. 
-  * [[https://pdfs.semanticscholar.org/818b/0245dd240a2a8aaf5e2789e332f4aa34abb0.pdf|Segmentation of a speech waveform according to glottal open and closed phases using an autoregressive-HMM.]] Gavin Smith and Tony Robinson. In Proceedings of the International Conference on Spoken Language Processing, 2000.+  * [[https://tonyrobinson.com/_media/smithrobinson00.pdf|Segmentation of a speech waveform according to glottal open and closed phases using an autoregressive-HMM.]] Gavin Smith and Tony Robinson. In Proceedings of the International Conference on Spoken Language Processing, 2000. 
 +  * [[https://​tonyrobinson.com/​_media/​smithdefreitasrobinsonniranjan00.pdf|Speech Modelling Using Subspace and EM Techniques.]] ​ Gavin Smith, João FG de Freitas Mahesan Niranjan and Tony Robinson. ​ In Advances in Neural Information Processing Systems, 2000.
   * [[https://​ieeexplore.ieee.org/​document/​788164|Subspace techniques in speech enhancement.]] Gavin Smith, Mahesan Niranjan and Tony Robinson. In Neural Networks in Signal Processing 9, 1999.   * [[https://​ieeexplore.ieee.org/​document/​788164|Subspace techniques in speech enhancement.]] Gavin Smith, Mahesan Niranjan and Tony Robinson. In Neural Networks in Signal Processing 9, 1999.
   * [[http://​patft1.uspto.gov/​netacgi/​nph-Parser?​patentnumber=5983180|Recognition of sequential data using finite state sequence models organized in a tree structure.]] A. J. Robinson. Patent US5983180. November 1999.   * [[http://​patft1.uspto.gov/​netacgi/​nph-Parser?​patentnumber=5983180|Recognition of sequential data using finite state sequence models organized in a tree structure.]] A. J. Robinson. Patent US5983180. November 1999.