publications

Differences

This shows you the differences between two versions of the page.

publications [2018/12/08 09:17]
admin
publications [2020/07/09 04:54]
admin
Line 1: Line 1:
 ===== Publications ===== ===== Publications =====
  
-See also [[https://scholar.google.co.uk/citations?user=UPV1LHUAAAAJ&hl=en|Google Scholar]], [[https://www.researchgate.net/profile/Tony_Robinson/publications|ResearchGate]] and [[https://www.semanticscholar.org/author/Tony-Robinson/1742043|Semantic Scholar]].
+=== Notable achievements ===
  
-IN PROGRESS:  Email me if you want copy of something that doesn't yet have PDF.
+  * 1986: First implementation of Deep Neural Networks ([[https://www.researchgate.net/publication/338188874_Speech_Recognition_with_Associative_Networks|Speech Recognition with Associative Networks.]]) - these had the form x<sub>i</sub> = f(∑<sup>i-1</sup> w<sub>ij</sub> x<sub>j</sub>) and so were very deep, yet trained well, using all possible skip connections.
 +  * 1987: First publication of Real Time Recurrent Learning ([[https://www.academia.edu/30351853/The_utility_driven_dynamic_error_propagation_network|The utility driven dynamic error propagation network.]] and [[https://papers.nips.cc/paper/42-static-and-dynamic-error-propagation-networks-with-application-to-speech-coding|Static and dynamic error propagation networks with application to speech coding.]])
 +  * 1991: First state-of-the-art ASR with neural networks ([[http://mi.eng.cam.ac.uk/reports/svr-ftp/auto-pdf/robinson_tr82.pdf|Several improvements to a recurrent error propagation network phone recognition system]])
 +  * 1992: First real time large vocabulary continuous speech recognition system (Resource Management on DSP32C and SPARCstation)
 +  * 1994: Shorten - the lossless audio compressor ([[http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.53.7337|SHORTEN: Simple lossless and near-lossless waveform compression.]] and [[wp>Shorten_(file_format)]])
 +  * 1996: First end-to-end training of neural nets and HMMs ([[ftp://mi.eng.cam.ac.uk/pub/reports/auto-pdf/senior_fbrnn.pdf|Forward-backward retraining of recurrent neural networks]])
 +  * 1999: The time-first decoder ([[http://patft1.uspto.gov/netacgi/nph-Parser?patentnumber=5983180|Recognition of sequential data using finite state sequence models organized in tree structure.]] and [[https://www.semanticscholar.org/paper/Time-first-search-for-large-vocabulary-speech-Robinson-Christie/123dbf5729b147abba09a5fe59dda454f09be0d2|Time-first search for large vocabulary speech recognition.]])
 +  * As supervisor to MPhil students: 
 +    * First speech editor - edit audio as text 
 +    * First editor for correcting speech recognition transcripts 
 +    * First automatically scrolling teleprompter (Autocue)
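
The 1986 entry in the list above gives the network form x<sub>i</sub> = f(∑<sup>i-1</sup> w<sub>ij</sub> x<sub>j</sub>), in which every unit reads from all earlier units. The following is a minimal sketch of that forward pass only, not code from the thesis: the function name `forward`, the choice of tanh for f, the absence of bias terms, and the example weights are all assumptions made for illustration.

```python
import math


def forward(inputs, weights, f=math.tanh):
    """Evaluate x_i = f(sum over j < i of w_ij * x_j) for each non-input unit.

    inputs  : activations of the first units, which are given rather than computed
    weights : one row per computed unit; row k holds a weight for every
              unit that precedes it (inputs and earlier computed units)
    f       : activation function (tanh here is an assumption)
    """
    x = list(inputs)
    for row in weights:
        # Each new unit is connected to every unit before it, so the
        # row length must match the number of units evaluated so far.
        if len(row) != len(x):
            raise ValueError("weight row must cover all preceding units")
        x.append(f(sum(w * xj for w, xj in zip(row, x))))
    return x


# Two input units followed by three computed units (hypothetical weights).
example_weights = [
    [0.5, -0.5],
    [0.1, 0.2, 0.3],
    [0.0, 0.0, 0.0, 1.0],
]
activations = forward([1.0, 2.0], example_weights)
```

Because each weight row grows by one entry, the last unit sits at the end of a chain as long as the network itself while still having a direct connection to the inputs, which is the "very deep" with "all possible skip connections" structure the entry describes.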
  
 +
 +=== Full list: ===
   * [[https://​patents.google.com/​patent/​WO2017077330A1/​en|Speech processing system and method.]] T. W. J. Ash and A. J. Robinson. Patent application PCT/​GB2016/​053456. ​ November 2016.   * [[https://​patents.google.com/​patent/​WO2017077330A1/​en|Speech processing system and method.]] T. W. J. Ash and A. J. Robinson. Patent application PCT/​GB2016/​053456. ​ November 2016.
   * [[https://​arxiv.org/​abs/​1502.00512|Scaling Recurrent Neural Network Language Models.]] Will Williams, Niranjani Prasad, David Mrva, Tom Ash and Tony Robinson. ​ In Proc. ICASSP, pages 5391-5395, 2015.   * [[https://​arxiv.org/​abs/​1502.00512|Scaling Recurrent Neural Network Language Models.]] Will Williams, Niranjani Prasad, David Mrva, Tom Ash and Tony Robinson. ​ In Proc. ICASSP, pages 5391-5395, 2015.
Line 33: Line 45:
   * Andrew Senior and Tony Robinson. Online cursive handwriting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence,​ 20(3):​309-321,​ 1998.   * Andrew Senior and Tony Robinson. Online cursive handwriting recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence,​ 20(3):​309-321,​ 1998.
   * [[https://​www.semanticscholar.org/​paper/​Time-first-search-for-large-vocabulary-speech-Robinson-Christie/​123dbf5729b147abba09a5fe59dda454f09be0d2|Time-first search for large vocabulary speech recognition.]] Tony Robinson and James Christie. In Proc. ICASSP, pages 829-832, 1998.   * [[https://​www.semanticscholar.org/​paper/​Time-first-search-for-large-vocabulary-speech-Robinson-Christie/​123dbf5729b147abba09a5fe59dda454f09be0d2|Time-first search for large vocabulary speech recognition.]] Tony Robinson and James Christie. In Proc. ICASSP, pages 829-832, 1998.
-  * Gary Cook and Tony Robinson. Transcribing broadcast news with the 1997 Abbot system. In Proc. ICASSP , pages 917-920, 1998.
+  * Gary Cook and Tony Robinson. Transcribing broadcast news with the 1997 Abbot system. In Proc. ICASSP, pages 917-920, 1998.
 +  * [[https://www.researchgate.net/publication/338188472_Joint_Prediction_and_Vector_Quantisation|Joint Prediction and Vector Quantisation.]] Carl Seymour and Tony Robinson. 1997.
   * [[https://​pdfs.semanticscholar.org/​36ba/​e1749b0abc2e91945cd08d392ebcb712bbd9.pdf|The 1997 Abbot system for the transcription of broadcast news.]] G. D. Cook and A. J. Robinson. In Proc. of the Broadcast News Transcription and Understanding workshop, pages 49-54. Morgan Kaufmann, February 1998.   * [[https://​pdfs.semanticscholar.org/​36ba/​e1749b0abc2e91945cd08d392ebcb712bbd9.pdf|The 1997 Abbot system for the transcription of broadcast news.]] G. D. Cook and A. J. Robinson. In Proc. of the Broadcast News Transcription and Understanding workshop, pages 49-54. Morgan Kaufmann, February 1998.
   * Dave Abberley, Steve Renals, Gary Cook, and Tony Robinson. The THISL spoken document retrieval system. In TREC-6 Proceedings , 1998.   * Dave Abberley, Steve Renals, Gary Cook, and Tony Robinson. The THISL spoken document retrieval system. In TREC-6 Proceedings , 1998.
Line 40: Line 53:
   * [[http://​citeseerx.ist.psu.edu/​viewdoc/​download?​doi=10.1.1.49.8389&​rep=rep1&​type=pdf|Transcription of broadcast television and radio news: The 1996 Abbot system.]] ​ G. D. Cook, D. J. Kershaw, J. D. M. Christie, and A. J. Robinson. In Proc. of DARPA Speech Recognition Workshop , pages 79-84. Morgan Kaufmann, February 1997.   * [[http://​citeseerx.ist.psu.edu/​viewdoc/​download?​doi=10.1.1.49.8389&​rep=rep1&​type=pdf|Transcription of broadcast television and radio news: The 1996 Abbot system.]] ​ G. D. Cook, D. J. Kershaw, J. D. M. Christie, and A. J. Robinson. In Proc. of DARPA Speech Recognition Workshop , pages 79-84. Morgan Kaufmann, February 1997.
   * [[http://​citeseerx.ist.psu.edu/​viewdoc/​download?​doi=10.1.1.52.6985&​rep=rep1&​type=pdf|Ensemble methods for connectionist acoustic modelling.]] G. D. Cook, S. R. Waterhouse, and A. J. Robinson. In Proceedings of the European Conference on Speech Technology , volume 4, pages 1959-1962, September 1997.   * [[http://​citeseerx.ist.psu.edu/​viewdoc/​download?​doi=10.1.1.52.6985&​rep=rep1&​type=pdf|Ensemble methods for connectionist acoustic modelling.]] G. D. Cook, S. R. Waterhouse, and A. J. Robinson. In Proceedings of the European Conference on Speech Technology , volume 4, pages 1959-1962, September 1997.
-  * C. W. Seymour and A. J. Robinson. A low-bit-rate speech coder using adaptive line spectral frequency prediction. In Proceedings of the European Conference on Speech Technology , volume 3, pages 1319{1322, September 1997.
+  * C. W. Seymour and A. J. Robinson. A low-bit-rate speech coder using adaptive line spectral frequency prediction. In Proceedings of the European Conference on Speech Technology, volume 3, pages 1319-1322, September 1997.
   * [[http://​citeseerx.ist.psu.edu/​viewdoc/​summary?​doi=10.1.1.46.1598|A segmental formant vocoder based on linearly varying mixtures of Gaussians.]] Parham Zolfaghari and Tony Robinson. ​ In Proceedings of the European Conference on Speech Technology, volume 1, pages 425-428, September 1997.   * [[http://​citeseerx.ist.psu.edu/​viewdoc/​summary?​doi=10.1.1.46.1598|A segmental formant vocoder based on linearly varying mixtures of Gaussians.]] Parham Zolfaghari and Tony Robinson. ​ In Proceedings of the European Conference on Speech Technology, volume 1, pages 425-428, September 1997.
   * [[ftp://​svr-ftp.eng.cam.ac.uk/​pub/​reports/​auto-pdf/​logan_euro97.pdf|Improving autoregressive hidden Markov model recognition accuracy using a non-linear frequency scale with application to speech enhancement.]] ​ B. T. Logan and A. J. Robinson. In Proceedings of the European Conference on Speech Technology, volume 4, pages 2103-2106, September 1997.   * [[ftp://​svr-ftp.eng.cam.ac.uk/​pub/​reports/​auto-pdf/​logan_euro97.pdf|Improving autoregressive hidden Markov model recognition accuracy using a non-linear frequency scale with application to speech enhancement.]] ​ B. T. Logan and A. J. Robinson. In Proceedings of the European Conference on Speech Technology, volume 4, pages 2103-2106, September 1997.
Line 53: Line 66:
   * [[https://www.researchgate.net/publication/224265944_Formant_analysis_using_mixtures_of_Gaussians|Formant analysis using mixtures of Gaussians.]] Parham Zolfaghari and Tony Robinson. In Proceedings of the International Conference on Spoken Language Processing, volume 2, pages 1229-1232, October 1996.   * [[https://www.researchgate.net/publication/224265944_Formant_analysis_using_mixtures_of_Gaussians|Formant analysis using mixtures of Gaussians.]] Parham Zolfaghari and Tony Robinson. In Proceedings of the International Conference on Spoken Language Processing, volume 2, pages 1229-1232, October 1996.
   * [[https://​papers.nips.cc/​paper/​1167-bayesian-methods-for-mixtures-of-experts.pdf|Bayesian methods for mixtures of experts.]] Steve Waterhouse, David MacKay and Tony Robinson. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8. Morgan Kaufmann, 1996.   * [[https://​papers.nips.cc/​paper/​1167-bayesian-methods-for-mixtures-of-experts.pdf|Bayesian methods for mixtures of experts.]] Steve Waterhouse, David MacKay and Tony Robinson. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8. Morgan Kaufmann, 1996.
-  * S. R. Waterhouse and A. J. Robinson. Constructive algorithms for hierarchical mixtures of experts. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8. Morgan Kaufmann, 1996.
+  * [[https://papers.nips.cc/paper/1165-constructive-algorithms-for-hierarchical-mixtures-of-experts.pdf|Constructive algorithms for hierarchical mixtures of experts]]. S. R. Waterhouse and A. J. Robinson. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8. Morgan Kaufmann, 1996.
   * [[https://​papers.nips.cc/​paper/​1039-context-dependent-classes-in-a-hybrid-recurrent-network-hmm-speech-recognition-system.pdf|Context-dependent classes in a hybrid recurrent network-HMM speech recognition system.]] Dan Kershaw, Tony Robinson, and Mike Hochberg. ​ In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 750-756. Morgan Kaufmann, 1996.   * [[https://​papers.nips.cc/​paper/​1039-context-dependent-classes-in-a-hybrid-recurrent-network-hmm-speech-recognition-system.pdf|Context-dependent classes in a hybrid recurrent network-HMM speech recognition system.]] Dan Kershaw, Tony Robinson, and Mike Hochberg. ​ In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, pages 750-756. Morgan Kaufmann, 1996.
   * [[ftp://​mi.eng.cam.ac.uk/​pub/​reports/​auto-pdf/​senior_fbrnn.pdf|Forward-backward retraining of recurrent neural networks]]. Andrew Senior and Tony Robinson. In D. S. Touretzky, M. C. Mozer and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8. Morgan Kaufmann, 1996.   * [[ftp://​mi.eng.cam.ac.uk/​pub/​reports/​auto-pdf/​senior_fbrnn.pdf|Forward-backward retraining of recurrent neural networks]]. Andrew Senior and Tony Robinson. In D. S. Touretzky, M. C. Mozer and M. E. Hasselmo, editors, Advances in Neural Information Processing Systems 8. Morgan Kaufmann, 1996.
Line 86: Line 99:
   * [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.64.515&rep=rep1&type=pdf|A phonetic tactile speech listening system.]] E. M. Ellis and A. J. Robinson. Technical Report CUED/F-INFENG/TR.122, Cambridge University Engineering Department, May 1993.   * [[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.64.515&rep=rep1&type=pdf|A phonetic tactile speech listening system.]] E. M. Ellis and A. J. Robinson. Technical Report CUED/F-INFENG/TR.122, Cambridge University Engineering Department, May 1993.
   * C. Giguère, P. C. Woodland, and A. J. Robinson. Application of an auditory model to the computer simulation of hearing impairment: Preliminary results. Canadian Acoustics, 21(3), 1993.   * C. Giguère, P. C. Woodland, and A. J. Robinson. Application of an auditory model to the computer simulation of hearing impairment: Preliminary results. Canadian Acoustics, 21(3), 1993.
-  * Tony Robinson. Arti fcial neural networks: The mole-grips of the speech scientist. In Visual Representations of Speech Signals. John Wiley and Sons, 1993.
+  * Tony Robinson. Artificial Neural Networks: The mole-grips of the speech scientist. In Visual Representations of Speech Signals. John Wiley and Sons, 1993.
   * Tony Robinson. The state space and "ideal input" representations of recurrent networks. In Visual Representations of Speech Signals , pages 327-334. John Wiley and Sons, 1993.   * Tony Robinson. The state space and "ideal input" representations of recurrent networks. In Visual Representations of Speech Signals , pages 327-334. John Wiley and Sons, 1993.
   * Tony Robinson. Recurrent nets for phone probability estimation. In Proceedings of the ARPA Continuous Speech Recognition Workshop , Stanford, September 1992.   * Tony Robinson. Recurrent nets for phone probability estimation. In Proceedings of the ARPA Continuous Speech Recognition Workshop , Stanford, September 1992.
   * Tony Robinson. Practical network design and implementation. In Proceedings of the Cambridge Neural Network Summer School , Cambridge Programme for Industry, Cambridge University, September 1992.   * Tony Robinson. Practical network design and implementation. In Proceedings of the Cambridge Neural Network Summer School , Cambridge Programme for Industry, Cambridge University, September 1992.
-  * Tony Robinson. A real-time recurrent error propagation network word recognition system. In Proc. ICASSP , volume I, pages 617{620, 1992.
+  * [[http://mi.eng.cam.ac.uk/reports/svr-ftp/auto-pdf/robinson_icassp92.pdf|A real-time recurrent error propagation network word recognition system]]. Tony Robinson. In Proc. ICASSP, volume I, pages 617-620, 1992.
   * Anne Cutler and Tony Robinson. Response time as a metric for comparison of speech recognition by humans and machines. In Proceedings of the International Conference on Spoken Language Processing, October 1992.   * Anne Cutler and Tony Robinson. Response time as a metric for comparison of speech recognition by humans and machines. In Proceedings of the International Conference on Spoken Language Processing, October 1992.
   * Christine Tuerk and Tony Robinson. A multiple-speaker phoneme durational model. In Institute of Acoustics Autumn Conference on Speech and Hearing , November 1992.   * Christine Tuerk and Tony Robinson. A multiple-speaker phoneme durational model. In Institute of Acoustics Autumn Conference on Speech and Hearing , November 1992.
   * Errol M. Ellis and Tony Robinson. Two dimensional representation of phonemes of the English language. In Institute of Acoustics Autumn Conference on Speech and Hearing , November 1992.   * Errol M. Ellis and Tony Robinson. Two dimensional representation of phonemes of the English language. In Institute of Acoustics Autumn Conference on Speech and Hearing , November 1992.
-  * Tony Robinson. Experiments with the APU auditory model at CUED. ACTS internal report, 1991.
-  * Tony Robinson. Several improvements to a recurrent error propagation network phone recognition system. Technical Report CUED/F-INFENG/TR.82, Cambridge University Engineering Department, September 1991.
+  * [[http://mi.eng.cam.ac.uk/reports/svr-ftp/auto-pdf/robinson_tr82.pdf|Several improvements to a recurrent error propagation network phone recognition system]]. Tony Robinson. Technical Report CUED/F-INFENG/TR.82, Cambridge University Engineering Department, September 1991.
   * Christine Tuerk, Peter Monaco, and Tony Robinson. The development of a connectionist multiple-voice text-to-speech system. In Proc. ICASSP , 1991.   * Christine Tuerk, Peter Monaco, and Tony Robinson. The development of a connectionist multiple-voice text-to-speech system. In Proc. ICASSP , 1991.
   * N. H. Russell, F. Fallside, A. J. Robinson, and R. W. Prager. Lexical access using a recurrent error propagation network. In Proceedings of the European Conference on Speech Technology, Genoa, Italy, 1991.   * N. H. Russell, F. Fallside, A. J. Robinson, and R. W. Prager. Lexical access using a recurrent error propagation network. In Proceedings of the European Conference on Speech Technology, Genoa, Italy, 1991.
   * Tony Robinson. Recognition of continuous speech using recurrent error propagation networks. In Proceedings of Voice Systems Worldwide , London, June 1991.   * Tony Robinson. Recognition of continuous speech using recurrent error propagation networks. In Proceedings of Voice Systems Worldwide , London, June 1991.
-  * Tony Robinson and Frank Fallside. A recurrent error propagation network speech recognition system. Computer Speech and Language , 5(3):259{274, July 1991.
+  * [[http://mi.eng.cam.ac.uk/reports/svr-ftp/auto-pdf/robinson_csl91.pdf|A recurrent error propagation network speech recognition system]]. Tony Robinson and Frank Fallside. Computer Speech and Language, 5(3):259-274, July 1991.
   * Tony Robinson and Frank Fallside. Word recognition from the DARPA resource management database with the Cambridge recurrent error propagation network speech recognition system. In Third Australian International Conference on Speech Science and Technology, Melbourne, November 1990.   * Tony Robinson and Frank Fallside. Word recognition from the DARPA resource management database with the Cambridge recurrent error propagation network speech recognition system. In Third Australian International Conference on Speech Science and Technology, Melbourne, November 1990.
   * [[https://​www.researchgate.net/​publication/​221479490_A_comparison_of_preprocessors_for_the_cambridge_recurrent_error_propagation_network_speech_recognition_system|A comparison of preprocessors for the Cambridge recurrent error propagation network speech recognition system.]] Tony Robinson, John Holdsworth, Roy Patterson, and Frank Fallside. In Proceedings of the International Conference on Spoken Language Processing, pages 1033-1036, Kobe, Japan, November 1990.   * [[https://​www.researchgate.net/​publication/​221479490_A_comparison_of_preprocessors_for_the_cambridge_recurrent_error_propagation_network_speech_recognition_system|A comparison of preprocessors for the Cambridge recurrent error propagation network speech recognition system.]] Tony Robinson, John Holdsworth, Roy Patterson, and Frank Fallside. In Proceedings of the International Conference on Spoken Language Processing, pages 1033-1036, Kobe, Japan, November 1990.
Line 113: Line 125:
   * A. J. Robinson and F. Fallside. A dynamic connectionist model for phoneme recognition:​ Preliminary results. Technical Report CUED/​F-INFENG/​TR.14,​ Cambridge University Engineering Department, 1988.   * A. J. Robinson and F. Fallside. A dynamic connectionist model for phoneme recognition:​ Preliminary results. Technical Report CUED/​F-INFENG/​TR.14,​ Cambridge University Engineering Department, 1988.
   * [[https://​www.academia.edu/​30351853/​The_utility_driven_dynamic_error_propagation_network|The utility driven dynamic error propagation network.]] A. J. Robinson and F. Fallside. Technical Report CUED/​F-INFENG/​TR.1,​ Cambridge University Engineering Department, 1987.   * [[https://​www.academia.edu/​30351853/​The_utility_driven_dynamic_error_propagation_network|The utility driven dynamic error propagation network.]] A. J. Robinson and F. Fallside. Technical Report CUED/​F-INFENG/​TR.1,​ Cambridge University Engineering Department, 1987.
-  * A. J. Robinson. Speech recognition with associative networks. Master's thesis, Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK, August 1986.
+  * [[https://www.researchgate.net/publication/338188874_Speech_Recognition_with_Associative_Networks|Speech Recognition with Associative Networks.]] Tony Robinson, MPhil thesis, Cambridge University Engineering Department, Trumpington Street, Cambridge CB2 1PZ, UK, September 1986.
 + 
 +IN PROGRESS:  Email me if you want a copy of something that doesn't yet have a PDF.  See also [[https://scholar.google.co.uk/citations?user=UPV1LHUAAAAJ&hl=en|Google Scholar]], [[https://www.researchgate.net/profile/Tony_Robinson/publications|ResearchGate]] and [[https://www.semanticscholar.org/author/Tony-Robinson/1742043|Semantic Scholar]].