The Wayback Machine - https://web.archive.org/web/20221025124601/https://baike.sogou.com/kexue/d10754.htm

于尔根·施密德胡伯

编辑
于尔根·施密德胡伯

Jürgen Schmidhuber(生于1963年1月17日)[1]是一位计算机科学家,以其在人工智能、深度学习和人工神经网络领域的工作而闻名。他是位于瑞士南部提契诺卢加诺区的曼诺的Dalle Molle人工智能研究所的联合主任。有时他被称为“现代人工智能之父”[2][3][4][5][6]或者“深度学习之父”。[7]

Schmidhuber在德国慕尼黑工业大学完成了他的本科学业。[1]自2004年到2009年,他在瑞士卢加诺的意大利大学担任人工智能教授。

1 工作编辑

Schmidhuber和他的学生Sepp Hochreiter,Felix Gers,Fred Cummins,Alex Graves以及其他人一起,出版了一种叫做长期短期记忆(LSTM)的循环神经网络的日益复杂的版本。Hochreiter的毕业论文(1991)中已经报告了第一批成果,其中分析并克服了著名的消失梯度问题。[8]“LSTM”这个名字是在一份技术报告(1995年)中提出的,该报告是被引用最多的LSTM出版物(1997)。[9]几乎所有当前应用中使用的标准LSTM体系结构于2000年推出。[10]现在的使用通过时间的反向传播技术的“vanilla LSTM”于2005年出版,[11][12]连接时序分类训练算法[13]于2006年出版。连接时序分类支持与LSTM的端到端语音识别。

2015年,由连接时序分类训练的LSTM被用于谷歌智能手机软件中新开发的语音识别。[14]谷歌也用LSTM作为智能助手Allo[14] 和谷歌翻译。[15][16]苹果在苹果手机上使用了LSTM技术的“快速输入”功能[17][18]和Siri。[19]亚马逊将LSTM用于亚马逊Alexa。[20]2017年,Facebook每天使用LSTM网络进行约45亿次自动翻译。[21]彭博商业周刊写道:“这些能力使LSTM成为最商业化的人工智能成就,用于预测疾病到作曲的各个领域。”[22]

2011年,Schmidhuber在IDSIA的团队和他的博士后研究生Dan Ciresan在称为GPU的快速并行计算设备上实现了卷积神经网络(CNN)的显著加速。Chellapilla等人(2006)在GPU上实现的早期CNN比在CPU上的同等实现快4倍。[23]Dan Ciresan等人2011年时在IDSIA中提出的深度循环神经网络已经快了60倍[24],并在2011年8月的计算机视觉竞赛中取得了第一次惊人表现。[25]在2011年5月15日至2012年9月10日期间,他们的快速深度循环神经网络赢得了不下四场图像比赛。[26][27]他们还显著提高了多个图像数据库在文献检索中的最佳性能。[28]这种方法已经成为计算机视觉。[27]该方法基于Yann LeCun等人早期(1989年)引入的循环神经网络的设计[29],他们将反向传播算法应用于福岛核电站原始循环神经网络架构的一个变种,被称为神经认知机,[30]后来被翁氏方法修改为最大池化。[31][27]

2014年,Schmidhuber成立了一家名为Nnaisense的公司,致力于人工智能在金融、重工业和自动驾驶汽车等领域的商业应用。Sepp Hochreiter、Jaan Tallinn和Marcus Hutter是该公司的顾问。[14]2016年销售额在1100万美元以下;然而,Schmidhuber指出,目前的重点是研究,而非盈利。纳尼森在2017年1月进行了第一轮融资。Schmidhuber的总体目标是通过在各种单方面的任务中依次训练单个人工智能来创建一个通用的人工智能;然而,一些怀疑者指出,像Arago股份有限公司和IBM这样的公司多年来已经将人工智能应用于各种不同的项目,但没有任何通用人工智能的迹象。[32]

2 争议编辑

据《卫报》报道,[33]Schmidhuber在一篇“2015尖刻文章”中抱怨说,深度学习研究员同事Geoffrey Hinton, Yann LeCun and Yoshua Bengio“大量引用彼此的成果”,但“不相信该领域的先驱”,据称他们低估了Schmidhuber和其他早期机器学习先驱的贡献,包括早在1965年发布第一个深度学习网络的Alexey Grigorevich Ivakhnenko。LeCun否认了这一指控,并表示Schmidhuber“一直宣称自己不应得的功劳”。[34][33]

3 荣誉编辑

Schmidhuber在2013年获得了 International Neural Network Society颁发的亥姆霍兹奖[34]和2016年IEEE计算智能学会神经网络先锋奖。[35]他是欧洲科学院的一员。[36][37]

参考文献

  • [1]

    ^CV.

  • [2]

    ^Wong, Andrew (May 16, 2018). "The 'father of A.I' urges humans not to fear the technology". CNBC (in 英语). Retrieved February 27, 2019..

  • [3]

    ^Blunden, Mark (June 8, 2018). "Humans will learn to confide in their robot friends, says AI expert. The father of modern AI believes robots could keep lonely people company". The Evening Standard (in 英语). Retrieved February 27, 2019..

  • [4]

    ^Micklethwaite, Jamie (February 17, 2018). "The day robots become smarter than humans will arrive on THIS DATE. THE day robots become smarter than humans is just round the corner, according to a man dubbed "the father of artificial intelligence"". Daily Star (in 英语). Retrieved February 27, 2019..

  • [5]

    ^"The 'father of A.I.' urges humans not to fear the technology". South China Morning Post (in 英语). May 16, 2018. Retrieved February 27, 2019..

  • [6]

    ^"Sony WOW Studio at SXSW 2019, Austin, Texas: Quote: "the guest speakers include Juergen Schmidhuber, the father of modern artificial intelligence who revolutionized machine learning with his lab's deep learning neural networks and Avast Security Ambassador, Garry Kasparov who is widely regarded as the greatest chessplayer in history."". PR Newswire (in 英语). February 22, 2019. Retrieved February 27, 2019..

  • [7]

    ^Wang, Brian (June 14, 2017). "Father of deep learning AI on General purpose AI and AI to conquer space in the 2050s". Next Big Future (in 英语). Retrieved February 27, 2019..

  • [8]

    ^Hochreiter, S. (1991). Untersuchungen zu dynamischen neuronalen Netzen (PDF) (diploma thesis). Technical University Munich, Institute of Computer Science (advisor Jürgen Schmidhuber)..

  • [9]

    ^Sepp Hochreiter; Jürgen Schmidhuber (1997). "Long short-term memory". Neural Computation. 9 (8): 1735–1780. doi:10.1162/neco.1997.9.8.1735. PMID 9377276..

  • [10]

    ^Felix A. Gers; Jürgen Schmidhuber; Fred Cummins (2000). "Learning to Forget: Continual Prediction with LSTM". Neural Computation. 12 (10): 2451–2471. CiteSeerX 10.1.1.55.5709. doi:10.1162/089976600300015015..

  • [11]

    ^Graves, A.; Schmidhuber, J. (2005). "Framewise phoneme classification with bidirectional LSTM and other neural network architectures". Neural Networks. 18 (5–6): 602–610. CiteSeerX 10.1.1.331.5800. doi:10.1016/j.neunet.2005.06.042. PMID 16112549..

  • [12]

    ^Klaus Greff; Rupesh Kumar Srivastava; Jan Koutník; Bas R. Steunebrink; Jürgen Schmidhuber (2015). "LSTM: A Search Space Odyssey". IEEE Transactions on Neural Networks and Learning Systems. 28 (10): 2222–2232. arXiv:1503.04069. doi:10.1109/TNNLS.2016.2582924. PMID 27411231..

  • [13]

    ^Graves, Alex; Fernández, Santiago; Gomez, Faustino (2006). "Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks". In Proceedings of the International Conference on Machine Learning, ICML 2006: 369–376. CiteSeerX 10.1.1.75.6306..

  • [14]

    ^Khaitan, Pranav (May 18, 2016). "Chat Smarter with Allo". Research Blog. Retrieved 2017-06-27..

  • [15]

    ^Wu, Yonghui; Schuster, Mike; Chen, Zhifeng; Le, Quoc V.; Norouzi, Mohammad; Macherey, Wolfgang; Krikun, Maxim; Cao, Yuan; Gao, Qin (2016-09-26). "Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation". arXiv:1609.08144 [cs.CL]..

  • [16]

    ^Metz, Cade (September 27, 2016). "An Infusion of AI Makes Google Translate More Powerful Than Ever | WIRED". Wired. Retrieved 2017-06-27..

  • [17]

    ^Efrati, Amir (June 13, 2016). "Apple's Machines Can Learn Too". The Information. Retrieved 2017-06-27..

  • [18]

    ^Ranger, Steve (June 14, 2016). "iPhone, AI and big data: Here's how Apple plans to protect your privacy | ZDNet". ZDNet. Retrieved 2017-06-27..

  • [19]

    ^Smith, Chris (2016-06-13). "iOS 10: Siri now works in third-party apps, comes with extra AI features". BGR. Retrieved 2017-06-27..

  • [20]

    ^Vogels, Werner (30 November 2016). "Bringing the Magic of Amazon AI and Alexa to Apps on AWS. - All Things Distributed". www.allthingsdistributed.com. Retrieved 2017-06-27..

  • [21]

    ^Ong, Thuy (4 August 2017). "Facebook's translations are now powered completely by AI". www.allthingsdistributed.com. Retrieved 2019-02-15..

  • [22]

    ^Vance, Ashlee (May 15, 2018). "Quote: These powers make LSTM arguably the most commercial AI achievement, used for everything from predicting diseases to composing music". Bloomberg Business Week. Retrieved 2019-01-16..

  • [23]

    ^Kumar Chellapilla; Sid Puri; Patrice Simard (2006). "High Performance Convolutional Neural Networks for Document Processing". In Lorette, Guy. Tenth International Workshop on Frontiers in Handwriting Recognition. Suvisoft..

  • [24]

    ^Ciresan, Dan; Ueli Meier; Jonathan Masci; Luca M. Gambardella; Jurgen Schmidhuber (2011). "Flexible, High Performance Convolutional Neural Networks for Image Classification" (PDF). Proceedings of the Twenty-Second International Joint Conference on Artificial Intelligence-Volume Volume Two. 2: 1237–1242. Retrieved 17 November 2013..

  • [25]

    ^"IJCNN 2011 Competition result table". OFFICIAL IJCNN2011 COMPETITION (in 英语). 2010. Retrieved 2019-01-14..

  • [26]

    ^Schmidhuber, Jürgen (17 March 2017). "History of computer vision contests won by deep CNNs on GPU" (in 英语). Retrieved 14 January 2019..

  • [27]

    ^Schmidhuber, Jürgen (2015). "Deep Learning". Scholarpedia. 10 (11): 1527–54. CiteSeerX 10.1.1.76.1541. doi:10.1162/neco.2006.18.7.1527. PMID 16764513..

  • [28]

    ^Ciresan, Dan; Meier, Ueli; Schmidhuber, Jürgen (June 2012). Multi-column deep neural networks for image classification. 2012 IEEE Conference on Computer Vision and Pattern Recognition. New York, NY: Institute of Electrical and Electronics Engineers (IEEE). pp. 3642–3649. arXiv:1202.2745. CiteSeerX 10.1.1.300.3283. doi:10.1109/CVPR.2012.6248110. ISBN 978-1-4673-1226-4. OCLC 812295155..

  • [29]

    ^Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, L. D. Jackel, Backpropagation Applied to Handwritten Zip Code Recognition; AT&T Bell Laboratories.

  • [30]

    ^Fukushima, Neocognitron (1980). "A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position". Biological Cybernetics. 36 (4): 193–202. doi:10.1007/bf00344251..

  • [31]

    ^Weng, J; Ahuja, N; Huang, TS (1993). "Learning recognition and segmentation of 3-D objects from 2-D images". Proc. 4th International Conf. Computer Vision: 121–128..

  • [32]

    ^"AI Pioneer Wants to Build the Renaissance Machine of the Future". Bloomberg.com (in 英语). 16 January 2017. Retrieved 23 February 2018..

  • [33]

    ^Oltermann, Philip (18 April 2017). "Jürgen Schmidhuber on the robot future: 'They will pay as much attention to us as we do to ants'". the Guardian (in 英语). Retrieved 23 February 2018..

  • [34]

    ^John Markoff (27 November 2016). When A.I. Matures, It May Call Jürgen Schmidhuber ‘Dad’. The New York Times. Accessed April 2017..

  • [35]

    ^Recipients: Neural Networks Pioneer Award. Piscataway, NJ: IEEE Computational Intelligence Society. Accessed January 2019.].

  • [36]

    ^Members. European Academy of Sciences and Arts. Accessed December 2016..

  • [37]

    ^Dave O'Leary (3 October 2016). The Present and Future of AI and Deep Learning Featuring Professor Jürgen Schmidhuber. IT World Canada. Accessed April 2017..

阅读 311
版本记录
  • 暂无