科学网—深度学习助力提升物种鉴别效率

分类学工作者采集到大量标本，可以研究物种的形态和遗传的变异范围，是一件非常令人兴奋的事情。但有时，大量小型标本的分拣也是令人头疼的事情。智能手机、数码相机和互联网时代，随时随地可能获得有趣的图片，便于专家进行识别。然而，当相同物种的图片增加到一定程度，专家也无法及时给予回复。这里面最大的问题，还在于物种图片鉴别速度和鉴别准确率的问题。我在上一篇博文中提到“开发识别技术，加速物种分类学研究进程”。分子生物学和分类学整合，在过去的几十年极大推动了物种界定和生物系统学的发展。在快速逼近的大数据时代和深度学习时代，物种图片识别技术是否能够得到快速发展？生物分类的过程，实际上是人类智力和时间高度参与人工图片识别和归类的过程。能否加速图片识别，降低人类对重复图片识别和归类的时间投入？

2012年，中国科学院网络中心的李健博士等就已经实现了鸟类的图片识别。该物种形态学鉴别系统基于 IT 技术，包括了 B/S 架构的 Web 端和手持设备端两部分。使得用户可直接访问专家数据库以判定个体从属的种类，同时也可以上传信息请求专家审核.此鉴别系统在为用户提供方便准确的物种鉴别手段的同时,也为分类学家提供了一个获取大量信息的平台（鸟类物种形态学鉴别系统设计与实现：http://escj.cnic.cn/CN/abstract/abstract12648.shtml）。目前，他的团队已经能够处理动态视频中的鸟类物种识别问题。这样的工作及其平台，能否移植到昆虫工作中？

今天早晨读到Nvidia网站上一篇关于野生动物图片自动鉴别的报道，同时阅读了昆虫图像识别的一篇综述。两者分别从硬件和软件的角度，提出了一些有价值的思路，值得参考。它们的共同点如下：

2、像素得到极大提升的智能手机和数码相机已经非常普及，和植物图片一样，昆虫的图片数据也在不断增加；

4、野外相机监测、智能手机和数码相机获得图片不一定和室内的专业图片相同的大小、摆放位置和角度；

5、均可以引入深度学习的理念，在未来海量图片的基础上，整合专业级图片，加速图像识别和自动鉴别，并提高准确程度；

6、把分类学者从标本分拣的琐碎工作中解放出来，既能够获得物种的分布数据，又能够集中精力针对部分标本开展深入的物种研究工作。

A research team led by University of Wyoming developed a deep learning-based system to identify species in the Serengeti National Park in Tanzania that could make it easier for ecologists to track animals in the wild.

Camera traps automatically take pictures of passing animals when triggered by heat and motion which produce millions of images and extracting knowledge from these camera-trap images is traditionally done by humans. According to their research paper, it currently takes two to three months for a group of thousands of people to manually label each 6-month batch of images.

When lead researcher Jeff Clune at the University of Wyoming in Laramie heard about the project, he knew it was an ideal opportunity to leverage deep learning.

Using a variety of GPUs including the NVIDIA DGX-1 AI supercomputer and CUDA, cuDNN and NCLL with the Torch deep learning framework, the researchers trained their deep convolutional neural network to recognize 48 species in 3.2 million images taken from the national park’s hidden camera-traps. They were able to train their neural networks to automatically identify the variety of animals with over 92% accuracy.

A sample camera-trap image from the Snapshot Serengeti dataset.

“This is very exciting,” says Chris Carbone at the Zoological Society of London. Automatic species recognition could help us learn more about the distribution of species and get a better idea of the impact humans are having on them, he says.

Read more >

zhuchaodong的个人博客分享 http://blog.sciencenet.cn/u/zhuchaodong

博文

深度学习助力提升物种鉴别效率

当前推荐数：3 推荐人：谢强 张庆费 蔡庆华

该博文允许注册用户评论请点击登录评论 (1 个评论)

朱朝东

全部作者的精选博文

全部作者的其他最新博文

全部精选博文导读

相关博文