大工至善|大学至真分享 http://blog.sciencenet.cn/u/lcj2212916

博文

[转载]【计算机科学】【2018.06】深度学习在食品目标识别中的应用

已有 1291 次阅读 2020-2-1 17:31 |系统分类:科研笔记|文章来源:转载

本文为芬兰阿尔托大学(作者:Janaki Prasad Koirala)的硕士论文,共64页。

 

识别食物图像可以拯救人们的生命。它可以用来了解食物中潜在过敏原的存在,或者估计食物的营养含量,也可以用来对抗肥胖的流行。考虑到这些应用,我们试图利用机器学习和深度学习的进展来训练从数码照片中识别欧洲食品的模型。从研究文献中发现,速度较快的RCNN是目前最先进的基于CNN的图像局部信息获取和识别框架。此外,我们还开发了一个Android应用程序来识别食物对象。更快的RCNN需要大量的数据,其中包含对象的标签和定位信息。要找到这样的数据集来训练我们的网络是非常困难的。我们用445个标签制作了一个69k图像的食物数据集,并用这些图像训练我们的模型。但是数据集在每个类别的图像数量方面存在偏差,这会对模型的性能产生负面影响。为了提高性能,我们尝试了几种方法,比如只提取标签的子集,并为每个标签均衡训练样本的数量。在训练样本有限的情况下,我们也使用迁移学习来解决网络过度拟合的问题。最后,通过使用公开的数据集并根据我们的需要进行调整,我们的模型能够以0.37的平均精度识别图像。Android应用程序使用这个模型从图像中识别食物对象。

 

Identifying a food from its image can savepeople’s life. It can be used to know the presence of potential allergens infood or by estimating the nutritional content of food, it may also be used tocombat the obesity epidemic. With such applications in mind, we seek to exploitthe advances in machine learning and deep learning to train models thatidentify European food from digital photos. From the literature it wasdiscovered that the Faster RCNN was the current state-of-art CNN basedframework which could get local information of object in image and recognizeit. Furthermore, we also develop an Android application for recognition of foodobjects. Faster RCNN requires a large volume of data with labels andlocalization information of the objects present in them. It is very challengingto find such datasets to train our network. We made up a food dataset of 69kimages with 445 labels and trained our model using those images. But thedataset was skewed in terms of numbers of images per category that negativelyaffected the performance of the model. To improve the performance, we triedseveral approaches like taking only a subset of labels and equalizing thenumber of training samples for each label. We also used transfer learning to getaround the problem of overfitting the network when our training sample size islimited. Finally, by using publicly available data set and adapting it to ourneeds, our model was able to identify images with 0.37 mean Average Precision.The Android application uses this model to recognize food objects from images.

 

引言

项目背景

目标识别

实验

结果

结论

附录 排行榜前90的食品类别


更多精彩文章请关注公众号:qrcode_for_gh_60b944f6c215_258.jpg



https://blog.sciencenet.cn/blog-69686-1216468.html

上一篇:[转载]【信息技术】【2017.08】一种用于设备间认证和安全通信的自定义加密算法的设计与实现
下一篇:[转载]【信息技术】【2010.06】城市交通场景的车辆分类
收藏 IP: 114.102.185.*| 热度|

0

该博文允许注册用户评论 请点击登录 评论 (0 个评论)

数据加载中...

Archiver|手机版|科学网 ( 京ICP备07017567号-12 )

GMT+8, 2024-4-27 10:27

Powered by ScienceNet.cn

Copyright © 2007- 中国科学报社

返回顶部