This document summarizes a project to develop an image recognition application for identifying Korean foods and estimating their calorie contents. It used Faster R-CNN with a VGG16 model trained on 40 food classes with 300 training and 100 test images per class. Initial mean average precision results were low for some classes like cooked rice. Further analysis found the model was confusing similar looking foods. To address this, a post-processing layer was added to merge similar predictions and select the most probable class based on intersection over union and probability. Live demos of the web application showed it could now accurately identify multiple foods in single images and estimate calorie totals.
Related topics: