田邊光(たなべひかる)と申します。電気通信大学大学院(UEC)の博士前期課程に所属し、Vision & Language 分野の研究や Web サービス開発に取り組んでいます。最近は vision-centric なタスクを解ける MLLMs に特に関心を持っています。
Specializing in Vision-and-Language research and web development. My work focuses on enhancing food understanding through Multimodal Large Language Models (MLLMs), with an emphasis on reasoning, segmentation, and 3D-aware nutrient estimation. I am particularly interested in vision-centric tasks using MLLMs, and broadly in exploring the potential of multimodal intelligence.