D. Reilly, R. Chakraborty, A. Sinha, MK. Govind, P. Wang, F. Bremond, L. Xue, and S. Das. LLAVIDAL: A Large LAnguage VIsion Model for Daily Activities of Living, In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2025, Nashville, USA, June 11th-15th, 2025.