IRI - MoHuCo: Modeling Humans in Context

Research Project

MoHuCo: Modeling Humans in Context

Type

National Project

Start Date

01/09/2021

End Date

28/02/2025

Project Code

PID2020-120049RB-I00

Staff

Agudo, Antonio
Principal Investigator

Sanchez, Jordi
Principal Investigator

Sanchez, Jordi
Researcher

Pérez, Marc
PhD Student

Gutiérrez, Marc
PhD Student

Raza, Syed Riaz
Master Student

Syed, Raza
Master Student

Pérez, Raül
Member

Ugrinovic, Nicolás
Member

Moreno, Francesc
Member

Project Description

Project PID2020-120049RB-I00 funded by MCIN/AEI/10.13039/501100011033

Recent advances in computer vision and deep learning have shown impressive results in modelling different aspects of humans. Given a single image or a video sequence, these models provide detailed reconstructions of the body shape and clothes, predict future movements and understand human behaviour, emotions and intentions. However, one essential factor has been overlooked so far: most of these human characteristics are inherently driven by interactions with objects and/or other people in the environment. For instance, the body trajectory is highly constrained by the spatial distribution of the other objects in the environment, and a particular facial expression (e.g. ‘fear’) may respond to a specific circumstance occurring in the surroundings (e.g. ‘danger’). Understanding these types of human-context connections would allow going beyond the current state of the art and performing robust human reasoning under complex situations such as partial observations (e.g. crowded scenes, heavy occlusions) or indirect observations (predicting human characteristics from contextual clues).

The goal of MoHuCo is therefore to develop novel computer vision tools to discover interrelations between a person’s properties and the context. For this purpose, we will split the project into three main blocks:

1) Observing the human: consolidating, and pushing to their limits, the algorithms for 3D body/cloth reconstruction, motion prediction and behaviour analysis given direct observations of the person.

2) Observing the context: researching novel algorithms to extract heterogeneous information (both geometric and semantic) about the environment.

3) Building joint human-context models: bringing the representations of humans and environment into a single model, making it possible to reason indirectly about the human from direct observations of the context.

Project Publications

Journal Publications

E. Corona, G. Alenyà, G. Pons-Moll and F. Moreno-Noguer. LayerNet: high-resolution semantic 3D reconstruction of clothed people. IEEE Transactions on Pattern Analysis and Machine Intelligence, 46(2): 1257-1272, 2024.

G.D. Delmas, P. Weinzaepfel, T. Lucas, F. Moreno-Noguer and G. Rogez. PoseScript: linking 3D human poses and natural language. IEEE Transactions on Pattern Analysis and Machine Intelligence: 1-13, 2024, to appear.

A. Dhamanaskar, M. Dimiccoli, E. Corona, A. Pumarola and F. Moreno-Noguer. Enhancing egocentric 3D pose estimation with third person views. Pattern Recognition, 138(109358), 2023.

Conference Publications

A. Berresheim and A. Agudo. Photovoltaic power forecasting using sky images and sun motion, 2024 IEEE International Conference on Acoustics, Speech and Signal Processing, 2024, Seoul, Korea, pp. 4260-4264.

A.F. Budria, A. López, O. Lorente and F. Moreno-Noguer. InstantGeoAvatar: Effective geometry and appearance modeling of animatable avatars from monocular video, 17th Asian Conference on Computer Vision, 2024, Hanoi, in Computer Vision – ACCV 2024, Vol 15472 of Lecture Notes in Computer Science, pp. 255-277, 2024.

M. Gutiérrez and A. Agudo. No bells, just whistles: sports field registration by leveraging geometric properties, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW). Workshop on Computer Vision in Sports, 2024, Seattle (USA), pp. 3325-3334.

R. Pérez, A. Espersen and A. Agudo. Generalized nested latent variable models for lossy coding applied to wind turbine scenarios, 2024 IEEE International Conference on Image Processing, 2024, Abu Dhabi, UAE, pp. 1947-1953.

A. Casanova and A. Agudo. Uncalibrated and unsupervised photometric stereo with piecewise regularizer, 2024 IEEE International Conference on Image Processing, 2024, Abu Dhabi, UAE, pp. 3471-3476.

G. Capellera, L. Ferraz, A. Rubio, A. Agudo and F. Moreno-Noguer. FootBots: A transformer-based architecture for motion prediction in soccer, 2024 IEEE International Conference on Image Processing, 2024, Abu Dhabi, UAE, pp. 2313-2319.

N. Ugrinovic, A. Ruiz, A. Agudo, A. Sanfeliu and F. Moreno-Noguer. PIRO: Permutation-invariant relational network for multi-person 3D pose estimation, 19th International Conference on Computer Vision Theory and Applications, 2024, Rome (Italy), pp. 295-305.

M. Gutiérrez. No bells, just whistles: sports field registration by leveraging geometric properties, 2024 IRI Doctoral Day, 2024, Barcelona, pp. 9.

N. Ugrinovic, T. Lucas, F. Baradel, P. Weinzaepfel, G. Rogez and F. Moreno-Noguer. Purposer: Putting human motion generation in context, 2024 International Conference on 3D Vision, 2024, Davos, Switzerland, pp. 1310-1319.

N. Ugrinovic, B. Pan, G. Pavlakos, D. Paschalidou, B. Shen, J. Sanchez, F. Moreno-Noguer and L. Guibas. MultiPhys: Multi-person physics-aware 3D motion estimation, 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024, Seattle, USA, pp. 2331-2340.

G.D. Delmas, P. Weinzaepfel, F. Moreno-Noguer and G. Rogez. PoseEmbroider: towards a 3D, visual, semantic-aware human pose representation, 18th European Conference on Computer Vision, 2024, Milano, Italy, in Computer Vision – ECCV 2024, Vol 15110 of Lecture Notes in Computer Science, pp. 55-73, 2024.

G.D. Delmas, P. Weinzaepfel, F. Moreno-Noguer and G. Rogez. PoseFix: correcting 3D human poses with natural language, 2023 International Conference on Computer Vision, 2023, Paris, France, pp. 14972-14982.

D.F. Ordoñez, M. Martin, A. Agudo and F. Moreno-Noguer. Morphological symmetries in robot learning, 2023 RSS Workshop on Symmetries in Robot Learning, 2023, Daegu (South Korea), pp. 1-5.

P. Caselles, E. Ramon, J. Garcia, X. Giro-i-Nieto, F. Moreno-Noguer and G. Triginer. SIRA: Relightable Avatars from a Single Image, 2023 IEEE Winter Conference on Applications of Computer Vision, 2023, Waikoloa, Hawaii, pp. 775-784.

F. Rivas-Manzaneque, J. Sierra-Acosta, A. Penate-Sanchez, F. Moreno-Noguer and A. Ribeiro. NeRFLight: Fast and light neural radiance fields using a shared feature grid, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, Vancouver, Canada, pp. 12417-12427.

W. Guo, Y. Du, X. Shen, V. Lepetit, X. Alameda and F. Moreno-Noguer. Back to MLP: A simple baseline for human motion prediction, 2023 IEEE Winter Conference on Applications of Computer Vision, 2023, Waikoloa, Hawaii, pp. 4798-4808.

A. Agudo. Detail-aware uncalibrated photometric stereo, 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023, Rhodes Island, Greece, pp. 1-5.

D.F. Ordoñez, M. Martin, A. Agudo and F. Moreno-Noguer. On discrete symmetries of robotics systems: A group-theoretic and data-driven analysis, 2023 Robotics: Science and Systems Conference, 2023, Daegu, Republic of Korea.

A. Urdapilleta and A. Agudo. Comparative study of feature localization methods for endoscopy image matching, 2023 IEEE International Conference on Image Processing Challenges and Workshops, 2023, Kuala Lumpur, Malaysia, pp. 3719-3723, IEEE.

R. Pérez, A. Espersen and A. Agudo. Robust wind turbine blade segmentation from RGB images in the wild, 2023 IEEE International Conference on Image Processing, 2023, Kuala Lumpur, Malaysia, pp. 1025-1029.

M. Pérez and A. Agudo. Sensor-agnostic multimodal fusion for multiple object tracking from camera, radar, lidar and V2X, 2023 FISITA 2023 World Congress, 2023, Barcelona.

M. Pérez and A. Agudo. Robust multimodal and multi-object tracking for autonomous driving applications, 2023 International Conference on Advanced Robotics, 2023, Abu Dhabi, UAE, pp. 100-106, IEEE.

A. Agudo. Safari from visual signals: Recovering volumetric 3D shapes, 2022 IEEE International Conference on Acoustics, Speech and Signal Processing, 2022, Singapore, pp. 2495-2499.

D.F. Ordoñez, A. Agudo, F. Moreno-Noguer and M. Martin. An adaptable approach to learn realistic legged locomotion without examples, 2022 IEEE International Conference on Robotics and Automation, 2022, Philadelphia, Pennsylvania, USA, pp. 4671-4678.

A. Agudo. Spline human motion recovery, 2022 IEEE International Conference on Image Processing, 2022, Bordeaux, France, pp. 4138-4142, IEEE.

N. Ugrinovic, A. Pumarola, A. Sanfeliu and F. Moreno-Noguer. Single-view 3D body and cloth reconstruction under complex poses, 17th International Conference on Computer Vision Theory and Applications, 2022, Online.

P. Estevez and A. Agudo. Uncalibrated, unified and unsupervised specular-aware photometric stereo, 2022 ICPR Workshop on Towards a Complete Analysis of People: From Face and Body to Clothes, 2022, Montreal (Canada), pp. 7-20.

G.D. Delmas, P. Weinzaepfel, T. Lucas, F. Moreno-Noguer and G. Rogez. PoseScript: 3D human poses from natural language, 17th European Conference on Computer Vision, 2022, Tel Aviv (Israel), in Computer Vision – ECCV 2022, Vol 13666 of Lecture Notes in Computer Science, pp. 346-362, 2022.

E. Corona, G. Pons-Moll, G. Alenyà and F. Moreno-Noguer. Learned Vertex Descent: a new direction for 3D human model fitting, 17th European Conference on Computer Vision, 2022, Tel Aviv (Israel), in Computer Vision – ECCV 2022, Vol 13666 of Lecture Notes in Computer Science, pp. 146-165, 2022.

J. Shen, A. Agudo, F. Moreno-Noguer and A. Ruiz. Conditional-Flow NeRF: Accurate 3D modelling with reliable uncertainty quantification, 17th European Conference on Computer Vision, 2022, Tel Aviv (Israel), in Computer Vision – ECCV 2022, Vol 13666 of Lecture Notes in Computer Science, pp. 540-557, 2022.

A. Pérez and A. Agudo. Matching and recovering 3D people from multiple views, 2022 IEEE Winter Conference on Applications of Computer Vision, 2022, Waikoloa, Hawaii, USA, pp. 1184-1193, IEEE.

A. Ruiz, A. Agudo and F. Moreno-Noguer. Generating attribution maps with disentangled masked backpropagation, 2021 International Conference on Computer Vision, 2021, Montreal, Canada, pp. 885-894.

J. Sanchez, A. Pumarola and F. Moreno-Noguer. PhysXNet: A customizable approach for learning cloth dynamics on dressed people, 2021 International Conference on 3D Vision, 2021, London, UK (Virtual), pp. 879-888.

E. Ramon, G. Triginer, J. Escur, A. Pumarola, J. García, X. Giro-i-Nieto and F. Moreno-Noguer. H3D-Net: Few-shot high-fidelity 3D head reconstruction, 2021 International Conference on Computer Vision, 2021, Montreal, Canada, pp. 5600-5609.

A. Hernandez Ruiz, A. Vilalta and F. Moreno-Noguer. Neural Cellular Automata manifold, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, Nashville, TN, USA (Virtual), pp. 10015-10023, Computer Vision Foundation.

N. Ugrinovic, A. Ruiz, A. Agudo, A. Sanfeliu and F. Moreno-Noguer. Body size and depth disambiguation in multi-person reconstruction from single images, 2021 International Conference on 3D Vision, 2021, London, UK (Virtual), pp. 53-63.

J. Shen, A. Ruiz, A. Agudo and F. Moreno-Noguer. Stochastic Neural Radiance Fields: Quantifying uncertainty in implicit 3D representations, 2021 International Conference on 3D Vision, 2021, London, UK (Virtual), pp. 972-981.

A. Chatziagapi, S. Athar, F. Moreno-Noguer and D. Samaras. SIDER: Single-image neural optimization for facial geometric detail recovery, 2021 International Conference on 3D Vision, 2021, London, UK (Virtual), pp. 815-824.

Institut de Robòtica i Informàtica Industrial, CSIC-UPC
C/ Llorens i Artigas 4-6, 08028, Barcelona, Spain
