{"id":666,"date":"2024-04-24T13:14:52","date_gmt":"2024-04-24T13:14:52","guid":{"rendered":"http:\/\/www.ub.edu\/cvub\/?post_type=project&#038;p=666"},"modified":"2024-05-09T06:59:39","modified_gmt":"2024-05-09T06:59:39","slug":"ideate","status":"publish","type":"project","link":"https:\/\/www.ub.edu\/aiba\/project\/ideate\/","title":{"rendered":"IDEATE"},"featured_media":669,"template":"","class_list":["post-666","project","type-project","status-publish","has-post-thumbnail","hentry"],"acf":{"summary":"Can deep learning work in the real world?: A data-centric perspective (IDEATE) - [2023 - 2026]\r\n\r\nKey Words: Deep learning, Learning with Noisy Labelling, Data-Centric Deep Learning, Uncertainty  Modelling, Self-supervised Learning","full_content":"<p><b>Name of the project: <\/b><span style=\"font-weight: 400;\">Can deep learning work in the real world?: A data-centric perspective (IDEATE)<\/span><\/p>\n<p><b>Principal Investigator (PI, Co-PI,..): <\/b>Petia Radeva &amp; Ricardo Marques<\/p>\n<p><b>Funding entity: <\/b>Ministry of Science, Innovation and Universities, State Agency of Investigation, Spain<\/p>\n<p><b>Duration: <span style=\"font-weight: 400;\">01\/09\/2023 &#8211; 30\/08\/2026<\/span><\/b><\/p>\n<p><span style=\"font-weight: 400;\">The effectiveness and efficiency of ML\/DL systems depend on the nature of the data and the models\u2019 capacity. Standard ML pipelines are built around specific training tasks characterised by a heuristic model specification, an available training dataset, and an independent and identically distributed evaluation procedure. These properties make the models\u2019 application in real conditions difficult; available effects of spurious correlations, unwanted biases, or opaque predictors in trained models are now quite widespread\u00a0<\/span><\/p>\n<p><b>Hypothesis:<\/b><span style=\"font-weight: 400;\"> We hypothesise that improving the data set quality often results in better performance than just mindlessly fiddling with model hyperparameters. Collecting data, cleaning it, and making it suitable for ML training takes up to 90% of the time. However, with the recent rapid DL development, it becomes clear that data-centric ML\/DL is one of the next challenges. Since data will never be fully clean in real scenarios, DL models should be prepared to cope with imperfect data during model training using robust model training techniques.<\/span><\/p>\n<p><b>\u00a0General objective of the IDEATE project based on our research perspective,<\/b><\/p>\n<ul>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">O1: Develop novel and robust Data-centric DL algorithms where the data is the key to improve the ML models. Address the challenges of imperfect real data.<\/span><\/li>\n<li style=\"font-weight: 400;\" aria-level=\"1\"><span style=\"font-weight: 400;\">O2: Prove the achievements of the novel techniques in real scenarios: Real-world scenarios often face data or label scarcity, missing, unbalanced and noisy data and noisy labels. We successfully apply our algorithms on domains like: food data analysis, medical image analysis, omics data analysis and industrial applications.<\/span><\/li>\n<\/ul>\n","members":[{"ID":18,"post_author":"1","post_date":"2021-03-01 14:40:03","post_date_gmt":"2021-03-01 14:40:03","post_content":"","post_title":"Petia I. Radeva","post_excerpt":"","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"petia-radeva","to_ping":"","pinged":"","post_modified":"2024-10-29 09:11:01","post_modified_gmt":"2024-10-29 09:11:01","post_content_filtered":"","post_parent":0,"guid":"http:\/\/cvml.localhost\/?post_type=member&#038;p=18","menu_order":1,"post_type":"member","post_mime_type":"","comment_count":"0","filter":"raw"},{"ID":53,"post_author":"1","post_date":"2021-05-25 15:44:27","post_date_gmt":"2021-05-25 15:44:27","post_content":"","post_title":"Ricardo Marques","post_excerpt":"","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"ricardo-marques","to_ping":"","pinged":"","post_modified":"2024-07-29 11:04:32","post_modified_gmt":"2024-07-29 11:04:32","post_content_filtered":"","post_parent":0,"guid":"http:\/\/cvml.localhost\/?post_type=member&#038;p=53","menu_order":6,"post_type":"member","post_mime_type":"","comment_count":"0","filter":"raw"}],"research_lines":[{"ID":244,"post_author":"3","post_date":"2024-03-12 10:16:50","post_date_gmt":"2024-03-12 10:16:50","post_content":"","post_title":"Deep Learning","post_excerpt":"","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"deep-learning","to_ping":"","pinged":"","post_modified":"2024-03-15 10:25:45","post_modified_gmt":"2024-03-15 10:25:45","post_content_filtered":"","post_parent":0,"guid":"http:\/\/www.ub.edu\/cvub\/?post_type=research_line&#038;p=244","menu_order":1,"post_type":"research_line","post_mime_type":"","comment_count":"0","filter":"raw"},{"ID":255,"post_author":"3","post_date":"2024-03-12 11:41:03","post_date_gmt":"2024-03-12 11:41:03","post_content":"","post_title":"Uncertainty Modeling","post_excerpt":"","post_status":"publish","comment_status":"closed","ping_status":"closed","post_password":"","post_name":"uncertainty-modeling","to_ping":"","pinged":"","post_modified":"2024-03-12 11:41:03","post_modified_gmt":"2024-03-12 11:41:03","post_content_filtered":"","post_parent":0,"guid":"http:\/\/www.ub.edu\/cvub\/?post_type=research_line&#038;p=255","menu_order":11,"post_type":"research_line","post_mime_type":"","comment_count":"0","filter":"raw"}]},"_links":{"self":[{"href":"https:\/\/www.ub.edu\/aiba\/wp-json\/wp\/v2\/project\/666","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.ub.edu\/aiba\/wp-json\/wp\/v2\/project"}],"about":[{"href":"https:\/\/www.ub.edu\/aiba\/wp-json\/wp\/v2\/types\/project"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.ub.edu\/aiba\/wp-json\/wp\/v2\/media\/669"}],"wp:attachment":[{"href":"https:\/\/www.ub.edu\/aiba\/wp-json\/wp\/v2\/media?parent=666"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}