Yu Du

About me

I'm an AI research engineer and data scientist. At the end of 2021, I completed my Ph.D. in AI and joined Appvizer, a subsidiary of Softonic. As a digital marketing platform, Appvizer is dedicated to helping users find the most relevant software while enabling software vendors to reach their target audience. My research and development work at Appvizer primarily focuses on recommender systems and natural language processing. One of the most notable contributions has been the development of a hybrid recommender system, which led to an over 50% increase CTR (Click-Through Rate) compared to the previous legacy system. With the raise of LLMs in recent years, I have also been actively involved in prompt engineering and the development of innovative AI agents applications.

Past

After obtaining my Master's degree in Computer Science from Montpellier University in 2017, specializing in "Data, Knowledge & Natural Language Processing", I worked for Cirad for 1 year as a research engineer. Applying what I had learnt during my Master's studies, I developed a tool based on Semantic Web to facilitate information retrieval, data analysis and visualisation. Although the tool proved useful and was well-received by the firm's agents, I realised that I need to deepen my knowledge and further refine my problem-solving and analytical skills.

To this end, I chose to pursue a Ph.D. in AI---a challenging yet deeply rewarding achievement for me. During my Ph.D. at the CERIS laboratory within IMT Mines Alès, I focused on recommendation systems, a key application of machine learning widely used in real-world scenarios to mitigate information overload for end-users. The primary goal of my thesis was to enhance recommendation performance from a data-to-knowledge perspective. The data perspective aimed to improve recommendation accuracy (i.e., rating prediction and ranking) by leveraging statistical models. Meanwhile, the knowledge perspective went a step further, focusing on enhancing the quality of recommendations through more user-centric aspects such as diversity and explainability. This was achieved by incorporating knowledge engineering concepts, including the Semantic Web and knowledge graphs.

Training-wise, I have a comprehensive background in both machine learning and knowledge engineering. My Ph.D. research has helped me refine my technical skills, making me more proficient with major machine learning frameworks, containerization, and other relevant technologies. More importantly, it has also allowed me to develop essential professional skills such as presentation, writing, critical thinking, and teamwork.

Outside of work, I enjoy cooking delicious meals, travaling, reading, reflecting, and sharing life experiences with friends---especially my wife 👸. I'm also passionate about self-learning, particularly in emerging technologies.

Domains of interest

Machine learning
Prompt engineering
AI agents
Recommender systems
Knowledge engineering
Knowledge graphs
Information retrieval
Data analysis

Academia

All (11) Journals (4) Conferences (2) Certifications (5)

2025.

MCP for Production Automation. Hugging Face.

Certification

2025.

Excellence in building AI Agents. Hugging Face.

Certification

2023.

Du, Y., Lavarec, E., Lalouette, C. Text Data Augmentation to Manage Imbalanced Classification: Apply to BERT-based Large Multiclass Classification for Product Sheets. International Journal of Computational Linguistics. Vol. 14, Issue 1.

journal

2022.

Du, Y., Ranwez, S., Sutton-Charani, N., Ranwez, V. Post-hoc recommendation explanations through an efficient exploitation of the DBpedia category hierarchy. Knowledge-Based Systems. Vol. 245, pp. 108560.

journal

2021.

Du, Y., Sutton-Charani, N., Ranwez, S., Ranwez, V. Is diversity optimization always suitable? Toward a better understanding of diversity within recommendation approaches. Information Processing & Management. Vol. 58(6), pp. 102721.

journal

2021.

Du, Y., Sutton-Charani, N., Ranwez, S., Ranwez, V. EBCR: Empirical Bayes concordance ratio method to improve similarity measurement in memory-based collaborative filtering. PLOS ONE. Vol. 16(8), pp. e0255929.

journal

2021.

Natural Language Processing Specialization. Coursera.

Certification

2021.

Google Cloud Big Data and Machine Learning Fundamentals. Coursera.

Certification

2021.

Deep Learning Specialization. Coursera.

Certification

2021.

Structured Machine Learning Projects. Coursera.

Certification

2019.

Du, Y., Ranwez, S., Sutton-Charani, N., Ranwez, V. Ajustement bayesien des mesures de similarité entre utilisateurs pour améliorer les recommandations basées sur un filtrage collaboratif. LFA 2019 - Rencontres francophones sur la Logique Floue et ses Applications, Alès, France.

conference

2019.

Du, Y., Ranwez, S., Sutton-Charani, N., Ranwez, V. Apports des ontologies aux systèmes de recommandation : état de l'art et perspectives. 30es Journées Francophones d'Ingénierie des Connaissances, IC 2019, Toulouse, France.

conference

Resume

Jan 2022 - Present

AI Research Engineer & Data Scientist
Appvizer

Bringing AI to the team

Developing hybrid recommender systems

Building Machine Learning models

Data analysis & visualization

Sept 2018 - Dec 2021

Ph.D. in Computer Science
IMT Mines Alès

Research in diversified and explainable recommendations

Building knowledge graphs for recommendations

Conducting online user studies

July 2017 - May 2018

Research Engineer
Cirad

Developing a tool based on Semantic Web

Constructing ontology and knowledge bases

Knowledge navigation through graph data

Sept 2015 - Jun 2017

Master of NLP & Knowledge Engineering
Montpellier University

Jun 2016 - Sept 2016

Development Internship
Pradeo

2014

Bachelor of Computer Science
Clermont-Auvergne University