Grad Students Study AI’s Role in UNDP Work Processes

February 23, 2026

Rutgers graduate students presented their findings on “Effectiveness of AI-assisted report assessments: A case study of the United Nations Development Program,” at the American Association for the Advancement of Science (AAAS) Annual Meeting’s poster competition in Phoenix, AZ in mid-February.

The project is being undertaken by Rutgers graduate students Raquel Padilla (PhD Planning and Public Policy), Sandra Jae-Ah Chung (MPP/MCRP candidate), Hemali Angne (PhD Computer Science), and Bloustein School Professor Hal Salzman, all part of the Rutgers SOCRATES (Socially Cognizant Robotics for a Technologically-Enhanced Society) research group. The project is in collaboration with the United Nations Development Programme (UNDP) through their Independent Evaluation Office (IEO).

Hemali Angne, a PhD candidate in computer science at Rutgers, stands before a presentation board. They are presenting at the AAAS poster competition before the judging panel. Four judges are in the foreground of the photo, wearing headphones and looking at the poster board and presentation.

Hemali Angne is presenting at the AAAS poster competition before the judging panel.

 

The study assesses the effectiveness of an AI tool based on a large language model (LLM), conducting quality assessments of UN project evaluation reports. Initial findings indicate that the language used in AI-generated assessments is more constrained than that of human reviewers. Furthermore, AI outputs lack nuance and context-awareness.

These limitations affect the usefulness of quality assessments in improving the reports and providing feedback to the development projects. This also raises the question of whether incorporating AI will effectively reduce operating costs or instead add new technology expenses that, while they provide improvements, still require a “human-in-the-loop.” The researchers suggest these findings have broader implications for the use of AI in knowledge work.

The UNDP funds a diverse set of projects, ranging from conserving biodiversity to increasing participation in local governance, in different countries. Country offices prepare evaluation reports on these projects. The quality assessment of these reports is functional in improving the projects and giving feedback to country offices. The IEO currently hires external reviewers for this task, with a cost of around $180,000 per year. To reduce operational costs, the IEO is looking to replace reviewers with an AI tool, specifically a general-purpose LLM guided by prompts.

The team at Rutgers conducted qualitative and linguistic analyses of AI outputs, finding shortcomings in the content. The main finding was that the AI struggled with implicit and culturally-specific content, which is essential for an international organization such as the UNDP. AI-generated comments were monotonous and repetitive, not reflecting the diversity of the reports. This was confirmed in a linguistic analysis, which uses a computer model to arrange comments of different reports into a cohesive “linguistic map.”

Comments that share similarities get positioned closer together, while dissimilar comments are located farther apart. The analysis showed that the pool of AI comments was much closer together than the human pool.

A green and orange diagram with point showing a linguistic map of AI and human comments on a set of reports.

A linguistic map of AI and human comments on a set of reports.

 

These gaps suggest that general purpose AI systems have limitations in tasks like report assessment. Rather than reducing costs, such tools may shift work back onto humans, increasing time and effort to interpret and correct AI outputs.

Recent Posts

At Rutgers, Students Are Learning About Democracy in a Lab

Nicholas V. Longo is leading a university-wide effort on how to expand engagement in civic life Nicholas V. Longo, the inaugural director of the Rutgers Democracy Lab, insists democracy is something you learn by doing – not just in a classroom or at the ballot box,...

Samuel, Thakuriah Lead Discussions at RAD Collaboratory

The 𝐑𝐮𝐭𝐠𝐞𝐫𝐬 𝐀𝐫𝐭𝐢𝐟𝐢𝐜𝐢𝐚𝐥 𝐈𝐧𝐭𝐞𝐥𝐥𝐢𝐠𝐞𝐧𝐜𝐞 𝐚𝐧𝐝 𝐃𝐚𝐭𝐚 𝐒𝐜𝐢𝐞𝐧𝐜𝐞 (𝐑𝐀𝐃) 𝐂𝐨𝐥𝐥𝐚𝐛𝐨𝐫𝐚𝐭𝐨𝐫𝐲 recently hosted its inaugural Research Symposium on 3/24/26 - an amazing event that has sparked much interest in collaborative research with AI as a matchmaking catalyst....

Bulger et al. Examine Food Security, Sovereignty as Climate Adaptation

Bridging Western and Indigenous epistemologies in an opaque world Food security and food sovereignty as climate adaptation Abstract Food security and food sovereignty represent two similar but distinct pathways for community-led climate adaptation. This study examines...

Advancing Women’s Equity Through Policymaking: An NJSPL Panel

In response to an invitation from the Douglass Residential College and the Institute for Women's Leadership to host programs focused on women's issues at Rutgers University in honor of Women's History Month, the New Jersey State Policy Lab convened a panel of recent...

Real-World Insights in Global Freight Movement

On Monday, March 23, supply chain leaders from Johnson & Johnson provided real-world insights to Anne Strauss-Wieder’s graduate Freights & Ports class to break down the realities of  pharmaceutical production and global freight movement. Rutgers alumni Lisa...