Grad Students Study AI’s Role in UNDP Work Processes

February 23, 2026

Rutgers graduate students presented their findings on “Effectiveness of AI-assisted report assessments: A case study of the United Nations Development Program,” at the American Association for the Advancement of Science (AAAS) Annual Meeting’s poster competition in Phoenix, AZ in mid-February.

The project is being undertaken by Rutgers graduate students Raquel Padilla (PhD Planning and Public Policy), Sandra Jae-Ah Chung (MPP/MCRP candidate), Hemali Angne (PhD Computer Science), and Bloustein School Professor Hal Salzman, all part of the Rutgers SOCRATES (Socially Cognizant Robotics for a Technologically-Enhanced Society) research group. The project is in collaboration with the United Nations Development Programme (UNDP) through their Independent Evaluation Office (IEO).

Hemali Angne, a PhD candidate in computer science at Rutgers, stands before a presentation board. They are presenting at the AAAS poster competition before the judging panel. Four judges are in the foreground of the photo, wearing headphones and looking at the poster board and presentation.

Hemali Angne is presenting at the AAAS poster competition before the judging panel.

The study assesses the effectiveness of an AI tool based on a large language model (LLM), conducting quality assessments of UN project evaluation reports. Initial findings indicate that the language used in AI-generated assessments is more constrained than that of human reviewers. Furthermore, AI outputs lack nuance and context-awareness.

These limitations affect the usefulness of quality assessments in improving the reports and providing feedback to the development projects. This also raises the question of whether incorporating AI will effectively reduce operating costs or instead add new technology expenses that, while they provide improvements, still require a “human-in-the-loop.” The researchers suggest these findings have broader implications for the use of AI in knowledge work.

The UNDP funds a diverse set of projects, ranging from conserving biodiversity to increasing participation in local governance, in different countries. Country offices prepare evaluation reports on these projects. The quality assessment of these reports is functional in improving the projects and giving feedback to country offices. The IEO currently hires external reviewers for this task, with a cost of around $180,000 per year. To reduce operational costs, the IEO is looking to replace reviewers with an AI tool, specifically a general-purpose LLM guided by prompts.

The team at Rutgers conducted qualitative and linguistic analyses of AI outputs, finding shortcomings in the content. The main finding was that the AI struggled with implicit and culturally-specific content, which is essential for an international organization such as the UNDP. AI-generated comments were monotonous and repetitive, not reflecting the diversity of the reports. This was confirmed in a linguistic analysis, which uses a computer model to arrange comments of different reports into a cohesive “linguistic map.”

Comments that share similarities get positioned closer together, while dissimilar comments are located farther apart. The analysis showed that the pool of AI comments was much closer together than the human pool.

A green and orange diagram with point showing a linguistic map of AI and human comments on a set of reports.

A linguistic map of AI and human comments on a set of reports.

These gaps suggest that general purpose AI systems have limitations in tasks like report assessment. Rather than reducing costs, such tools may shift work back onto humans, increasing time and effort to interpret and correct AI outputs.

Recent Posts

“Work Trends RU” Podcast with Jimmy Green and Jackie Burke

February 24, 2026

A Conversation with Jimmy Green, of the Heldrich Center for Workforce Development, and Jackie Burke, of the New Jersey Council of County Vocational-Technical Schools, Guests on Work Trends RU Podcast Listen to the latest episode of the Heldrich Center’s “Work Trends...

Chen et al. Examine Alcohol, Cannabis, and HIV Risk

February 20, 2026

Alcohol and Cannabis co-use and HIV risk, Treatment and Prevention Outcomes: A Scoping Review Abstract Purpose of Review Alcohol and cannabis are substances commonly used by people with or made vulnerable to HIV. With changing cannabis legalization, cannabis use has...

Christiana Foglio, DC’84, BSPPP’86 Named RAA Loyal Daughter

February 19, 2026

The Rutgers Alumni Association’s Loyal Sons & Daughters Award is its highest recognition of service. Recipients are individuals who have made a meaningful and long-standing contribution to the betterment of Rutgers by performing extraordinary volunteer service or...

Lindenfeld Investigates LFO Impacts on Health Outcomes

February 18, 2026

Legal Financial Obligations: An Understudied Public Health Exposure Abstract The impacts of exposure to the criminal justice system on health-related outcomes are well studied in the United States (US). However, while previous studies focus on the impacts of arrest,...

EJB Talks: Beyond “Does It Work?”

February 17, 2026

Beyond “Does It Work?”: Laura Peck on Policy, Evidence, and Impact EJB Talks returns for Season 14 with Dean Stuart Shapiro speaking with Laura Peck, one of our newest Public Policy Associate Professors and a Principal Faculty Fellow with the Heldrich Center for...

← Previous Post Next Post →