Garden State Open Data Index (GSODI)

July 14, 2023

Metadata for an Integrated View of New Jersey’s Open Information Ecosystem.

By Jim Samuel

Open data and artificial intelligence (AI) are vital for future value creation. The value of aligning open data with AI development and deployment requirements has been elaborated upon in the Garden State Open Data Index (GSODI) 2023 report being released today by the New Jersey State Policy Lab, Rutgers University.[1] This brief article selects excerpts from the GSODI report and comments on the growing importance of the open data movement by presenting a brief introduction to the GSODI, and the role of data characteristics in driving the quality of data-dependent AI applications.

You can view the plain version of an early-stage release of the GSODI [INDEX]. The main GSODI portal is being developed and will be released in 2023. You can also download the Garden State Open Data Index (GSODI) 2023 report.

Open data presents strong opportunities for societal and economic advancement, and AI technologies possess tremendous value creation potential as attested by the billions of dollars of investments that serious AI startups have recently attracted. However, for maximum impact and full realization of benefits, it is necessary to synergize the powers of open data and AI. Adapting open data information ecosystems for seamless alignment with AI technologies will catalyze the development of new capabilities and expanded capacities for AI applications. Furthermore, any improvement in the quality of open data, such as bias-reduction and fairness in representativeness of data, can be expected to lead to improved quality and fairness of AI applications. Open data has been defined as being “data that is made freely available for open consumption, at no direct cost to the public, which can be efficiently located, filtered, downloaded, processed, shared, and reused without any significant restrictions on associated derivatives, use, and reuse.”[2][3] A broad definition of open data accommodates both public and private sources, and data-hosts which may host open data from multiple sources. Open data can be effectively used for the development and improvement of AIs: “When Open Data is used for new products or services, it can increase data demand – and drive the release of more datasets and improvements in data quality,” which could lead to iterative enhancement of the quality of AIs.[4] Artificial intelligences can only be as good as the data they are built upon.[5][6] There are AI technologies which use simulated data or have a relatively lower need for real-world data, but the majority of user-facing AI applications are dependent on large (preferably) quantities or ‘smart’ high quality data. AI is a “set of technologies that mimic the functions and expressions of human intelligence” and AIs can be designed with adaptive capabilities to learn from their own performance and the environment to provide optimal results.[7][8]

The Garden State Open Data Index (GSODI) 2023 report identifies concepts, strategies, principles, and policies to enhance the “availability, accessibility, usability and governance of open data. This is expected to lead to enhanced and accelerated public informatics driven insights, discoveries and value creation.” The GSODI is a mechanism that presents an ‘integrated view’ and rich metadata on the information ecosystem in New Jersey and is globally extensible. The GSODI report provides recommendations for ‘improving the effectiveness and efficiency associated with open data initiatives’ by integrating metadata information on ‘open-data portals and open-data datasets in a cohesive manner’ under a new portal which is expected to be launched in 2023. The GSODI is designed to support research, decision making, planning, and reporting efforts and is expected to lead to more efficient insights-generation for an array of constituents across academic, media, governance, professional, social, and political domains. The GSODI research report also provides policy recommendations which can guide the development of open data ecosystems to maximize support for AI systems and applications. The searchable GSODI portal is expected to ‘serve as a complementary and collaborative mechanism to existing open data infrastructure’ and does not intend to host datasets or in any way replace the many open data portals. Instead, it is expected to augment these open data portals and increase the findability of open data. Furthermore, the GSODI framework possesses a simple and flexible indexing framework and can therefore be scaled into a universal open data index to integrate global open data. Future research is expected to focus on scoring and ranking mechanisms along with improved scalability leading to enhanced capabilities for supporting AI research, development, and deployment.

You can view the plain version of an early-stage release of the GSODI [INDEX]  here. The main GSODI portal is being developed and will be released in 2023. You can also download the Garden State Open Data Index (GSODI) 2023 report.

Please email informatics@ejb.rutgers.edu to contribute to, support, provide feedback to or for additional information on GSODI, informatics and AI research.

Access MPI degree information here https://bloustein.rutgers.edu/graduate/public-informatics/mpi/

 Acknowledgement: This article first appeared in: https://policylab.rutgers.edu/report-release-garden-state-open-data-index/

Disclaimer: This exploratory index (GSODI) is provided for informational purposes only. Consequences of any action taken or omitted to be taken in reliance on it or utilizing the same for any purpose will be the sole responsibility of the decision maker/s themselves. Please read the limitations section of this report for additional details.

References:

  1. Samuel, J., Brennan, M., Pfeiffer, M., Andrews, C., Hale, M., Chidipothu, N., Anand, I., John, S., Parikh, R., Jain, P., Mannepalli, A., Negi, A., and Aslam, Z. (2023). Garden State Open Data Index for Public Informatics (GSODI): An Integrated View of New Jersey’s Open Information Ecosystem. RUCI Lab & New Jersey State Policy Lab research report – 2023, Rutgers University, New Brunswick, NJ, USA.
  2. Chidipothu, N., Mishra, S., John, S. and Samuel, J. Artificial Intelligence and open data for public good: Implications for public policy. (2022, October 24). Retrieved December 28, 2022, from https://policylab.rutgers.edu/artificial-intelligence-and-open-data-for-public-good-implications-for-public-policy/
  3. ODC, Open Data Charter. URL: https://opendatacharter.net/principles/
  4. ODT-Worldbank, 2023: https://opendatatoolkit.worldbank.org/en/essentials.html
  5. Von der Leyen, U. (2019). A Union that strives for more. My agenda for Europe. Political guidelines for the next European Commission2024 (2019), 13.
  6. Jain, A., Patel, H., Nagalapatti, L., Gupta, N., Mehta, S., Guttula, S., & Munigala, V. (2020, August). Overview and importance of data quality for machine learning tasks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (pp. 3561-3562).
  7. Samuel,J. (2021). A call for proactive policies for informatics and artificial intelligence technologies. Scholars Strategy Network. Url: https://scholars.org/contribution/call-proactive-policies-informatics-and
  8. Samuel, J., Kashyap, Yana Samuel, and Alexander Pelaez. Adaptive cognitive fit: Artificial intelligence augmented management of information facets and representations. International Journal of Information Management 65 (2022) 102505, https://doi.org/10.1016/j.ijinfomgt.2022.102505

Recent Posts

NJSPL: New Jersey Policy Priorities Survey Results

By Angie Nga Le Between October 7 and November 14, 2024, the New Jersey State Policy Lab conducted a brief survey to gain insights into emerging issues and policy priorities in New Jersey. The survey aimed to inform the Policy Lab’s strategic research planning,...

Dr. Grafova Examines Financial Hardships for Cancer Survivors

Household income and county income inequality are associated with financial hardship among cancer survivors in New Jersey Abstract Purpose To examine how household income and county income inequality are linked to financial hardship among cancer survivors. Methods...

Exploring Postsecondary Outcomes of Dual-Enrollment

Heldrich Report: Exploring Postsecondary Outcomes of Dual-Enrollment Participation in New Jersey A new study from the New Jersey Statewide Data System (NJSDS) explores the educational pathways of New Jersey high school graduates from 2014 and 2015 who participated in...

“Rutgers Then and Now:” A Discussion with the Authors

“Rutgers Then and Now”: A Discussion with Authors James W. Hughes and David Listokin As 2024 comes to a close and EJB Talks concludes another season, Stuart Shapiro discusses the new book by University Professor and Bloustein School Dean Emeritus James W. Hughes and...

NJSPL Report: Transportation Priorities for Camden County

By Carla Villacis, Kristin Curtis, Shaghayegh Poursabbagh, Oğuz Kaan Özalp, and Fawaz Al-Juaid Read Report The Senator Walter Rand Institute for Public Affairs at Rutgers-Camden (WRI) exists to conduct community-focused research that connects to the public policy and...

Upcoming Events

2025 Bloustein Alumni Awards Celebration

Zimmerli Art Museum at Rutgers University 71 Hamilton Street, New Brunswick, NJ, United States

Since 1994, the Bloustein School Alumni Association has aimed to present awards to accomplished alumni each year. Our goal is to pay tribute to alumni and friends to recognize their […]

RAISE 2025 – Our Future With AI: Utopian or Dystopian?

Gov. James J. Florio Special Events Forum, CSB 33 Livingston Avenue, New Brunswick, NJ, United States

Informatics - Data Science - AI Competition Step into the future of innovation! RAISE-25 will challenge you to unravel the scope of AI's impact on our lives and human society. […]