Garden State Open Data Index (GSODI)

July 14, 2023

Metadata for an Integrated View of New Jersey’s Open Information Ecosystem.

By Jim Samuel

Open data and artificial intelligence (AI) are vital for future value creation. The value of aligning open data with AI development and deployment requirements has been elaborated upon in the Garden State Open Data Index (GSODI) 2023 report being released today by the New Jersey State Policy Lab, Rutgers University.[1] This brief article selects excerpts from the GSODI report and comments on the growing importance of the open data movement by presenting a brief introduction to the GSODI, and the role of data characteristics in driving the quality of data-dependent AI applications.

You can view the plain version of an early-stage release of the GSODI [INDEX]. The main GSODI portal is being developed and will be released in 2023. You can also download the Garden State Open Data Index (GSODI) 2023 report.

Open data presents strong opportunities for societal and economic advancement, and AI technologies possess tremendous value creation potential as attested by the billions of dollars of investments that serious AI startups have recently attracted. However, for maximum impact and full realization of benefits, it is necessary to synergize the powers of open data and AI. Adapting open data information ecosystems for seamless alignment with AI technologies will catalyze the development of new capabilities and expanded capacities for AI applications. Furthermore, any improvement in the quality of open data, such as bias-reduction and fairness in representativeness of data, can be expected to lead to improved quality and fairness of AI applications. Open data has been defined as being “data that is made freely available for open consumption, at no direct cost to the public, which can be efficiently located, filtered, downloaded, processed, shared, and reused without any significant restrictions on associated derivatives, use, and reuse.”[2][3] A broad definition of open data accommodates both public and private sources, and data-hosts which may host open data from multiple sources. Open data can be effectively used for the development and improvement of AIs: “When Open Data is used for new products or services, it can increase data demand – and drive the release of more datasets and improvements in data quality,” which could lead to iterative enhancement of the quality of AIs.[4] Artificial intelligences can only be as good as the data they are built upon.[5][6] There are AI technologies which use simulated data or have a relatively lower need for real-world data, but the majority of user-facing AI applications are dependent on large (preferably) quantities or ‘smart’ high quality data. AI is a “set of technologies that mimic the functions and expressions of human intelligence” and AIs can be designed with adaptive capabilities to learn from their own performance and the environment to provide optimal results.[7][8]

The Garden State Open Data Index (GSODI) 2023 report identifies concepts, strategies, principles, and policies to enhance the “availability, accessibility, usability and governance of open data. This is expected to lead to enhanced and accelerated public informatics driven insights, discoveries and value creation.” The GSODI is a mechanism that presents an ‘integrated view’ and rich metadata on the information ecosystem in New Jersey and is globally extensible. The GSODI report provides recommendations for ‘improving the effectiveness and efficiency associated with open data initiatives’ by integrating metadata information on ‘open-data portals and open-data datasets in a cohesive manner’ under a new portal which is expected to be launched in 2023. The GSODI is designed to support research, decision making, planning, and reporting efforts and is expected to lead to more efficient insights-generation for an array of constituents across academic, media, governance, professional, social, and political domains. The GSODI research report also provides policy recommendations which can guide the development of open data ecosystems to maximize support for AI systems and applications. The searchable GSODI portal is expected to ‘serve as a complementary and collaborative mechanism to existing open data infrastructure’ and does not intend to host datasets or in any way replace the many open data portals. Instead, it is expected to augment these open data portals and increase the findability of open data. Furthermore, the GSODI framework possesses a simple and flexible indexing framework and can therefore be scaled into a universal open data index to integrate global open data. Future research is expected to focus on scoring and ranking mechanisms along with improved scalability leading to enhanced capabilities for supporting AI research, development, and deployment.

You can view the plain version of an early-stage release of the GSODI [INDEX]  here. The main GSODI portal is being developed and will be released in 2023. You can also download the Garden State Open Data Index (GSODI) 2023 report.

Please email informatics@ejb.rutgers.edu to contribute to, support, provide feedback to or for additional information on GSODI, informatics and AI research.

Access MPI degree information here https://bloustein.rutgers.edu/graduate/public-informatics/mpi/

 Acknowledgement: This article first appeared in: https://policylab.rutgers.edu/report-release-garden-state-open-data-index/

Disclaimer: This exploratory index (GSODI) is provided for informational purposes only. Consequences of any action taken or omitted to be taken in reliance on it or utilizing the same for any purpose will be the sole responsibility of the decision maker/s themselves. Please read the limitations section of this report for additional details.

References:

  1. Samuel, J., Brennan, M., Pfeiffer, M., Andrews, C., Hale, M., Chidipothu, N., Anand, I., John, S., Parikh, R., Jain, P., Mannepalli, A., Negi, A., and Aslam, Z. (2023). Garden State Open Data Index for Public Informatics (GSODI): An Integrated View of New Jersey’s Open Information Ecosystem. RUCI Lab & New Jersey State Policy Lab research report – 2023, Rutgers University, New Brunswick, NJ, USA.
  2. Chidipothu, N., Mishra, S., John, S. and Samuel, J. Artificial Intelligence and open data for public good: Implications for public policy. (2022, October 24). Retrieved December 28, 2022, from https://policylab.rutgers.edu/artificial-intelligence-and-open-data-for-public-good-implications-for-public-policy/
  3. ODC, Open Data Charter. URL: https://opendatacharter.net/principles/
  4. ODT-Worldbank, 2023: https://opendatatoolkit.worldbank.org/en/essentials.html
  5. Von der Leyen, U. (2019). A Union that strives for more. My agenda for Europe. Political guidelines for the next European Commission2024 (2019), 13.
  6. Jain, A., Patel, H., Nagalapatti, L., Gupta, N., Mehta, S., Guttula, S., & Munigala, V. (2020, August). Overview and importance of data quality for machine learning tasks. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (pp. 3561-3562).
  7. Samuel,J. (2021). A call for proactive policies for informatics and artificial intelligence technologies. Scholars Strategy Network. Url: https://scholars.org/contribution/call-proactive-policies-informatics-and
  8. Samuel, J., Kashyap, Yana Samuel, and Alexander Pelaez. Adaptive cognitive fit: Artificial intelligence augmented management of information facets and representations. International Journal of Information Management 65 (2022) 102505, https://doi.org/10.1016/j.ijinfomgt.2022.102505

Recent Posts

Risk Analysis Celebrates Distinguished Prof. Greenberg

Michael Greenberg: Master Synthesizer of Risk, Public Health, and Public Policy by Joanna Burger & Karen W. Lowrie Michael Greenberg is an extraordinary researcher, teacher, and pioneer who has combined his broad knowledge and expertise in environmental...

STEM Pathways are a Two-Way Street, Not a “Leaky Pipeline”

A new article in the Journal for STEM Education Research challenges the longstanding “leaky pipeline” narrative that has shaped U.S. education and workforce policy for decades. The article, “Reconceptualizing College STEM Pathways: Is ‘Leaving STEM’ the Problem?”, was...

NJSPL: New Jersey’s New E-Bike Laws – What Comes Next?

New Jersey’s New E-Bike Laws: Safety, Impact, and What Comes Next Leigh Ann Von Hagen & Gabrielle Cain In recent years, e-bikes have become an increasingly popular form of micromobility, which are small, lightweight transportation devices designed for short trips...

Heldrich: Aligning NJ’s AI Policy with Small Business Needs

Researchers at the Heldrich Center for Workforce Development, with funding from the New Jersey State Policy Lab, are currently engaged in a project to examine how New Jersey’s public Artificial Intelligence (AI) initiatives can better align with the evolving needs of...

EJB Talks: Planning, Policy, Politics, and the Path to Office

Planning, Policy, Politics, and the Path to Office with Assemblywoman Katie Brennan This week on EJB talks, Dean Stuart Shapiro talks to Bloustein alumnus Katie Brennan MCRP '12, now an Assemblywoman in New Jersey's 32nd District. Katie reflects on how her early...