Skip to content

Commit 17b3326

Browse files
author
Quarto GHA Workflow Runner
committed
Built site for gh-pages
1 parent 8a00e0a commit 17b3326

19 files changed

+60
-24
lines changed

.nojekyll

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1 +1 @@
1-
ae2762a5
1+
e6b5948d

_tex/index.tex

Lines changed: 29 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -203,7 +203,7 @@
203203
pdfcreator={LaTeX via pandoc}}
204204

205205
\title{Towards an open-source model for data and metadata standards}
206-
\author{Ariel Rokem \and Vani Mandava}
206+
\author{Ariel Rokem \and Vani Mandava \and Nicoleta Cristea}
207207
\date{}
208208

209209
\begin{document}
@@ -351,7 +351,34 @@ \subsection{High-energy physics (HEP)}\label{high-energy-physics-hep}
351351

352352
\subsection{Earth sciences}\label{earth-sciences}
353353

354-
XXX
354+
The need for geospatial data exchange between different systems began to
355+
be recognized in the 1970s and 1980s, but proprietary formats still
356+
dominated. Coordinated standardization efforts brought the Open
357+
Geospatial Consortium (OGC) establishment in the 1990s, a critical step
358+
towards open standards for geospatial data. The 1990s have also seen the
359+
development of key standards such as the Network Common Data Form
360+
(NetCDF) developed by the University Corporation for Atmospheric
361+
Research (UCAR) and the Hierarchical Data Format (HDF), a set of file
362+
formats (HDF4, HDF5) that are widely used, particularly in climate
363+
research. The GeoTIFF format, which originated at NASA in the late
364+
1990s, is extensively used to share image data. In the 1990s, open web
365+
mapping also began with MapServer (https://mapserver.org) and continued
366+
later with other projects such as OpenStreetMap (www.openstreetmap.org).
367+
The following two decades, the 2000s-2020s, brought an expansion of open
368+
standards and integration with web technologies developed by OGC, as
369+
well as other standards such as the Keyhole Markup Language (KML) for
370+
displaying geographic data in Earth browsers. Formats suitable for cloud
371+
computing also emerged, such as the Cloud Optimized GeoTIFF (COG),
372+
followed by Zarr and Apache Parquet for array and tabular data,
373+
respectively. In 2006, the Open Source Geospatial Foundation (OSGeo,
374+
https://www.osgeo.org) was established, demonstrating the community's
375+
commitment to the development of open-source geospatial technologies.
376+
While some standards have been developed in the industry (e.g., Keyhole
377+
Markup Language (KML) by Keyhole Inc., which Google later acquired),
378+
they later became international standards of the OGC, which now
379+
encompasses more than 450 commercial, governmental, nonprofit, and
380+
research organizations working together on the development and
381+
implementation of open standards (https://www.ogc.org).
355382

356383
\subsection{Neuroscience}\label{neuroscience}
357384

index.docx

733 Bytes
Binary file not shown.

index.html

Lines changed: 10 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -65,6 +65,7 @@
6565
<meta name="citation_title" content="Towards an open-source model for data and metadata standards">
6666
<meta name="citation_author" content="Ariel Rokem">
6767
<meta name="citation_author" content="Vani Mandava">
68+
<meta name="citation_author" content="Nicoleta Cristea">
6869
<meta name="citation_language" content="en">
6970
<meta name="citation_reference" content="citation_title=Without appropriate metadata, data-sharing mandates are pointless;,citation_abstract=Funders and investigators must demand appropriate metadata standards to take data from foul to FAIR. Funders and investigators must demand appropriate metadata standards to take data from foul to FAIR.;,citation_author=Mark A Musen;,citation_publication_date=2022-09;,citation_cover_date=2022-09;,citation_year=2022;,citation_issue=7926;,citation_volume=609;,citation_journal_title=Nature;,citation_publisher=Springer Science; Business Media LLC;">
7071
<meta name="citation_reference" content="citation_title=Zarr-developers/zarr-python: v3.0.0-alpha;,citation_author=Alistair Miles;,citation_author=undefined jakirkham;,citation_author=M Bussonnier;,citation_author=Josh Moore;,citation_author=Dimitri Papadopoulos Orfanos;,citation_author=Davis Bennett;,citation_author=David Stansby;,citation_author=Joe Hamman;,citation_author=James Bourbeau;,citation_author=Andrew Fulton;,citation_author=Gregory Lee;,citation_author=Ryan Abernathey;,citation_author=Norman Rzepka;,citation_author=Zain Patel;,citation_author=Mads R. B. Kristensen;,citation_author=Sanket Verma;,citation_author=Saransh Chopra;,citation_author=Matthew Rocklin;,citation_author=AWA BRANDON AWA;,citation_author=Max Jones;,citation_author=Martin Durant;,citation_author=Elliott Sales Andrade;,citation_author=Vincent Schut;,citation_author=undefined dussin;,citation_author=Shivank Chaudhary;,citation_author=Chris Barnes;,citation_author=Juan Nunez-Iglesias;,citation_author=undefined shikharsg;,citation_publication_date=2024-06;,citation_cover_date=2024-06;,citation_year=2024;,citation_fulltext_html_url=https://doi.org/10.5281/zenodo.11592827;,citation_doi=10.5281/zenodo.11592827;,citation_publisher=Zenodo;">
@@ -112,6 +113,14 @@ <h1 class="title">Towards an open-source model for data and metadata standards</
112113
</div>
113114
<div class="quarto-title-meta-contents">
114115
<p class="author">Vani Mandava <a href="https://orcid.org/0000-0003-3592-9453" class="quarto-title-author-orcid"> <img src=""></a></p>
116+
</div>
117+
<div class="quarto-title-meta-contents">
118+
<p class="affiliation">
119+
University of Washington
120+
</p>
121+
</div>
122+
<div class="quarto-title-meta-contents">
123+
<p class="author">Nicoleta Cristea <a href="https://orcid.org/0000-0002-9091-0280" class="quarto-title-author-orcid"> <img src=""></a></p>
115124
</div>
116125
<div class="quarto-title-meta-contents">
117126
<p class="affiliation">
@@ -226,7 +235,7 @@ <h2 data-number="3.2" class="anchored" data-anchor-id="high-energy-physics-hep">
226235
</section>
227236
<section id="earth-sciences" class="level2" data-number="3.3">
228237
<h2 data-number="3.3" class="anchored" data-anchor-id="earth-sciences"><span class="header-section-number">3.3</span> Earth sciences</h2>
229-
<p>XXX</p>
238+
<p>The need for geospatial data exchange between different systems began to be recognized in the 1970s and 1980s, but proprietary formats still dominated. Coordinated standardization efforts brought the Open Geospatial Consortium (OGC) establishment in the 1990s, a critical step towards open standards for geospatial data. The 1990s have also seen the development of key standards such as the Network Common Data Form (NetCDF) developed by the University Corporation for Atmospheric Research (UCAR) and the Hierarchical Data Format (HDF), a set of file formats (HDF4, HDF5) that are widely used, particularly in climate research. The GeoTIFF format, which originated at NASA in the late 1990s, is extensively used to share image data. In the 1990s, open web mapping also began with MapServer (https://mapserver.org) and continued later with other projects such as OpenStreetMap (www.openstreetmap.org). The following two decades, the 2000s-2020s, brought an expansion of open standards and integration with web technologies developed by OGC, as well as other standards such as the Keyhole Markup Language (KML) for displaying geographic data in Earth browsers. Formats suitable for cloud computing also emerged, such as the Cloud Optimized GeoTIFF (COG), followed by Zarr and Apache Parquet for array and tabular data, respectively. In 2006, the Open Source Geospatial Foundation (OSGeo, https://www.osgeo.org) was established, demonstrating the community’s commitment to the development of open-source geospatial technologies. While some standards have been developed in the industry (e.g., Keyhole Markup Language (KML) by Keyhole Inc., which Google later acquired), they later became international standards of the OGC, which now encompasses more than 450 commercial, governmental, nonprofit, and research organizations working together on the development and implementation of open standards (https://www.ogc.org).</p>
230239
</section>
231240
<section id="neuroscience" class="level2" data-number="3.4">
232241
<h2 data-number="3.4" class="anchored" data-anchor-id="neuroscience"><span class="header-section-number">3.4</span> Neuroscience</h2>

index.pdf

2.82 KB
Binary file not shown.

sections/01-introduction.embed.ipynb

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414
"\n",
1515
"Data and metadata standards that use tools and practices of OSS (“open-source standards” henceforth) reap many of the benefits that the OSS model has provided in the development of other technologies. The present report explores how OSS processes and tools have affected the development of data and metadata standards. The report will triangulate common features of a variety of use cases; it will identify some of the challenges and pitfalls of this mode of standards development, with a particular focus on cross-sector interactions; and it will make recommendations for future developments and policies that can help this mode of standards development thrive and reach its full potential."
1616
],
17-
"id": "dcc262d7-9156-4db6-be64-b8ef45cddf54"
17+
"id": "8d55f75f-fab6-44ca-b603-3269de038767"
1818
}
1919
],
2020
"nbformat": 4,

sections/01-introduction.out.ipynb

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@
1818
"\n",
1919
"Wilkinson, Mark D, Michel Dumontier, I Jsbrand Jan Aalbersberg, Gabrielle Appleton, Myles Axton, Arie Baak, Niklas Blomberg, et al. 2016. “The FAIR Guiding Principles for Scientific Data Management and Stewardship.” *Sci Data* 3 (March): 160018."
2020
],
21-
"id": "5f0f87de-bcf2-4c1c-bf0a-c54253dff641"
21+
"id": "2b171357-281b-489b-bf0d-87cb2addb305"
2222
}
2323
],
2424
"nbformat": 4,

sections/02-use-cases-preview.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -192,7 +192,7 @@ <h2 class="anchored" data-anchor-id="high-energy-physics-hep">High-energy physic
192192
</section>
193193
<section id="earth-sciences" class="level2">
194194
<h2 class="anchored" data-anchor-id="earth-sciences">Earth sciences</h2>
195-
<p>XXX</p>
195+
<p>The need for geospatial data exchange between different systems began to be recognized in the 1970s and 1980s, but proprietary formats still dominated. Coordinated standardization efforts brought the Open Geospatial Consortium (OGC) establishment in the 1990s, a critical step towards open standards for geospatial data. The 1990s have also seen the development of key standards such as the Network Common Data Form (NetCDF) developed by the University Corporation for Atmospheric Research (UCAR) and the Hierarchical Data Format (HDF), a set of file formats (HDF4, HDF5) that are widely used, particularly in climate research. The GeoTIFF format, which originated at NASA in the late 1990s, is extensively used to share image data. In the 1990s, open web mapping also began with MapServer (https://mapserver.org) and continued later with other projects such as OpenStreetMap (www.openstreetmap.org). The following two decades, the 2000s-2020s, brought an expansion of open standards and integration with web technologies developed by OGC, as well as other standards such as the Keyhole Markup Language (KML) for displaying geographic data in Earth browsers. Formats suitable for cloud computing also emerged, such as the Cloud Optimized GeoTIFF (COG), followed by Zarr and Apache Parquet for array and tabular data, respectively. In 2006, the Open Source Geospatial Foundation (OSGeo, https://www.osgeo.org) was established, demonstrating the community’s commitment to the development of open-source geospatial technologies. While some standards have been developed in the industry (e.g., Keyhole Markup Language (KML) by Keyhole Inc., which Google later acquired), they later became international standards of the OGC, which now encompasses more than 450 commercial, governmental, nonprofit, and research organizations working together on the development and implementation of open standards (https://www.ogc.org).</p>
196196
</section>
197197
<section id="neuroscience" class="level2">
198198
<h2 class="anchored" data-anchor-id="neuroscience">Neuroscience</h2>

sections/02-use-cases.embed.ipynb

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@
2020
"\n",
2121
"## Earth sciences\n",
2222
"\n",
23-
"XXX\n",
23+
"The need for geospatial data exchange between different systems began to be recognized in the 1970s and 1980s, but proprietary formats still dominated. Coordinated standardization efforts brought the Open Geospatial Consortium (OGC) establishment in the 1990s, a critical step towards open standards for geospatial data. The 1990s have also seen the development of key standards such as the Network Common Data Form (NetCDF) developed by the University Corporation for Atmospheric Research (UCAR) and the Hierarchical Data Format (HDF), a set of file formats (HDF4, HDF5) that are widely used, particularly in climate research. The GeoTIFF format, which originated at NASA in the late 1990s, is extensively used to share image data. In the 1990s, open web mapping also began with MapServer (https://mapserver.org) and continued later with other projects such as OpenStreetMap (www.openstreetmap.org). The following two decades, the 2000s-2020s, brought an expansion of open standards and integration with web technologies developed by OGC, as well as other standards such as the Keyhole Markup Language (KML) for displaying geographic data in Earth browsers. Formats suitable for cloud computing also emerged, such as the Cloud Optimized GeoTIFF (COG), followed by Zarr and Apache Parquet for array and tabular data, respectively. In 2006, the Open Source Geospatial Foundation (OSGeo, https://www.osgeo.org) was established, demonstrating the community’s commitment to the development of open-source geospatial technologies. While some standards have been developed in the industry (e.g., Keyhole Markup Language (KML) by Keyhole Inc., which Google later acquired), they later became international standards of the OGC, which now encompasses more than 450 commercial, governmental, nonprofit, and research organizations working together on the development and implementation of open standards (https://www.ogc.org).\n",
2424
"\n",
2525
"## Neuroscience\n",
2626
"\n",
@@ -30,7 +30,7 @@
3030
"\n",
3131
"Another interesting use case for open-source standards is community/citizen science. This approach, which has grown in the last 20 years, has many benefits for both the research field that harnesses the energy of non-scientist members of the community to engage with scientific data, as well as to the community members themselves who can draw both knowledge and pride in their participation in the scientific endeavor. It is also recognized that unique broader benefits are accrued from this mode of scientific research, through the inclusion of perspectives and data that would not otherwise be included. To make data accessible to community scientists, and to make the data collected by community scientists accessible to professional scientists, it needs to be provided in a manner that can be created and accessed without specialized instruments or specialized knowledge. Here, standards are needed to facilitate interactions between an in-group of expert researchers who generate and curate data and a broader set of out-group enthusiasts who would like to make meaningful contributions to the science. This creates a particularly stringent constraint on transparency and simplicity of standards. Creating these standards in a manner that addresses these unique constraints can benefit from OSS tools, with the caveat that some of these tools require additional expertise. For example, if the standard is developed using git/GitHub for versioning, this would require learning the complex and obscure technical aspects of these system that are far from easy to adopt, even for many professional scientists."
3232
],
33-
"id": "1e9a8160-9b6c-4f0d-818f-35fa3159b74c"
33+
"id": "44191b8a-4483-4fe8-bdb2-bdb58c0f2311"
3434
}
3535
],
3636
"nbformat": 4,

0 commit comments

Comments
 (0)