Web site generator for the Space-Time Theories Exploratorium, a static file web site. Content is managed and provided by a simple database and a template system based on Markdown files with a bespoke internal structure. After cloning, the whole web site can be generated automatically by running just one command.
The following instructions are for a Linux Debian 12.5 (Bookworm) installation. For other OSes and/or Linux distributions, YMMV.
Install the required packages using this command:
```sh
sudo apt install openjdk-17-jre-headless openjdk-17-jre libjson-perl \
libtemplate-perl libdbi-perl libdbd-sqlite3-perl libdigest-md5-perl \
libinline-java-perl pandoc sqlite3 pgformatter rsync openssh-client \
inkscape imagemagick-6.q16 node-http-server \
libspreadsheet-read-perl libspreadsheet-readsxc-perl csvkit
```
For macOS, install the equivalent packages using Homebrew.
Install SQLiteStudio for user-facing database editing and manipulation, or any alternative with which you feel comfortable.
Check deploy.sh usage for the configuration requirements for deployment.
In the near future, PostgreSQL will be supported as an alternative to SQLite, so in some parts of the documentation we may refer to the database platform in an abstract fashion or mention PostgreSQL as a side note.
- All shell scripts have been written specifically for Bash.
- If you wish to execute Conexp in stand-alone mode, as an application, you will need to provide a JavaFX implementation to Conexp, either by installing a legacy version of Java (prior to Java 11) or by installing OpenJFX.
- `node-http-server` is in the package list as a means to load the web site locally for development. If you do not wish to install its dependencies, you can instead use `micro-httpd` or `mini-httpd` and edit `scripts/commands/http-server.command` accordingly.
- `inkscape` and `imagemagick-6.q16` are required to process and generate the gravity apple tree. The part executed by Inkscape is a typeface-sensitive operation and should only be done on the same architecture/machine as the author's; otherwise the geometries will come out wrong and the graph will be malformed. There is no need to install these if the gravity tree will not be regenerated.
- `libspreadsheet-read-perl` and `libspreadsheet-readsxc-perl` are required by `xlsx2csv.pl` as part of the optional ODS-to-sqlite import workflow. If you are not going to perform these imports, you don't need these packages.
The following is a graph of the main workflow. Trapezoids represent files found in this git repo, most of which are input files. Skewed boxes represent files resulting from the output of a process. Rounded boxes represent user-facing applications that manipulate data. Normal gray boxes represent a processor. Text in italics indicates the location of a file or of the processor script/executable.
```mermaid
flowchart TD
SQL[\SQL Files<br/><i>db/*.sql/] <--> DBP
subgraph DBP[<b>Database Processor]
DBI[DB Init<br/><i>scripts/db_init.sh]
DBD[DB Dumper<br/><i>scripts/db_dump.sh]
end
DBP <--> DB[(Database<br/><i>PostgreSQL or<br/>SQLite: db/exploratorium.db)]
DB --> FCA
DBC([DB Editor<br/><i>SQLite Studio<br/> or PgModeler]) <--> DB
subgraph TTP[<b>Template Processor]
TTGen[Generator<br/><i>scripts/gen_pages.sh]
MD[\Markdown Files<br/><i>tt/*.md/] -- Web Content --> TTGen
TT[\Template Files<br/><i>tt/*.tt/] -- HTML Structure --> TTGen
end
subgraph FCA[<b>FCA Processor]
direction TB
subgraph FG[Generator]
G[Lattice JSON Generator<br/><i>scripts/gen_lattices.sh]
G2[CSV Catalogs Generator<br/><i>scripts/gen_diagram_catalogs.sh]
end
CX[Conexp<br><i>bin/conexp-1.3/*.jar]
G -- contexts --> CX
CX -- lattices --> G
end
subgraph SITE[<b>Web Site]
direction LR
ED([Diagram Viewer / Editor])
subgraph DAT[Generated Data Files]
LJ[/"Lattice JSON<br/><i>site/theories/{en,es}/*/lattice.json</i>"/]
CSV[/"CSV Catalogs<br/><i>site/theories/{en,es}/*/*.csv</i>"/]
LPJ[/"Node Positions JSON<br/><i>site/theories/{en,es}/*/pos.json</i>"/]
end
subgraph LIB[Javascript Libraries]
D3[\d3<br/><i>site/lib/d3/]
MJ[\MathJax<br/><i>site/lib/MathJax/]
JQ[\jQuery<br/><i>site/lib/jQuery/]
BS[\Bootstrap<br/><i>site/lib/bootstrap/]
end
subgraph RES[Resources]
HTML[/HTML docs<br/><i>site/*.html/]
CSS[\CSS<br/><i>site/style.css/]
I[\Images<br/><i>site/img/*/]
JS[\Javascript<br/><i>site/lib/*.js/]
end
ED <---> LPJ
end
FCA -- Lattice<br/>JSON & CSV --> SITE
TTP -- HTML docs --> SITE
SITE --> DEP
DEP[<b>Deployer<br/><i>scripts/deploy.sh]
DEP -- Web Site,<br/>ssh + rsync --> SRV[<b>Web Server]
```
The Exploratorium comes with a series of scripts that perform the different transformation stages of the workflow. They are located in the scripts directory, inside of which there is a commands directory containing the most common operations; these are meant to be executed from a file manager by double-clicking. (This functionality has been tested on macOS Finder; it does not work with GNOME Files v.43, where the best you can do is select an option by right-clicking the .command file or install and configure something like the Command Menu extension, so YMMV.)
All scripts auto-discover the location of the project directory by assuming it to be the parent of the program's location, and they are working-directory agnostic, meaning they can be executed from any location and they will always act by default on the project of which they are a part (unless `SCRIPTDIR` or `PROJECTDIR` is overridden, see the following section).
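In practice, the pattern looks roughly like this minimal sketch (illustrative only; the actual scripts may differ in detail):

```sh
#!/bin/bash
# Honor overrides, otherwise derive locations from the script's own path.
SCRIPTDIR=${SCRIPTDIR:-$(dirname "$0")}
PROJECTDIR=${PROJECTDIR:-$SCRIPTDIR/..}
# Act on the owning project regardless of the caller's working directory.
cd "$PROJECTDIR" || exit 1
```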
The following environment variables can be set to customize the transformation process:
| Name | Default | Purpose |
|---|---|---|
| `CONFIGFILE` | `$SCRIPTDIR/config.sh` | Configuration override by means of a sourced shell script. If the default config file is not found, it is not sourced |
| `SCRIPTDIR` | `$(dirname "$0")` | Location of the script being invoked |
| `PROJECTDIR` | `$SCRIPTDIR/..` | Location of the project's main directory |
| `DBDIR` | `$PROJECTDIR/db` | Location of the project's files related to the database, such as initialization SQL and data dumps, and in the case of SQLite, the database binary file |
| `SITEDIR` | `$PROJECTDIR/site` | Location of the web site's files. This is the DocumentRoot for a local web server and what is deployed to production |
| `DIAGRAMSUBDIR` | `theories` | If `DIAGRAMDIR` is not overridden, sets the sub-directory under `$SITEDIR` where diagram files will be put |
| `DIAGRAMDIR` | `$SITEDIR/$DIAGRAMSUBDIR/%s/%s` | Location where the diagram files will be put. The first `%s` is replaced by the language code, the second by the context code |
| `DIAGRAM_FILTERS` | (no value) | Filters (grep regexps) separated by spaces selecting which diagrams will be worked on. No value selects all of them. See gen_lattices.sh |
| `DEFAULT_DBDSN` | `$DBDIR/exploratorium.db` | Data store name for the database connection. DBI notation, or just the file name of an SQLite database file |
| `MASTER_NAME` | `master-%s.md` | Name scheme for the master Markdown files found in the tt directory. `%s` is replaced by the language code (i.e. `en` or `es`) |
| `DEPLOY_HOST` | `remo` | SSH host to which deployment connects to transfer the files of the web site. See deploy.sh |
| `DEPLOY_REMOTEDIR` | `Exploratorium` | Path inside the deployment host where the web site files will reside. See deploy.sh |
`DEFAULT_DBDSN` can point to a file, which selects SQLite as the database engine. DBI notation is supported for future-proofing when the string is prefixed with `DBI:`, as per usual. If credentials are needed, an external method such as pgpass is recommended. For programs that require a connection string (a DSN) as a parameter, `DEFAULT_DBDSN` will be taken if none is provided.
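For example, both of the following are valid settings (the PostgreSQL DSN is hypothetical, anticipating the future PostgreSQL support):

```sh
# SQLite: a plain file path selects the SQLite engine
DEFAULT_DBDSN=$DBDIR/exploratorium.db
# PostgreSQL: standard DBI notation, prefixed with DBI:
DEFAULT_DBDSN="DBI:Pg:dbname=exploratorium;host=localhost"
```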
If a configuration file is found where `CONFIGFILE` points, it will be sourced and the values of the environment variables set there will override the default values. Only when `CONFIGFILE` is using its default value will the sourcing be omitted if the file is not found; otherwise, an error will be produced.
ℹ️ Note: the configuration file is sourced twice: once before, and once after the configuration globals are set. Complex commands within this file are discouraged.
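The sourcing logic is roughly equivalent to the following sketch (illustrative, not the literal code):

```sh
# Source the override file; a missing file is only tolerated at the default location.
if [ -f "$CONFIGFILE" ]; then
    . "$CONFIGFILE"
elif [ "$CONFIGFILE" != "$SCRIPTDIR/config.sh" ]; then
    echo "Configuration file not found: $CONFIGFILE" >&2
    exit 1
fi
```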
Example `scripts/config.sh` file:

```sh
SITEDIR=$PROJECTDIR/test-site
DIAGRAMSUBDIR=cosmo
```
This will make `SITEDIR` derive its location from `PROJECTDIR`'s default value, and make `DIAGRAMDIR` be `$SITEDIR/cosmo`. All other variables will take their normal default values. Since the configuration file is in the default location, the overrides will be applied automatically.
Of course, you can also set or export these variables at the shell level for a one-off case; this is the only reliable way to indicate an alternate configuration file location:
```sh
$ CONFIGFILE=/tmp/new-config.sh scripts/init.sh
```
or if you are using a configuration file for a whole session:
```sh
$ export CONFIGFILE=$HOME/.my-exploratorium-config.sh
$ scripts/gen_diagram_catalogs.sh
$ scripts/gen_lattices.sh
```
The whole of the project is meant to be totally or partially rebuilt using just one command without user intervention. To understand the execution chain, examine the following dependency tree:
```mermaid
flowchart TD
deploy.sh([deploy.sh])
init.sh([init.sh]) --> db_init.sh([db_init.sh])
init.sh ---> build.sh([build.sh])
build.sh --> gen_lattices.sh
build.sh --> gen_pages.sh
build.sh --> gen_diagram_catalogs.sh
db_dump.sh([db_dump.sh]) ---> db_dump_ddl.sh
db_dump.sh ---> db_dump_data.sh
gen_lattice.sh --> gen_lattice.pl
gen_lattices.sh --> db_dump_data.sh
gen_lattices.sh --> gen_lattice.sh
gen_pages.sh --> gen_pages.pl
gen_pages.sh --> gen_gravitytree.sh
```
All of the above programs reside in the scripts directory. Scripts in rounded boxes are top-level programs intended to be invoked by users.
These are the most important programs to run while working on the project. All of the other programs that these two invoke perform specific steps of the whole processing, but calling them directly is not really much more productive. As an end user, most of your attention should go to these two programs, and the documentation of the other, lower-level programs should be regarded as information for development purposes or as an aid to isolate execution failures.
This will take a project that has just been cloned from git, initialize the database and then generate the whole web site. It can be useful if changes to the database or other sources have just been pulled from git and we want to regenerate the whole thing.
This program will force the regeneration of all of the lattice files, regardless of changes in the database. Otherwise, it's the equivalent of just calling db_init.sh and then build.sh, which is what is done when working in the database workflow.
ℹ️ Note: since this program invokes db_init.sh, it should not be called if there are changes to the database that have not been dumped back to SQL statements, or these changes will be reverted to what the SQL scripts indicate (although a backup is produced to mitigate the damage in case this happens by accident). If there are pending changes on the database and you want to see how these changes look on the web page, use build.sh instead.
Invocation:

```sh
init.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
Assume the database is up and running and regenerate the whole web site: catalogs, lattices and web pages.
Invocation:

```sh
build.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
Also, you can double-click or run scripts/commands/rebuild.command from your file browser if that is an available option.
This program is used to upload all of `SITEDIR` to the production web host (remo, or the value of `DEPLOY_HOST`). Your `$HOME/.ssh/config` has to be correctly configured to perform the transmission, which is done using rsync over an ssh connection for performance: only those files that are detected as changed are uploaded to the server.
Here is an example of an ssh_config section that would make this work, assuming that the relevant asymmetric keys have been set up correctly for the default `DEPLOY_HOST`:
```
Host remo
    Hostname remo.cua.uam.mx
    User remousr
    IdentityFile ~/.ssh/remo.rsa
```
Check any of the many Internet tutorials on setting up passwordless ssh authentication through asymmetric keys. Here's one from the University of Indiana on how to do it in 9 easy steps.
In some cases you may have to discuss ssh access and allowed authentication methods with your production host administrator. If a non-interactive access method is not arranged, you may have to provide a password every time you deploy.
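As a quick reference, key-based access for the example host above can usually be arranged with the standard OpenSSH tools (the key path and user are taken from the example; adapt them to your environment):

```sh
# Generate a key pair and install the public half on the server
ssh-keygen -t rsa -b 4096 -f ~/.ssh/remo.rsa
ssh-copy-id -i ~/.ssh/remo.rsa.pub remousr@remo.cua.uam.mx
# Verify that non-interactive login works
ssh remo true && echo "passwordless access OK"
```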
Invocation:

```sh
deploy.sh
```

- This program takes no arguments. Configuration is done through the `DEPLOY_HOST` and `DEPLOY_REMOTEDIR` environment variables.
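Conceptually, the transfer amounts to something like the following one-liner (a sketch; check deploy.sh for the exact flags used):

```sh
rsync -az --delete -e ssh "$SITEDIR"/ "$DEPLOY_HOST:$DEPLOY_REMOTEDIR"/
```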
The chosen format for the database schema and data control in the git repo is SQL statements. After the repository is cloned from git, the database is not ready to be worked on: it must be restored from the SQL scripts.
Conversely, a dump should be performed to convert the database to plain-text SQL statement scripts for changes to be committed. SQL output from the dump scripts is what is to be committed to the repository, where it is easier to keep track of the changes, instead of pushing binary files, which are unnatural for git to manage.
Here is a state diagram with the expected steps relative to working on changes to the database:
```mermaid
stateDiagram
direction TB
[*] --> db_init.sh
note right of db_init.sh
Regenerate DB
from SQL files
if necessary
end note
edit : Edit Database<br/>(SQLite Studio)
db_init.sh --> edit
edit --> build.sh
test : Test Local Website
note right of test
Run scripts/commands/http-server.command
to start a local web server for testing
end note
build.sh --> test
test --> edit
test --> db_dump.sh
gitcommit : git commit
db_dump.sh --> gitcommit
note left of db_dump.sh
Satisfied with changes,
convert back to SQL
end note
gitpush : git push
gitcommit --> gitpush
gitcommit --> outside
outside : Changes Pushed?<br/>(collaborator)
gitpush --> outside
gitpull : git pull
state if_outside <<choice>>
outside --> if_outside
if_outside --> gitpull : yes
if_outside --> edit : no
gitpull --> db_init.sh
```
- db_init.sh should only be run if the database is not in place or if the data in the SQL files supersedes what is in the database. If there is work in the database pending to be dumped to SQL, this step should be skipped.
- Proceed to edit the database. In the future, there should be a CRUD application so that the database is not edited "by hand". For SQLite-based projects, we are using SQLiteStudio.
- Run build.sh (or double-click rebuild.command) to regenerate the diagram CSV catalogs and/or the lattice JSON files as needed.
- Open the local website with your web browser and check the results. If the local web server is not up, first run http-server.command and leave it running. The local server will show the access URL on the terminal, which should typically be http://127.0.0.1:8080/.
- Review your changes and go back to the database editor if more work needs to be done.
- If you are satisfied with the results, it's time to commit the changes to the git repository. Since the database is not directly stored on git and we are instead using SQL dumps for source control, run db_dump.sh to export the database with the new data onto the SQL files.
- Proceed to commit and optionally push to git.
- If someone else has pushed changes to the SQL files, you will need to do a git pull, review the changes and then regenerate the database by running db_init.sh once again. This should not be a problem since you have already exported your own changes to the SQL files and reviewed any changes brought from the git pull.
Take the SQL statements stored in `$DBDIR/ddl.sql` (SQL statements to build the database schema) and `$DBDIR/data.sql` (data dumped as SQL insert statements) and use them to set up a new database.
In the case of SQLite, if the database file already exists, it is forcefully renamed to its original name plus `.bak` as an additional suffix, to avoid an irrecoverable loss of information in case there are uncommitted changes in the binary that haven't been dumped.
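For the SQLite case, the behavior is roughly equivalent to this sketch:

```sh
DB=$DBDIR/exploratorium.db
# Keep a backup of a possibly-undumped database before rebuilding it
[ -f "$DB" ] && mv -f "$DB" "$DB.bak"
sqlite3 "$DB" < "$DBDIR/ddl.sql"   # restore the schema
sqlite3 "$DB" < "$DBDIR/data.sql"  # restore the data
```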
For server-based DBMSes, a dump may be performed and then the database dropped and recreated if possible, but this has its own caveats, as many databases cannot be dropped while any client is connected to them.
Invocation:

```sh
db_init.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
Dump both the schema of the database and its data and store them as SQL statements in `$DBDIR/ddl.sql` and `$DBDIR/data.sql` respectively.
Invocation:

```sh
db_dump.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
Perform a dump of the schema of the database and send it to standard output. `pg_format` is used to beautify the output.
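For an SQLite database, the operation is conceptually similar to:

```sh
sqlite3 "$DBDIR/exploratorium.db" .schema | pg_format
```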
Invocation:

```sh
db_dump_ddl.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
Dump the data from all of the tables and send them as INSERT SQL statements to standard output.
Invocation:

```sh
db_dump_data.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
From the content creator's perspective, to render the web site's pages and diagrams we have two main inputs: master Markdown files and the database. For the webmaster, the template files affect the markup and structure of the pages and of course there are CSS files, images and Javascript sources to maintain.
The content creator needs only to invoke build.sh to regenerate the web pages automatically after work has been done either on the database or the master markdowns.
Here is a state diagram with the expected steps for working on web content:
```mermaid
stateDiagram
direction TB
[*] --> build.sh
build.sh --> test
test : View/Reload Local Website
edit : Edit Master<br/>Markdowns
edit --> build.sh
test --> edit
test --> editl
editl : Edit geometries with<br/>Diagram Editor
editl --> copypos
copypos : Copy/Paste geometries to<br/>pos.json and/or config.js
copypos --> test
test --> gitcommit
gitcommit : git commit
note left of gitcommit
Having completed a change,
send to git before proceeding
to another change
end note
gitpush : git push
gitcommit --> gitpush
gitcommit --> outside
outside : Changes Pushed?<br/>(collaborator)
gitpush --> outside
gitpull : git pull
state if_outside <<choice>>
outside --> if_outside
if_outside --> gitpull : yes
if_outside --> build.sh : no
gitpull --> build.sh
```
- After running build.sh, open the local website with your web browser and check the results. If the local web server is not up, first run http-server.command and leave it running. The local server will show the access URL on the terminal, which should typically be http://127.0.0.1:8080/.
From here you can proceed to either:

- Work on the content of the site:
  - Using a text editor, edit the master Markdown file for the desired language. A text editor capable of validating and previewing Markdown formatting is recommended.

or

- Visit a diagram page to enter editor mode and work on the positioning of the diagram nodes:
  - The editor button will only be available if the page is visited through address 127.0.0.1.
  - After altering the node positions, proceed to generate the positional JSON code, select the "Copy" button and, using a text editor, replace the content of the corresponding `pos.json` file with what has been copied.
  - After modifying the size and offset of the diagram, copy the corresponding value, open the corresponding `config.js` with a text editor and replace the value of the `VIEWBOX` parameter.
- Run build.sh and review your changes.
- If you are satisfied with the results, it's time to commit the changes to the git repository and optionally git push to upload the changes to the upstream repository (github).
- If someone else has pushed changes to the upstream repo, you will need to do a git pull, review the changes and then regenerate the site by running build.sh once again.
The web pages for the site are generated using Markdown-formatted plain text files that follow a bespoke document structure. Because the individual pages behind the project are all relatively small, most of them containing two or three paragraphs each, it is easier for the content creator to edit just one file where all of the content is deposited.
Currently, only two languages are supported, which at the moment are used for English and Spanish, but they could be any two languages. Support for more languages would entail easy modifications to the template system and the UI: make the language selector a pop-up menu, and let the per-page `navlang` config variable be a JSON object, instead of the current situation where the language selector is just a link that toggles the language and `navlang`'s value is a string with the URL of the current page's alternate version in the "other" language.
Languages are enumerated at the database level in the lang table. Each language has a code, which is relevant at the file level. The system allows for a configuration with only one language, and the database does not limit the number of languages there can be: it's just a matter of making the template system accommodate more than two of them.
Master Markdown files are located inside the `tt` directory, alongside the template files that generate the HTML markup. The `MASTER_NAME` variable at the script configuration level determines the actual name of the master files, which on the default configuration and with two languages (English or `en` and Spanish or `es`) resolves to `tt/master-en.md` and `tt/master-es.md` respectively.
This is a guide on how to follow the bespoke structure of the master Markdown files. Even though these files follow the Markdown format, you cannot just write your text freely: you have to follow a defined structure, explained here. Also, the file is not consumed using typical Markdown-to-HTML converters, so every Markdown element type must be explicitly supported (for example, table rendering is not currently supported).
Knowledge of the basic structure and syntax of Markdown files is absolutely essential for the file to pass validation with the Markdown parser (pandoc) that is invoked by the template system, and to avoid confusing the parser with semi-correct formatting. The Markdown Guide is a recommended resource if you are not familiar with Markdown.
Also, the use of an editor that supports Markdown syntax highlighting and live preview is highly recommended. You can even use GitHub's github.dev editor, which has support for both these things and is a web page that does not require installation.
Set up a level-1 heading at the top of the file, preferably using the ornamental notation with the title on one line underlined with equal signs:
```md
Exploratorium de teorías del espacio-tiempo
===========================================
```
This will be the actual title (`<h1>`) of the Home Page, and also the text for the `<title>`. For pages other than the Home Page, this text will be used first, followed by a colon and the name of the page in question. So, a main title of:

```md
Mi Proyecto
===========
```

and a page titled `Diagrama 1` will render `<title>Mi Proyecto: Diagrama 1</title>` for that page. This markup is part of the `<head>` metadata of the page, and is used for bookmarks, Google search results, browser window names and so on.
After the title, a `json` block with the `config` class is to be created, with a JSON object that stores a series of key-value pairs for configuration purposes:
```{ .json .config }
{
"lang": "es",
"strings": {
"Close info box": "Cerrar caja de info",
"Zoom In": "Acercar"
}
}
```
Two keys are currently supported:

- `lang` is the language code for the content currently displayed. This is an important value that will be passed to the Javascript code so it can become aware of the language used for the page.
- `strings` is an optional key, whose value is another object mapping original strings found on the templates to their translations in the language of the master file's content. In the example, since the content is declared to be Spanish and the original templates' translatable strings are written in English, a mapping is necessary. Consequently, no `strings` mapping is necessary for English, although it would be if the templates' translatable strings were written in Spanish and the example's language were English. If the translated version of a string is required within a template and it is not made available in this mapping, the template processor will issue a warning so that the string is provided or some mismatching key-value pair is fixed.
Additionally, each individual page also has its own configuration block where per-page parameters are stored.
Create a level-2 section titled `Navigation` (no matter the master file's language, the following section names are fixed strings). Inside, there must be an unordered list of links that will form the main items of the menu:
```md
## Navigation

- [Inicio](index-es.html)
- [Explorando](nav_exploring_menu)
- [Diagramas](nav_diagrams_menu)
- [REMO](/)
- [English](navlang)
```
This menu is rendered at the top of every page as a series of links or sub-menus with which the user can interact to navigate the site. The links that define menu items are URLs: they can point to any URL on the web. In the examples posed, relative URLs were used, so they look like special-purpose locations, but they are not: they just happen to point to the internal directory structure of the rendered site.
There are two special cases here, though:
If the URL of a menu item's link is `navlang`, the item will point to the current page's declared alternate language version. Compare the `Navigation` section of both the English and Spanish master files to understand how this setup makes the menu item work as a toggle.
If the URL of a menu item's link starts with the prefix `nav_`, a corresponding level-2 section will be looked up and used as a source to render a submenu. These navigation submenus follow the same format as `Navigation`:
```md
## Diagrams Menu

- [Protofísica](theories/es/protophysics)
- [Dinámica Analítica](theories/es/analytical-dynamics)
- [Teorías Básicas](theories/es/basic)
- [Manzano de la Gravitación](gravity-tree)
```
A level-2 section titled `Author` containing a single paragraph used as a subtitle for the Home Page. This subtitle used to be present on every page, but it was later restricted to only the home page, hence the existence of this section.
A level-2 section titled `Footer` with two sub-sections (level-3), for the Acknowledgements and the Copyright legend. This appears at the bottom of all pages using a small font:
```md
## Footer

### Acknowledgements

Este proyecto ha sido posible gracias al apoyo del Seminario de
Representación y Modelización del Conocimiento Científico de la
Universidad Autónoma Metropolitana - Unidad Cuajimalpa, así como de la
Coordinación de Innovación y el Departamento de Cómputo. Un
agradecimiento a Mario Casanueva (UAM-C), Diego Méndez (UAM-C), Sergio
Mendoza (IA-UNAM), Joaquín Hernández (UAM-C), y en especial a Arturo
Espinosa por toda su ayuda y apoyo.

### Copyright

Copyright © 2017-2024, Mariana Espinosa Aldama
```
A level-2 section titled `Pages`, where each (level-3) subsection represents a page within the site. Here is an example using extracts from the Spanish master file illustrating three pages: the `Home` page, a second one titled `Guía de exploración` and a third one titled `Protofísica`:
## Pages
### Home
```{ .json .config }
{
"template": "index.tt",
"output": "index-es.html",
"navlang": "index.html"
}
```
En el Exploratorium, desentrañamos las
intrincadas estructuras semánticas de las teorías físicas. Nuestra misión gira
en torno a la creación de visualizaciones interactivas de redes
hipervinculadas: macroscopios diseñados para explorar los cimientos de estas
teorías.
#### Descubre diez macroscopios
Adéntrate en el reino de las teorías físicas y espacio-temporales con
nuestras visualizaciones interactivas. Hemos disectado diversas
clasificaciones conjuntistas realizadas por reconocidos filósofos de
la ciencia y conformado una base de datos que relaciona modelos y sus
atributos. Hemos analizado contextos formales para extraer conceptos
formales y visualizar redes jerárquicas. Los nodos representan
conceptos, mientras que los enlaces ilustran relaciones jerárquicas de
inclusión en redes interactivas que despliegan información extra.
### Guía de exploración
```{ .json .config }
{
"template": "index.tt",
"output": "guidance-es.html",
"navlang": "guidance.html"
}
```
#### ¡Explora la estructura de las teorías físicas a través de redes conceptuales interactivas!
#### Qué hacer:
- Explora diez macroscopios con más de 50 teorías del
espacio-tiempo y la gravitación.
- Conoce la metodología del Análisis de Conceptos Formales (FCA).
- Aprende la lógica de las redes jerárquicas.
### Protofísica
```{ .json .config }
{
"template": "diagram-page.tt",
"output": "theories/es/protophysics/index.html",
"navlang": "theories/en/protophysics/index.html"
}
```
Las teorías físicas tienen como fundamento diversas teorías
matemáticas, la lógica, la semántica y una serie de presupuestos y
teorías muy generales que en *Foundations of Physics* (1967) Mario
Bunge divide en tres grupos: la base formal, la base material a la que
llama *Protofísica*, y ciertos principios "zerológicos".
The page called `Home` is special, as it takes the title from the master file's main title and uses it again as an `<h1>` heading at the top of its content. It also then puts the `Author` section's paragraph as a subtitle below it. Otherwise, it is processed like any of the other pages.
Each level-3 section inside the `## Pages` section represents a page. The page's title is derived from the main title followed by a colon and the page's section title.
As with the global configuration, a `json` block with the `config` class is to be created below the page's heading. From the example above:
### Guía de exploración
```{ .json .config }
{
"template": "index.tt",
"output": "guidance-es.html",
"navlang": "guidance.html"
}
```
Five keys are currently supported (see the example after this list):

- `template` states the name of the template file to be used to render the current page. Currently, there are two template files: `index.tt`, used for all regular pages, and `diagram-page.tt`, used for pages that show a lattice diagram (a third template file, `main.tt`, exists, but it only contains function and macro definitions used by the other two templates; it is not for rendering web pages).
- `output` states the file path relative to the `site` directory (or wherever the `SITEDIR` variable points) where the rendered HTML will be written.
- `navlang` states the URL (most probably relative) of the alternate language's version of the current page.
- `styles` states an array of URLs pointing to CSS files that are to be included for the page. This allows bespoke styling to be applied to the page in question.
- `background` states, for the `diagram-page.tt` template, a URL pointing to an image to be rendered as a background for the diagram.
It is a series of paragraphs, headings and list blocks with the actual content for the page. Level-4 headings and deeper will be translated to HTML headings two levels higher, so level-4 becomes `<h2>`, level-5 becomes `<h3>` and so on. Images, links, and ordered and unordered lists are supported, as well as basic text formatting such as bold, italics and so on. Support for additional Markdown blocks requires development for their accommodation in the template processor.
In case the `diagram-page.tt` template is used, the diagram's viewer and editor will be rendered after the content.
Diagram pages are special because they require a set of input files that are expected to be placed alongside their HTML output file. There are five diagram input files:
- `config.js` contains code that is executed before the diagram viewer and editor are initialized. In particular, it sets up `window.Config`, an object with the configuration parameters used by the diagram viewer. This file is to be put in place by the content creator and edited manually.
- `lattice.json` contains the actual data related to the lattice's structure and FCA properties such as intents and extents. It is automatically generated from the corresponding context located in the database, processed during the build phase, specifically by `gen_lattices.sh`.
- `pos.json` contains the on-screen coordinates of each of the lattice's nodes. The JSON data is provided by the Diagram Editor to the content creator as the diagram's nodes are visually arranged. The content has to be manually copy/pasted from a text box in the editor into the file.
- `attr_desc.csv` and `attr_class_desc.csv` are catalogs documenting fields for Attributes and Attribute Classes, respectively. They provide text that is used as different kinds of labels all around the diagram, and are automatically extracted from the database in the build phase, specifically by `gen_diagram_catalogs.sh`.
Since these files have the same name for all diagrams, each diagram page is implicitly required to be located inside a directory of its own, so that the input files' paths are distinct between diagrams. The environment variable governing these locations is `DIAGRAMDIR`.
Among the Diagram Pages input files there are two related to the visual presentation of the data: `pos.json` and `config.js`. The Diagram Editor allows you to graphically work on the parameters and values of these files and obtain the new data.
To access the Diagram Editor, just use the web browser and visit the corresponding diagram's page.
ℹ️ Note: Due to modern web browser policies, you won't be able to open the page through the file system (URL beginning with the `file://` scheme), so you will have to launch a local web server for local browsing. You can do that by running `scripts/commands/http-server.command`.
If you open the diagram page through the loopback device (where the URL has the address 127.0.0.1, such as http://127.0.0.1:8080/theories/en/basic/), you will automatically have the ➀ `Editor` button. Otherwise, you can force the activation of the editor with two other methods:
- Add the `editor` query parameter: visit the diagram page using the original URL with `?editor` appended at the end. So, instead of visiting https://remo.cua.uam.mx/vis/Exploratorium/theories/en/basic, visit https://remo.cua.uam.mx/vis/Exploratorium/theories/en/basic?editor
- Use the diagram's configuration file: in the corresponding `config.js` file, set the value for `USE_EDITOR` to `true`.
Here is a screenshot of a diagram page with the Diagram Editor activated:
You can now proceed to move the diagram's nodes as needed. Once you are done with the changes, click on the ➁ `JSON Get` button. This will populate the adjacent text box with the JSON code representing the position of the nodes in the diagram. As with most standard web pages and browsers, you can stretch the text box by dragging its corner if you wish to inspect the code.
Now the JSON code has to be put in the corresponding diagram's `pos.json` file to make the changes permanent. To facilitate this, you can click on the ➂ `JSON Copy` button to copy the entirety of the code onto the clipboard, allowing you to open the `pos.json` file with a text editor and replace the old code with a paste operation.
⚠️ Warning: do not reload or navigate away from the web page while you have changes pending to be saved or you will lose your work. The changes made to the diagrams do not persist between visits.
The second row of the editor interface allows you to modify the viewbox's geometry. To save those changes, click on the ➃ `Viewbox Copy` button so that the geometry values are put onto the clipboard. Open the corresponding `config.js` file and replace the `VIEWBOX` value with the new one. Respect the quotation marks while doing so.
All of the following programs are automatically called by `build.sh` and are not top-level, so they need not be called directly by the content creator, except for development purposes or for more fine-grained control of the generation process.
Check the execution dependencies tree to visualize which scripts call which.
Generate from the database the diagram catalogs (`attr_desc.csv` and `attr_class_desc.csv`, see Diagram Pages) for all of the contexts, or those selected by `DIAGRAM_FILTERS`.
Invocation:

```sh
gen_diagram_catalogs.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
- The `DIAGRAM_FILTERS` environment variable selects which diagrams/contexts to work on.
- The `DIAGRAMDIR` environment variable determines the directory where the catalog files for each context will be put.
Generate from the database the `lattice.json` file (see Diagram Pages) corresponding to every context, or those selected by `DIAGRAM_FILTERS`. Internally calls `gen_lattice.sh` for each selected context.
Since calling Conexp to perform the FCA processing has some overhead, only those contexts whose data changed in the database since the last execution are actually processed. This is done by generating a temporary dump using `db_dump_data.sh` and comparing it to the last one generated, to see which contexts present changes. These temporary dumps are put in `DBDIR`, and removing them forces all of the selected context lattices to be reanalyzed. Use the optional `-f` argument to directly perform the generation without any checks.
Invocation:

```sh
gen_lattices.sh [-f] [«DSN»]
```

- `-f`: optional flag that forces the regeneration of all of the `lattice.json` files.
- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
- The `DIAGRAM_FILTERS` environment variable selects which diagrams/contexts to work on.
- The `DIAGRAMDIR` environment variable determines the directory where each `lattice.json` file will be put.
A wrapper script that prepares the environment for Conexp to be called and then calls the actual generating script, be it the Perl version or a Wolframscript version that was coded as a technology demonstrator but was superseded by the Perl version, which runs faster and is easier to set up.
Invocation:

```sh
gen_lattice.sh «DSN» «context» «lang code» «debug flag»
```
Call Conexp functionality, using its binary jar as a library, to perform an FCA analysis on the specified context. There's a twin script called `gen_lattice.wls` that does exactly the same thing with the same invocation interface, but using Wolframscript. This is a Perl port of that script, and it's the preferred method to interface with Conexp as it is faster and easier to set up, since it eliminates the requirement to install the Wolfram Engine and put a license in place.
Invocation:

```sh
gen_lattice.pl «DSN» «context» «lang code» «debug flag»
```

- `«DSN»`: Data Source Name, currently the name of the SQLite database file.
- `«context»`: name in the database of the context to process.
- `«lang code»`: language code of the context, affecting the rendering of the names of Attributes and Objects.
- `«debug flag»`: optional. `1` or `0` (default); makes the generator print to standard error the result of the operations interfacing with Conexp as the FCA processes are performed.
Output is sent to standard output.
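A typical invocation, with debug output enabled and the result redirected to the diagram's directory (paths assume the defaults described in this document):

```sh
scripts/gen_lattice.pl db/exploratorium.db protophysics es 1 \
    > site/theories/es/protophysics/lattice.json
```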
Regenerates the Gravity Tree using `gen_gravitytree.sh` and all pages from the Master Markdown files for all languages configured in the database. Calls `gen_pages.pl` for each master, using `--force`.
Invocation:

```sh
gen_pages.sh [«DSN»]
```

- `«DSN»`: optional Data Source Name, see `DEFAULT_DBDSN`.
- The `MASTER_NAME` environment variable determines the name pattern of the Master Markdown files to use as input.
This is a bespoke script written to convert the resources in the `gravity-tree` directory into the web-friendly version that goes to `$SITEDIR/gravity-tree`. It requires Inkscape and Imagemagick 6.q16 (see Installation Observations).
Invocation:

```sh
gen_gravitytree.sh
```

- This program takes no arguments. Configuration is done through the `PROJECTDIR` and `SITEDIR` environment variables.
This is the core of the Template Processor. It takes a Master Markdown file as input and generates all of the HTML files specified in the Pages section. Only the HTML files that are older than the input file are processed, unless the force flag is used.
Invocation:

```sh
gen_pages.pl «master file» [«target» | «force flag» | «clean flag»]
```

- `«master file»`: name of the Master Markdown file to parse and process.
- `«target»`: specifies a single output file to be generated, instead of all of them (compared against the `output` Page Configuration value of each Page).
- `«force flag»`: if `--force` is specified, all of the pages are regenerated.
- `«clean flag»`: if `--clean` is specified, the files that would be generated are deleted instead of processing the pages.
Understanding the design of the database gives a pretty good picture of the system's capabilities and potential. The database is normalized to the fourth normal form (4NF) to guarantee that no inconsistencies are introduced by the content creator (example: assigning, in a context, an attribute to an object when the latter does not belong to that context). 4NF is suitable for this application since the number of relations and the amount of data are low.
The diagram has been laid out horizontally to illustrate entities that describe the data from the most core descriptions on the right (object-attribute assignments on contexts) to the most detailed information on the left (language-dependent descriptions).
Relation names are all in singular, as they are considered object classes. They are shown in lowercase as they are easier to read and are easier to distinguish from SQL reserved words which are typically written in uppercase.
Each entity will first be described conceptually, and afterwards the relations that represent it will be treated.
Just a catalog of language codes (PK) and a label for display purposes. `lang_code` is a mnemonic identifier that affects the directory structure of the site. Suggested values are `en`, `es`, `fr`, `jp` and so on.
In FCA, the concept of an object is just a thing that has the capacity of being assigned attributes. They can be just about anything that is being described, observed or studied, and of which attributes are found and listed. For easier identification and interaction within the diagram, objects are assigned user-facing labels, and for internal representation, they are given unique codes.
Base relation that establishes the existence of an object. A numeric ID would suffice, and that indeed is what is used as `object_id` whenever an object is referenced within the database. Nevertheless, `code` was added to maintain the original representation of objects within context matrices.
`code` allows for the rendering of context tables that the content creator can work with, without having to use longer labels that are otherwise better suited for the final product. Also, `code` is just a mnemonic, independent of the display language, probably an abbreviation of the object's label in English. So, while an object's label may be `Vector tensor General` in English and `V-T General` in Spanish, its code could be `GeneralVT`, and that would be easier to work with when displayed in a spreadsheet or an interactive Context matrix.
Stores descriptive data of an object that is language-dependent, probably for user-facing display purposes. For now, only `label` has been required, and that is what appears in a white box on the bottom-right side of nodes that represent objects in the diagram.
Three object nodes displayed with their corresponding labels in English
All objects may be assigned a set of attributes, and many objects may share a number of attributes in common for a given context. As the presence of attributes within an object characterizes the object itself, attributes require a greater number of description fields to enrich the information portrayed by the diagram.
Base relation that establishes the existence of an attribute, with its mandatory numeric ID.
⚠️ Note: Attributes should have a `code` field, just as is the case for Objects, but it is not present in the database structure, which is a strange omission, and it has been noted as a TO-DO item.
Attributes also have a `formula` field whose value is optional, which contains text (probably LaTeX code) describing the equation related to the attribute. This text is considered language-independent, and that's why it is in this relation and not in `attribute_desc`.
Stores descriptive data of an attribute that is language-dependent, for user-facing display purposes. `obs` is a field that is not for display, but helps with internally keeping track of the work on the attribute in question. The other data fields are described in the figure below.
Since the description of an attribute varies according to the methodology behind the construction of the ontology in which the attribute appears, context classes have been implemented so that some attributes may have different descriptor fields based on the class of the context, hence the foreign key represented by `context_class_id`.
An attribute labeled "Eq 13" with its info box open and the diagram's legend
In the figure above, we see:

- ➀ `attribute_desc.label` represents the attribute in the diagram, within a box colored according to its `attr_class_id`, located beside the concept node that it qualifies.
- ➁ `attribute_desc.title` is the big text that heads the attribute's info box. It is not as abbreviated as the label.
- ➂ `attribute.formula` may optionally be present in the info box with the equation related to the attribute.
- ➃ `attribute_desc.explanation` is the longer text, most probably a short paragraph giving more detail on the attribute.
- ➄ `attr_context_class.reference` is the formal reference, a locator of the original text that supports the identification of the attribute.
- ➅ `attr_class_desc.title` is the label representing the attribute class in the diagram's legend. The color of the boxes for the class is determined by `attr_class.code` and the site's style sheet. The order in which they appear is set by `attr_class.ord`.
Attribute classes were implemented to enhance the readability of complex diagrams that present many attributes which may have basic commonalities between them and that can be distinguished by graphical means, such as coloring.
Base relation that establishes the existence of an attribute class. These entities have a `code` for internal representation and a numeric ordinal `ord` that allows for the discrete establishment of the position of the classes within the diagram legend. As expected, these fields are present in this base relation as they are language-independent.
Descriptive, language-dependent data related to the attribute class. For now, it is only `title`, which is displayed in the diagram's legend.
In FCA, formal contexts represent ontologies described by a collection of objects and attribute assignments. An Exploratorium project may have a number of contexts that may share objects and attributes between them.
Contexts have classes, an internal classification means that allows attributes to be shared between contexts, even if the role of the attribute differs between them. Since the attribute's role changes, the description data of the attribute will probably change too, depending on the class of the context being visualized; that's why Attribute Descriptions are related to an attribute-context class combination and not just the attribute.
Base relation that establishes the existence of a context. `code` is used for internal representation, and by default sets the name of the directory where the files of the diagram related to the context will reside (see `DIAGRAMDIR`).
A context has one and only one class, and as such it is represented by the simple foreign key `context_class_id`.
Base relation that establishes the existence of a context class. As with some other base relations, it has a `code` field for internal work, but it has no repercussion in the rendering of web pages; it is used mostly just as a mnemonic.
Stores tuples declaring which objects belong to which contexts. This is a many-to-many relation, where many objects may appear in many contexts, so they may be reused or present in more than one context for a given project. This is useful as it guards against inconsistencies in the assignment of an attribute to an object within a given context, and it also allows for the representation within a given context of objects that (maybe transitionally) have no attributes assigned to them.
Stores tuples declaring which attributes belong to which contexts. This is a many-to-many relation, where many attributes may appear in many contexts, so they may be reused or present in more than one context for a given project. This is useful as it guards against inconsistencies in the assignment of an attribute to an object within a given context, and it also allows for the representation within a given context of attributes that (maybe transitionally) are not present for any objects.
This should have been triplets of context-object-attribute associations, but due to the workings of relational algebra, the context ID has to be redundantly stated twice. There is a constraint forcing `object_context_id` and `attribute_context_id` to be equal within every record, to avoid inconsistencies anyway. Also, there is an `x` field, forced to always have a value of `1`, which is there to facilitate the creation of pivot tables.
After all of the declarations and definitions of the previous relations, this is the central part of the database, which finally joins contexts, objects and attributes and establishes the structure of the lattices that will be represented in the diagrams.