Commit c392350

Docs (#29)
* Touch up docstrings
* Move integrator and step size adaptation to top-level package
* Add step sizes and integrators to API docs
* Remove step sizes and integration from top-level
* STY: black line length 100
* DOC: more documentation
* REV: remove python3
* MAINT: refactor stats reshaping
* TST: improve tests
1 parent 2f89c79 commit c392350

12 files changed (+309 −143 lines)

docs/_static/notebooks/quickstart.ipynb (+180 −4)

```diff
@@ -4,7 +4,33 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "# Quickstart"
+    "# LittleMCMC Quickstart\n",
+    "\n",
+    "LittleMCMC is a lightweight and performant implementation of HMC and NUTS in Python, spun out of the PyMC project. In this quickstart tutorial, we will introduce LittleMCMC.\n",
+    "\n",
+    "## Table of Contents\n",
+    "\n",
+    "- [Who should use LittleMCMC?](#Who-should-use-LittleMCMC?)\n",
+    "- [Sampling](#Sampling)\n",
+    "  - [Inspecting the Output of `lmc.sample`](#Inspecting-the-Output-of-lmc.sample)\n",
+    "- [Other Modules](#Other-Modules)\n",
+    "\n",
+    "## Who should use LittleMCMC?\n",
+    "\n",
+    "<div class=\"alert alert-block alert-info\">\n",
+    "LittleMCMC is a fairly bare bones library with a very niche use case. Most users will probably find that [PyMC3](https://github.com/pymc-devs/pymc3) will satisfy their needs, with better strength of support and quality of documentation.\n",
+    "</div>\n",
+    "\n",
+    "If you:\n",
+    "\n",
+    "1. Have a model with only continuous parameters,\n",
+    "1. Are willing to manually \"unconstrain\" all of your model's parameters (if necessary),\n",
+    "1. Have methods to compute the log probability of the model and its derivative, exposed via a Python callable,\n",
+    "1. And all you need is an implementation of HMC/NUTS (preferably in Python) to sample from your model\n",
+    "\n",
+    "then you should consider using LittleMCMC!\n",
+    "\n",
+    "## Sampling"
    ]
   },
   {
@@ -38,10 +64,160 @@
   },
   {
    "cell_type": "code",
-   "execution_count": null,
+   "execution_count": 3,
    "metadata": {},
-   "outputs": [],
-   "source": []
+   "outputs": [
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "/Users/george/miniconda3/lib/python3.6/site-packages/ipykernel_launcher.py:2: RuntimeWarning: divide by zero encountered in log\n",
+      " \n",
+      "/Users/george/miniconda3/lib/python3.6/site-packages/ipykernel_launcher.py:2: RuntimeWarning: divide by zero encountered in log\n",
+      " \n"
+     ]
+    },
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "\n"
+     ]
+    }
+   ],
+   "source": [
+    "trace, stats, results = lmc.sample(\n",
+    "    logp_dlogp_func=logp_dlogp_func,\n",
+    "    size=1,\n",
+    "    draws=1000,\n",
+    "    tune=500,\n",
+    "    step=lmc.NUTS(logp_dlogp_func=logp_dlogp_func, size=1),\n",
+    "    chains=4,\n",
+    "    cores=4,\n",
+    "    progressbar=\"notebook\"\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### Inspecting the Output of `lmc.sample`"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 4,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array([-0.38331274, -1.76994233, -0.67234733, ...,  0.27817656,\n",
+       "        0.29250676,  0.42966184])"
+      ]
+     },
+     "execution_count": 4,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "trace"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 5,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(4000,)"
+      ]
+     },
+     "execution_count": 5,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "trace.shape"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 6,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "{'depth': array([1, 1, 1, ..., 1, 2, 1]),\n",
+       " 'step_size': array([0.94586326, 0.94586326, 0.94586326, ..., 2.16938615, 2.16938615,\n",
+       "        2.16938615]),\n",
+       " 'tune': array([False, False, False, ..., False, False, False]),\n",
+       " 'mean_tree_accept': array([1.        , 0.43665689, 1.        , ..., 0.98765583, 0.72296808,\n",
+       "        0.97965297]),\n",
+       " 'step_size_bar': array([1.20597596, 1.20597596, 1.20597596, ..., 1.28614833, 1.28614833,\n",
+       "        1.28614833]),\n",
+       " 'tree_size': array([1., 1., 1., ..., 1., 3., 1.]),\n",
+       " 'diverging': array([False, False, False, ..., False, False, False]),\n",
+       " 'energy_error': array([-0.25675836,  0.82860753, -0.74393026, ...,  0.01242099,\n",
+       "         0.00169732,  0.02055688]),\n",
+       " 'energy': array([1.25393394, 2.56056236, 1.91071276, ..., 0.95981431, 1.76229677,\n",
+       "        1.02575724]),\n",
+       " 'max_energy_error': array([-0.25675836,  0.82860753, -0.74393026, ...,  0.01242099,\n",
+       "         0.56981615,  0.02055688]),\n",
+       " 'model_logp': array([-0.99240286, -2.48528646, -1.144964  , ..., -0.95762963,\n",
+       "        -0.96171864, -1.01124318])}"
+      ]
+     },
+     "execution_count": 6,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "stats"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 7,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "(4000,)"
+      ]
+     },
+     "execution_count": 7,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "stats[\"diverging\"].shape"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Other Modules\n",
+    "\n",
+    "LittleMCMC exposes:\n",
+    "\n",
+    "1. Two step methods: Hamiltonian Monte Carlo (HMC) and the No-U-Turn Sampler (NUTS)\n",
+    "1. Quadpotentials (a.k.a. mass matrices or inverse metrics)\n",
+    "1. Dual-averaging step size adaptation\n",
+    "1. Leapfrog integration\n",
+    "\n",
+    "Refer to the [API Reference](https://littlemcmc.readthedocs.io/en/latest/api.html) for more information."
+   ]
   }
  ],
  "metadata": {
```

docs/api.rst (+21)

```diff
@@ -26,6 +26,8 @@ Step Methods
    HamiltonianMC
    NUTS
 
+.. _quadpotentials_api:
+
 Quadpotentials (a.k.a. Mass Matrices)
 -------------------------------------
 
@@ -39,3 +41,22 @@ Quadpotentials (a.k.a. Mass Matrices)
    QuadPotentialDiagAdapt
    QuadPotentialFullAdapt
 
+.. _step_sizes_api:
+
+Dual Averaging Step Size Adaptation
+-----------------------------------
+
+.. autosummary::
+   :toctree: generated/
+
+   step_sizes.DualAverageAdaptation
+
+.. _integrators_api:
+
+Integrators
+-----------
+
+.. autosummary::
+   :toctree: generated/
+
+   integration.CpuLeapfrogIntegrator
```
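For readers assembling these newly documented components by hand, a minimal sketch follows. The `CpuLeapfrogIntegrator(potential, logp_dlogp_func)` call mirrors its usage in `littlemcmc/base_hmc.py` below; the `from littlemcmc.quadpotential import quad_potential` path and the top-level `integration` module are assumptions about the package layout (the docstring below references `littlemcmc.quadpotential.Potential`), so check the rendered API reference:

```python
import numpy as np
from littlemcmc import integration
from littlemcmc.quadpotential import quad_potential  # assumed import path


def logp_dlogp_func(x):
    # Standard normal, for illustration only
    return -0.5 * np.sum(x ** 2), -x


# A unit diagonal mass matrix, interpreted as a covariance (is_cov=True)
potential = quad_potential(np.ones(1), is_cov=True)
integrator = integration.CpuLeapfrogIntegrator(potential, logp_dlogp_func)
```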

littlemcmc/base_hmc.py (+27 −23)

```diff
@@ -54,25 +54,34 @@ def __init__(
             the log-probability, respectively.
         size : int
             Total number of parameters. Dimensionality of the output of
-            `logp_dlogp_func`.
+            ``logp_dlogp_func``.
         scaling : 1 or 2-dimensional array-like
             Scaling for momentum distribution. 1 dimensional arrays are
-            interpreted as a matrix diagonal. Only one of `scaling` or
-            `potential` may be non-None.
+            interpreted as a matrix diagonal. Only one of ``scaling`` or
+            ``potential`` may be non-None.
         is_cov : bool
             Treat scaling as a covariance matrix/vector if True, else treat
             it as a precision matrix/vector
         potential : littlemcmc.quadpotential.Potential, optional
-            An object that represents the Hamiltonian with methods `velocity`,
-            `energy`, and `random` methods. Only one of `scaling` or `potential`
-            may be non-None.
+            An object that represents the Hamiltonian with methods ``velocity``,
+            ``energy``, and ``random`` methods. Only one of ``scaling`` or
+            ``potential`` may be non-None.
         target_accept : float
+            Adapt the step size such that the average acceptance probability
+            across the trajectories are close to target_accept. Higher values
+            for target_accept lead to smaller step sizes. Setting this to higher
+            values like 0.9 or 0.99 can help with sampling from difficult
+            posteriors. Valid values are between 0 and 1 (exclusive).
         Emax : float
+            The maximum allowable change in the value of the Hamiltonian. Any
+            trajectories that result in changes in the value of the Hamiltonian
+            larger than ``Emax`` will be declared divergent.
         adapt_step_size : bool, default=True
             If True, performs dual averaging step size adaptation. If False,
-            `k`, `t0`, `gamma` and `target_accept` are ignored.
+            ``k``, ``t0``, ``gamma`` and ``target_accept`` are ignored.
         step_scale : float
-            Size of steps to take, automatically scaled down by 1 / (size ** 0.25)
+            Size of steps to take, automatically scaled down by 1 / (``size`` **
+            0.25).
         gamma : float, default .05
         k : float, default .75
             Parameter for dual averaging for step size adaptation. Values
@@ -81,8 +90,9 @@ def __init__(
         t0 : int, default 10
             Parameter for dual averaging. Higher values slow initial adaptation.
         step_rand : Python callable
-            Called on step size to randomize, immediately before adapting step
-            size.
+            Callback for step size adaptation. Called on the step size at each
+            iteration immediately before performing dual-averaging step size
+            adaptation.
         """
         self._logp_dlogp_func = logp_dlogp_func
         self.adapt_step_size = adapt_step_size
@@ -109,9 +119,7 @@ def __init__(
         else:
             self.potential = quad_potential(scaling, is_cov)
 
-        self.integrator = integration.CpuLeapfrogIntegrator(
-            self.potential, self._logp_dlogp_func
-        )
+        self.integrator = integration.CpuLeapfrogIntegrator(self.potential, self._logp_dlogp_func)
         self._step_rand = step_rand
         self._warnings: List[SamplerWarning] = []
         self._samples_after_tune = 0
@@ -122,9 +130,7 @@ def stop_tuning(self) -> None:
         if hasattr(self, "tune"):
             self.tune = False
 
-    def _hamiltonian_step(
-        self, start: np.ndarray, p0: np.ndarray, step_size: float
-    ) -> HMCStepData:
+    def _hamiltonian_step(self, start: np.ndarray, p0: np.ndarray, step_size: float) -> HMCStepData:
         """Compute one Hamiltonian trajectory and return the next state.
 
         Subclasses must overwrite this method and return a `HMCStepData`.
@@ -138,9 +144,7 @@ def _astep(self, q0: np.ndarray):
 
         if not np.isfinite(start.energy):
             raise ValueError(
-                "Bad initial energy: {}. The model might be misspecified.".format(
-                    start.energy
-                )
+                "Bad initial energy: {}. The model might be misspecified.".format(start.energy)
             )
 
         # Adapt step size
@@ -194,7 +198,9 @@ def warnings(self) -> List[SamplerWarning]:
         message = ""
         n_divs = self._num_divs_sample
         if n_divs and self._samples_after_tune == n_divs:
-            message = "The chain contains only diverging samples. The model is probably misspecified."
+            message = (
+                "The chain contains only diverging samples. The model is probably misspecified."
+            )
         elif n_divs == 1:
             message = (
                 "There was 1 divergence after tuning. Increase "
@@ -207,9 +213,7 @@ def warnings(self) -> List[SamplerWarning]:
             )
 
         if message:
-            warning = SamplerWarning(
-                WarningType.DIVERGENCES, message, "error", None, None, None
-            )
+            warning = SamplerWarning(WarningType.DIVERGENCES, message, "error", None, None, None)
             warnings.append(warning)
 
         warnings.extend(self.step_adapt.warnings())
```
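The newly documented `target_accept` and `Emax` parameters are the main tuning knobs this docstring adds. Assuming `lmc.NUTS` forwards its keyword arguments to `BaseHMC.__init__` (the commit documents them only on the base class), a difficult posterior might be tackled with a sketch like this:

```python
import littlemcmc as lmc

# Hypothetical usage: a higher target_accept yields smaller, more careful
# steps, per the docstring above. Emax (left at its default here) bounds the
# change in the Hamiltonian before a trajectory is declared divergent.
step = lmc.NUTS(
    logp_dlogp_func=logp_dlogp_func,  # as in the quickstart sketch above
    size=1,
    target_accept=0.9,
)
```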
