
Commit 28ad8fd

Fix extending notebook.
1 parent 07897ce commit 28ad8fd

3 files changed: +67 −133 lines changed

extending_our_work.ipynb

Lines changed: 54 additions & 133 deletions
@@ -13,12 +13,12 @@
  "id": "8eaebd2e-4091-494f-83fd-3b221dacd087",
  "metadata": {},
  "source": [
- "This notebook is intended as a tutorial: a guide to the usage of our system on new problems, which illustrates how several parts of the system work together."
+ "This notebook shows how to use our library to solve a new inference task, beyond those considered in our experiments. It is intended to illustrate the usage of the library, but assumes some knowledge of variational inference. The inference problem comes from [Pyro's SVI Part I tutorial](https://pyro.ai/examples/svi_part_i.html#A-simple-example)."
  ]
 },
 {
  "cell_type": "code",
- "execution_count": 1,
+ "execution_count": null,
  "id": "dd955977-8d0a-4a68-81c7-f031f17c348e",
  "metadata": {},
  "outputs": [],
@@ -46,35 +46,28 @@
  "id": "fcdc73aa-dbec-4c10-adc7-5f1e60125c5e",
  "metadata": {},
  "source": [
- "Models and variational families (guides) in our system are probabilistic programs. We write these using a modeling language which can be accessed via the `genjax.gen` decorator. Below, _addresses (like `\"latent_fairness\"`) denote random variables. Although we don't show it here, deterministic (JAX traceable) code can be freely interwoven between random variable statements."
+ "Models and variational families (guides) in our system are probabilistic programs. We write these using a modeling language which can be accessed via the `genjax.gen` decorator. \n",
+ "\n",
+ "In the code, random choices can be made using the syntax `dist(args) @ \"choice_name\"`, where `\"choice_name\"` is a unique name for the random variable being sampled. In the code below, our model defines a distribution over two random variables, and the variational family, or guide, defines a distribution over only one random variable.\n",
+ "\n",
+ "Although we don't show it here, deterministic (JAX traceable) code can be freely interwoven between random variable statements."
  ]
 },
 {
  "cell_type": "code",
- "execution_count": 2,
+ "execution_count": null,
  "id": "7330bcaf",
  "metadata": {},
- "outputs": [
-  {
-   "data": {
-    "text/plain": [
-     "BuiltinGenerativeFunction(source=<function model at 0x142c97370>)"
-    ]
-   },
-   "execution_count": 2,
-   "metadata": {},
-   "output_type": "execute_result"
-  }
- ],
+ "outputs": [],
  "source": [
  "#####################\n",
  "# Model & Guide\n",
  "#####################\n",
  "\n",
  "@genjax.gen\n",
  "def model():\n",
- "    f = genjax.beta(2.0, 2.0) @ \"latent_fairness\"\n",
- "    _ = genjax.tfp_bernoulli(f) @ \"obs\"\n",
+ "    f = genjax.tfp_beta(10.0, 10.0) @ \"latent_fairness\"\n",
+ "    _ = genjax.tfp_flip(f) @ \"obs\"\n",
  "\n",
  "\n",
  "@genjax.gen\n",
@@ -112,7 +105,7 @@
 },
 {
  "cell_type": "code",
- "execution_count": 3,
+ "execution_count": null,
  "id": "702e667a-ae2c-4bb5-8a93-7211862a567f",
  "metadata": {},
  "outputs": [],
@@ -156,27 +149,19 @@
 },
 {
  "cell_type": "code",
- "execution_count": 4,
+ "execution_count": null,
  "id": "6858aa01-1fa5-42a7-8302-6085fec5c08c",
  "metadata": {},
- "outputs": [
-  {
-   "name": "stdout",
-   "output_type": "stream",
-   "text": [
-    "[ True False False False False False False False False False]\n"
-   ]
-  }
- ],
+ "outputs": [],
  "source": [
  "#####################\n",
  "# Data Generation\n",
  "#####################\n",
  "\n",
  "data = []\n",
- "for _ in range(1):\n",
+ "for _ in range(6):\n",
  " data.append(True)\n",
- "for _ in range(9):\n",
+ "for _ in range(4):\n",
  " data.append(False)\n",
  "\n",
  "data = jnp.array(data)\n",
@@ -186,44 +171,21 @@
 },
 {
  "cell_type": "code",
- "execution_count": 5,
+ "execution_count": null,
  "id": "23962668-0a2f-423d-8a80-e2930d270f77",
  "metadata": {},
- "outputs": [
-  {
-   "data": {
-    "text/plain": [
-     "Expectation(prog=ADEVProgram(source=<function elbo.<locals>.elbo_loss at 0x142ce8c10>))"
-    ]
-   },
-   "execution_count": 5,
-   "metadata": {},
-   "output_type": "execute_result"
-  }
- ],
+ "outputs": [],
  "source": [
  "objective = elbo(model, guide, genjax.choice_map({\"obs\": data}))\n",
  "objective"
  ]
 },
 {
  "cell_type": "code",
- "execution_count": 6,
+ "execution_count": null,
  "id": "aef685d1-e026-4801-961a-033b4c70b53a",
  "metadata": {},
- "outputs": [
-  {
-   "data": {
-    "text/plain": [
-     "(Array(-2.9085593, dtype=float32, weak_type=True),\n",
-     " Array(3.0023632, dtype=float32, weak_type=True))"
-    ]
-   },
-   "execution_count": 6,
-   "metadata": {},
-   "output_type": "execute_result"
-  }
- ],
+ "outputs": [],
  "source": [
  "key, sub_key = jax.random.split(key)\n",
  "_, q_grads = objective.grad_estimate(sub_key, ((), (1.0, 1.0)))\n",
@@ -235,7 +197,7 @@
  "id": "2dd4bf25-31a8-4f17-bcb5-fc01014bd6d3",
  "metadata": {},
  "source": [
- "That all works, like you'd expect it to."
+ "The `objective.grad_estimate` method takes arguments `(key: PRNGKey, loss_args: Tuple)` and returns an unbiased estimate of the gradient of our objective. We can use these gradient estimates for stochastic optimization of the guide's parameters (see below)."
  ]
 },
 {
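Note: as a rough illustration of the `grad_estimate` usage described in the cell above, here is a sketch of one hand-rolled stochastic gradient ascent step on the guide parameters. The `((), (alpha, beta))` argument structure and the `_, q_grads` unpacking are taken from the calls visible in this diff; the assumptions that the returned gradients mirror the `(model_args, guide_args)` structure and that the ascent (`+`) direction is correct for ELBO maximization are not confirmed by the hunk.

```python
import jax

# One manual stochastic-gradient-ascent step on the guide's (alpha, beta),
# assuming `objective` was built as in the notebook, e.g. with
# genjax.vi.elbo(model, guide, genjax.choice_map({"obs": data})).
def ascent_step(key, objective, alpha, beta, step_size=1e-3):
    key, sub_key = jax.random.split(key)
    # grad_estimate(key, loss_args) -> unbiased gradient estimate; here we
    # assume the estimate mirrors the ((), (alpha, beta)) argument structure.
    _, (d_alpha, d_beta) = objective.grad_estimate(sub_key, ((), (alpha, beta)))
    return key, alpha + step_size * d_alpha, beta + step_size * d_beta
```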
@@ -256,44 +218,21 @@
 },
 {
  "cell_type": "code",
- "execution_count": 7,
+ "execution_count": null,
  "id": "94feba7c-e704-4683-8da8-c7a8e1c37b75",
  "metadata": {},
- "outputs": [
-  {
-   "data": {
-    "text/plain": [
-     "Expectation(prog=ADEVProgram(source=<function elbo.<locals>.elbo_loss at 0x158be7d00>))"
-    ]
-   },
-   "execution_count": 7,
-   "metadata": {},
-   "output_type": "execute_result"
-  }
- ],
+ "outputs": [],
  "source": [
  "objective = genjax.vi.elbo(model, guide, genjax.choice_map({\"obs\": data}))\n",
  "objective"
  ]
 },
 {
  "cell_type": "code",
- "execution_count": 8,
+ "execution_count": null,
  "id": "a330520e-93a5-49ef-b24f-c41e7b2f2dd9",
  "metadata": {},
- "outputs": [
-  {
-   "data": {
-    "text/plain": [
-     "(Array(-2.1008544, dtype=float32, weak_type=True),\n",
-     " Array(-0.2123673, dtype=float32, weak_type=True))"
-    ]
-   },
-   "execution_count": 8,
-   "metadata": {},
-   "output_type": "execute_result"
-  }
- ],
+ "outputs": [],
  "source": [
  "key, sub_key = jax.random.split(key)\n",
  "_, q_grads = objective.grad_estimate(sub_key, ((), (1.0, 1.0)))\n",
@@ -310,7 +249,7 @@
 },
 {
  "cell_type": "code",
- "execution_count": 9,
+ "execution_count": null,
  "id": "6079ea7c",
  "metadata": {},
  "outputs": [],
@@ -347,19 +286,28 @@
 },
 {
  "cell_type": "code",
- "execution_count": 10,
+ "execution_count": null,
  "id": "56a6b73f",
  "metadata": {},
  "outputs": [],
  "source": [
  "# setup the optimizer\n",
- "adam = optax.adam(5e-3)\n",
+ "adam = optax.adam(5e-4)\n",
  "svi_updater = svi_update(model, guide, adam)\n",
  "\n",
  "# initialize parameters\n",
  "alpha = jnp.array(2.0)\n",
  "beta = jnp.array(2.0)\n",
  "\n",
+ "# here we use some facts about the Beta distribution\n",
+ "start_mean = alpha / (alpha + beta)\n",
+ "factor = beta / (alpha * (1.0 + alpha + beta))\n",
+ "start_std = start_mean * jnp.sqrt(factor)\n",
+ "print(\n",
+ "    \"\\nStarting mean and std \"\n",
+ "    + \"is %.3f +- %.3f\" % (start_mean, start_std)\n",
+ ")\n",
+ "\n",
  "params = (alpha, beta)\n",
  "opt_state = adam.init(params)\n",
  "\n",
@@ -378,7 +326,7 @@
 },
 {
  "cell_type": "code",
- "execution_count": 11,
+ "execution_count": null,
  "id": "e0e2594a",
  "metadata": {},
  "outputs": [],
@@ -387,7 +335,7 @@
  "# Gradient Steps\n",
  "#####################\n",
  "\n",
- "for step in range(2000):\n",
+ "for step in range(5000):\n",
  " key, sub_key = jax.random.split(key)\n",
  " params, loss, opt_state = svi_updater(key, data, params, opt_state)"
  ]
@@ -402,19 +350,10 @@
 },
 {
  "cell_type": "code",
- "execution_count": 12,
+ "execution_count": null,
  "id": "9d153225",
  "metadata": {},
- "outputs": [
-  {
-   "name": "stdout",
-   "output_type": "stream",
-   "text": [
-    "\n",
-    "Based on the data and our prior belief, the fairness of the coin is 0.293 +- 0.191\n"
-   ]
-  }
- ],
+ "outputs": [],
  "source": [
  "#####################\n",
  "# Inferred parameters\n",
@@ -444,7 +383,7 @@
 },
 {
  "cell_type": "code",
- "execution_count": 13,
+ "execution_count": null,
  "id": "0fa5e755-98ef-4a17-ae9c-923415a84765",
  "metadata": {},
  "outputs": [],
@@ -468,12 +407,12 @@
  " return updater\n",
  "\n",
  " # setup the optimizer\n",
- " adam = optax.adam(5e-3)\n",
+ " adam = optax.adam(5e-4)\n",
  " svi_updater = svi_update(model, guide, adam)\n",
  " \n",
  " # initialize parameters\n",
- " alpha = jnp.array(2.0)\n",
- " beta = jnp.array(2.0)\n",
+ " alpha = jnp.array(15.0)\n",
+ " beta = jnp.array(15.0)\n",
  " \n",
  " params = (alpha, beta)\n",
  " opt_state = adam.init(params)\n",
@@ -483,7 +422,7 @@
  " _ = svi_updater(key, data, params, opt_state)\n",
  "\n",
  " losses = []\n",
- " for step in range(2000):\n",
+ " for step in range(5000):\n",
  " key, sub_key = jax.random.split(key)\n",
  " params, loss, opt_state = svi_updater(key, data, params, opt_state)\n",
  " losses.append(loss)\n",
@@ -512,24 +451,15 @@
 },
 {
  "cell_type": "code",
- "execution_count": 14,
+ "execution_count": null,
  "id": "3a3fe4d2-86b2-422b-9de4-fb2da6b1b95b",
  "metadata": {},
- "outputs": [
-  {
-   "name": "stdout",
-   "output_type": "stream",
-   "text": [
-    "\n",
-    "Based on the data and our prior belief, the fairness of the coin is 0.627 +- 0.208\n"
-   ]
-  }
- ],
+ "outputs": [],
  "source": [
  "data = []\n",
- "for _ in range(9):\n",
+ "for _ in range(8):\n",
  " data.append(True)\n",
- "for _ in range(1):\n",
+ "for _ in range(2):\n",
  " data.append(False)\n",
  "\n",
  "data = jnp.array(data)\n",
@@ -555,7 +485,7 @@
 },
 {
  "cell_type": "code",
- "execution_count": 15,
+ "execution_count": null,
  "id": "d56ad497-64b4-454f-b49f-86a7c5afe3a7",
  "metadata": {},
  "outputs": [],
@@ -595,19 +525,10 @@
 },
 {
  "cell_type": "code",
- "execution_count": 16,
+ "execution_count": null,
  "id": "69ca6d58-defb-4fc3-85ae-ca3302015a8b",
  "metadata": {},
- "outputs": [
-  {
-   "name": "stdout",
-   "output_type": "stream",
-   "text": [
-    "\n",
-    "Based on the data and our prior belief, the fairness of the coin is 0.628 +- 0.209\n"
-   ]
-  }
- ],
+ "outputs": [],
  "source": [
  "run_experiment(key, data, iwelbo)"
  ]
@@ -638,7 +559,7 @@
 },
 {
  "cell_type": "code",
- "execution_count": 17,
+ "execution_count": null,
  "id": "2ffec31d-6e9f-4cf1-ba2c-30c3c7275265",
  "metadata": {},
  "outputs": [],
