更新了09.02

lijin-THU · lijin-THU · commit 53d1aead8f68 · 2015-10-08T18:20:10.000+08:00
diff --git a/09. theano/09.02 theano basics.ipynb b/09. theano/09.02 theano basics.ipynb
@@ -484,7 +484,285 @@
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "Theano"
+    "`Theano` 中可以定义共享的变量，它们可以在多个函数中被共享。"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 19,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "TensorType(float64, matrix)\n"
+     ]
+    }
+   ],
+   "source": [
+    "shared_var = theano.shared(np.array([[1.0, 2.0], [3.0, 4.0]]))\n",
+    "\n",
+    "print shared_var.type"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "可以通过 `set_value` 方法改变它的值："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 20,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "shared_var.set_value(np.array([[3.0, 4], [2, 1]]))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "通过 `get_value()` 方法返回它的值："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 21,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array([[ 3.,  4.],\n",
+       "       [ 2.,  1.]])"
+      ]
+     },
+     "execution_count": 21,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "shared_var.get_value()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "共享变量进行运算："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 22,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "[[  9.  16.]\n",
+      " [  4.   1.]]\n"
+     ]
+    }
+   ],
+   "source": [
+    "shared_square = shared_var ** 2\n",
+    "\n",
+    "f = theano.function([], shared_square)\n",
+    "\n",
+    "print f()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "这里函数不需要参数，因为共享变量隐式地被认为是一个参数。\n",
+    "\n",
+    "得到的结果会随这个共享变量的变化而变化："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 23,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "[[  1.   4.]\n",
+      " [  9.  16.]]\n"
+     ]
+    }
+   ],
+   "source": [
+    "shared_var.set_value(np.array([[1.0, 2], [3, 4]]))\n",
+    "\n",
+    "print f()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "一个共享变量的值可以用 `updates` 关键词在 `theano` 函数中被更新："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 24,
+   "metadata": {
+    "collapsed": true
+   },
+   "outputs": [],
+   "source": [
+    "subtract = T.matrix('subtract')\n",
+    "\n",
+    "f_update = theano.function([subtract], shared_var, updates={shared_var: shared_var - subtract})"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "这个函数先返回当前的值，然后将当前值更新为原来的值减去参数："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 25,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "before update:\n",
+      "[[ 1.  2.]\n",
+      " [ 3.  4.]]\n",
+      "the return value:\n",
+      "[[ 1.  2.]\n",
+      " [ 3.  4.]]\n",
+      "after update:\n",
+      "[[ 0.  1.]\n",
+      " [ 2.  3.]]\n"
+     ]
+    }
+   ],
+   "source": [
+    "print 'before update:'\n",
+    "print shared_var.get_value()\n",
+    "\n",
+    "print 'the return value:'\n",
+    "print f_update(np.array([[1.0, 1], [1, 1]]))\n",
+    "\n",
+    "print 'after update:'\n",
+    "print shared_var.get_value()"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## 导数"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "`Theano` 的一大好处在于它对符号变量计算导数的能力。\n",
+    "\n",
+    "我们用 `T.grad()` 来计算导数，之前我们定义了 `foo` 和 `bar` （分别是 $x$ 和 $x^2$）,我们来计算 `bar` 关于 `foo` 的导数（应该是 $2x$）："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 26,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "array(20.0)"
+      ]
+     },
+     "execution_count": 26,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "bar_grad = T.grad(bar, foo)  # 表示 bar (x^2) 关于 foo (x) 的导数\n",
+    "\n",
+    "bar_grad.eval({foo: 10})"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "再如，对之前的 $y = Ax + b$ 求 $y$ 关于 $x$ 的雅可比矩阵（应当是 $A$）："
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 27,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [
+    {
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "[[ 9.  8.  7.]\n",
+      " [ 4.  5.  6.]]\n"
+     ]
+    },
+    {
+     "name": "stderr",
+     "output_type": "stream",
+     "text": [
+      "C:\\Anaconda\\lib\\site-packages\\theano\\scan_module\\scan_perform_ext.py:133: RuntimeWarning: numpy.ndarray size changed, may indicate binary incompatibility\n",
+      "  from scan_perform.scan_perform import *\n"
+     ]
+    }
+   ],
+   "source": [
+    "y_J = theano.gradient.jacobian(y, x)\n",
+    "\n",
+    "print y_J.eval({A: np.array([[9.0, 8, 7], [4, 5, 6]]), #A\n",
+    "                x: np.array([1.0, 2, 3]),              #x\n",
+    "                b: np.array([4.0, 5])})                #b"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "`theano.gradient.jacobian` 用来计算雅可比矩阵，而 `theano.gradient.hessian` 可以用来计算 `Hessian` 矩阵。"
    ]
   }
  ],
diff --git a/README.md b/README.md
@@ -165,6 +165,9 @@ conda update anaconda
 	 - [08.11 接口](08. object-oriented programming/08.11 interfaces.ipynb)
 	 - [08.12 共有，私有和特殊方法和属性](08. object-oriented programming/08.12 public private special in python.ipynb)
 	 - [08.13 多重继承](08. object-oriented programming/08.13 multiple inheritance.ipynb)
+- [09. **Theano**](09. theano)
+	 - [09.01 Theano 简介及其安装](09. theano/09.01 introduction and installation.ipynb)
+	 - [09.02 Theano 基础](09. theano/09.02 theano basics.ipynb)
 - [10. **有趣的主题**](10. something interesting)
 	 - [10.01 使用 basemap 画地图](10. something interesting/10.01 maps using basemap.ipynb)
-	 - [10.02 使用 cartopy 画地图](10. something interesting/10.02 maps using cartopy.ipynb)
+	 - [10.02 使用 cartopy 画地图](10. something interesting/10.02 maps using cartopy.ipynb)
diff --git a/generate index.ipynb b/generate index.ipynb
@@ -42,6 +42,7 @@
     "           '06. matplotlib',\n",
     "           '07. interfacing with other languages',\n",
     "           '08. object-oriented programming',\n",
+    "           '09. theano',\n",
     "           '10. something interesting'\n",
     "          ]"
    ]
@@ -69,6 +70,7 @@
     "           '06. **Matplotlib**',\n",
     "           '07. **使用其他语言进行扩展**',\n",
     "           '08. **面向对象编程**',\n",
+    "           '09. **Theano**',\n",
     "           '10. **有趣的主题**']"
    ]
   },
@@ -238,6 +240,9 @@
       "    08.11 接口\n",
       "    08.12 共有，私有和特殊方法和属性\n",
       "    08.13 多重继承\n",
+      "09. Theano\n",
+      "    09.01 Theano 简介及其安装\n",
+      "    09.02 Theano 基础\n",
       "10. 有趣的主题\n",
       "    10.01 使用 basemap 画地图\n",
       "    10.02 使用 cartopy 画地图\n"
diff --git a/index.ipynb b/index.ipynb
@@ -185,6 +185,9 @@
     "\t - [08.11 接口](08. object-oriented programming/08.11 interfaces.ipynb)\n",
     "\t - [08.12 共有，私有和特殊方法和属性](08. object-oriented programming/08.12 public private special in python.ipynb)\n",
     "\t - [08.13 多重继承](08. object-oriented programming/08.13 multiple inheritance.ipynb)\n",
+    "- [09. **Theano**](09. theano)\n",
+    "\t - [09.01 Theano 简介及其安装](09. theano/09.01 introduction and installation.ipynb)\n",
+    "\t - [09.02 Theano 基础](09. theano/09.02 theano basics.ipynb)\n",
     "- [10. **有趣的主题**](10. something interesting)\n",
     "\t - [10.01 使用 basemap 画地图](10. something interesting/10.01 maps using basemap.ipynb)\n",
     "\t - [10.02 使用 cartopy 画地图](10. something interesting/10.02 maps using cartopy.ipynb)\n"
diff --git a/index.md b/index.md