Kurikulum
Kurikulum
Kurikulum
Abstract: Online learning initia- took the initiative to post course materials, those described above (http://www.saylor.
tives over the past decade have including video, in widely varying for- org).
become increasingly comprehen- mats. Some adopted the use of Khan- In the fall of 2011, a highly publicized
sive in their selection of courses style videos or tablet-based screencasts of online course, Introduction to Artificial
and sophisticated in their presen- the sort popularized by the Kahn Acad- Intelligence (AI), was conducted by
tation, culminating in the recent emy with its vast library of instructional Stanford University Prof. Sebastian Thrun
announcement of a number of videos, which started as a viral You- and Googles Director of Research, Peter
consortium and startup activities Tube sensation and has now become its Norvig, based on the Stanford AI course.
that promise to make a university own well-funded institution (http://www. It ran live in the sense that new videos
education on the internet, free of khanacademy.org). were released and homework assignments
charge, a real possibility. At this YouTube indeed became the destina- collected on a weekly basis, and quizzes
pivotal moment it is appropriate to tion of many academic videos, which are and exams were given at set times, while
explore the potential for obtaining now aggregated by institution under discussion logs allowed for some degree of
comprehensive bioinformatics interaction. The course attracted 160,000
YouTube EDU (http://www.youtube.
training with currently existing free students from 190 countries, 22,000 of
video resources. This article pre- com/education). Apple has also put its
distinctive stamp on online learning whom finished successfully and were
sents such a bioinformatics curric-
with iTunes U (http://www.apple.com/ granted certificates of completion [1].
ulum in the form of a virtual course
education/itunes-u), also organized by Shortly afterwards, MIT set up a similar
catalog, together with editorial
commentary, and an assessment institution but with integrated search approach on a new platform called MITx,
of strengths, weaknesses, and likely capability and, of course, deployment to offering a course in electronic circuits that
future directions for open online iPad and iPhone apps. Countless aggrega- attracted comparable numbers of students
learning in this field. tors also assemble collections of video (https://6002x.mitx.mit.edu).
courses, but generally with little value The trend to structured presentation
added. and high production quality then acceler-
ated remarkably, and took an entrepre-
Online Learning Comes of Age Yale University began in 2007 to release
Open Yale Courses (http://oyc.yale.edu) neurial turn. The AI course was effectively
Online academic courseware at the in a more curated and consistent format spun off by Prof. Thrun into a Web
university level has now been available to than most other efforts, including high- startup called Udacity (http://www.
the public for a decade, the earliest udacity.com), which is currently live with
quality video and extensive syllabi; courses
concerted effort having originated in six courses. In April of 2012, two other
appeared incrementally, with just under
2002 with the Massachusetts Institute of Stanford scientists, Profs. Andrew Ng and
50 available to date. Then, in 2011, MIT
Technology (MIT) and their OpenCour- Daphne Koller, announced a similar
revamped several of its online courses into
seWare initiative (http://ocw.mit.edu). newco called Coursera (https://www.
a much more structured instructional
This project offered up the syllabi, lecture coursera.org), with backing from major
format, with learning modules in outline
notes, quizzes, exams, and/or other study Silicon Valley venture capital firms. Cour-
form containing videos interspersed with
materials for a very large number of sera, also now live, is being stocked with
courses, at the discretion of professors self-assessment and other activities. In a
courses from academic partners Stanford,
but with strong support and encourage- somewhat different vein, the non-profit
Princeton University, the University of
ment from the MIT administration. Only Saylor Foundation compiled a compre- Pennsylvania, and the University of Mi-
in a minority of cases were videos of hensive online university curriculum com- chigan; this list was recently augmented
lectures posted. prising courses that are essentially mash- with a tranche of a dozen more top-tier
Even before this, The University of ups of video and text resources from many universities. And in May of 2012, barely
California, Berkeley, had started webcast- existing sources, including a number of six months after MIT had rolled out its
ing lectures, and eventually began posting
both audio and video for public consump- Citation: Searls DB (2012) An Online Bioinformatics Curriculum. PLoS Comput Biol 8(9): e1002632. doi:10.1371/
tion at their Berkeley Webcast site (http:// journal.pcbi.1002632
webcast.berkeley.edu), though without the Editor: Fran Lewitter, Whitehead Institute, United States of America
ancillary materials of MITs OpenCourse-
Published September 13, 2012
Ware. A number of other universities
followed suit, though seldom so extensive- Copyright: 2012 David B. Searls. This is an open-access article distributed under the terms of the Creative
Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium,
ly; among these was Stanford with its provided the original author and source are credited.
ClassX streaming service (http://classx.
Funding: The author received no specific funding for this article.
stanford.edu/ClassX) and an earlier effort
Competing Interests: The author has declared that no competing interests exist.
called Stanford Engineering Everywhere
(http://see.stanford.edu/see/courses.aspx). * E-mail: david.b.searls@gmail.com
In many cases, individual faculty members David B. Searls is an Associate Editor of PLOS Computational Biology.
Course Source BA DM BT SW CB
Course Source BA DM BT SW CB
Bioinformatics Tools (BT). This the-large, at a level sufficient to participate science and engineering disciplines re-
track is meant to afford the capability to in or lead the development of major levant to the sciences of complexity, infor-
develop standalone tools of significant bioinformatics systems and/or products, mation, and systems.
sophistication for bioinformatics analysis, for instance supporting data management
visualization, presentation, and local data and analysis from novel technological Independent Study
management. It requires programming platforms through complex downstream Even in a university environment, it is
skills in a variety of languages and the analysis pipelines. not unusual for the classes that are
ability to implement complex algorithms Computational Biology (CB). This necessary or desirable for a given course
efficiently, based on solid biological track is intended to prepare individuals to of study to be unavailable when needed.
domain knowledge. do original research in biological modeling Certainly the curriculum above is con-
Bioinformatics Systems (BS). This and analysis by way of advanced mathe- strained by the available online courses, as
track adds to the previous one the matical and computational techniques. It discussed below in the conclusion. In
competency for software engineering in- provides a deeper grounding in computer addition the patchwork nature of the
Course Source BA DM BT SW CB
Course Source BA DM BT SW CB
courses, arising as they do from many may be true of the Data Mining track, One undeniable truism is that indepen-
institutions, can be a strength but also a though these individuals are probably dent study requires motivation and disci-
weakness, with less opportunity for coor- more likely to be committed to a career pline in the extreme. Students must be
dination and seamless sequencing of in exclusively dry biology. committed to doing assigned readings,
course contents. As in academia, any gaps Students in the two software tracks, exercises and assessments faithfully to
can be addressed, or special interests Bioinformatics Tools and Bioinformatics achieve maximum uptake, the more so
accommodated, by independent study. Systems, may wish to take additional for being on their own. A companion
The major disadvantage is the lack of a courses in subjects such as machine article by the author, Ten Simple Rules
faculty mentor, which requires students to architecture, operating systems, or theory for Online Learning [44], attempts to
be proactive, self-sufficient, and conscien- of programming languages, but by far the provide practical advice along these lines.
tious in discerning the needs and means most important requirement for indepen- A particular piece of advice it offers is to
for supplementing their coursework. Per- dent study is actual programming experi- pay special attention to doing program-
haps the best way to approach this is for ence. These individuals would be well ming projects in the biological domain.
students to make a habit of reading the key advised to take on substantial projects in One great risk to the proposition of online
journals in their field so as to discover the biological domain that go beyond the bioinformatics education is that students
systematic gaps in their knowledge. requirements of the courses taken. never really get to grips with applying
The type of independent study needed Finally, the Computational Biology newfound computational or analytic skills
will depend on the background of the track may call for independent study in a to real biological data and actual problems
student and on the track they are follow- variety of topics in advanced mathematics in the full context of the scientific estab-
ing. Some suggestions for individual tracks and computer science as well as biological lishment. To be sure, biological databases
are indicated by plus signs in Tables 14. background necessary for a particular are readily accessible and datasets may be
A plus to the right of a course symbol specialization. The curriculum offered found online that can serve as challenge
(whether prerequisite or core) indicates here is slanted toward systems biology in problems for classification, and so forth.
that advanced work in the topic area of this regard, but individuals may prefer to But that is not the same as the interactive
that course is recommended for students in study topics such as evolutionary dynamics process of designing a novel experimental
that track. Often some specific suggestions or mathematical genetics that would program, acquiring data direct from
for additional study are indicated in the require additional study. instrumentation, cleaning and reducing
Going Further sections of the course it, and taking responsibility for storing it
catalog, but where specialized courses are Conclusion in both persistent and queryable form. Nor
not to be found online (as is likely), one does classroom learning by itself, virtual or
hopes that the basic course has provided As noted at the outset, any proposed otherwise, fully prepare one for establish-
sufficient background for the student to curriculum must be based on the shifting ing real-world error models, dealing with
learn by self-study of more advanced texts sands of available offerings, and moreover missing data, establishing a statistical case
and journals. is necessarily a matter of opinion, both for some result, arguing and defending
For Bioinformatics Analysis, additional scientific and pedagogical. Without a scientific positions, navigating the publica-
biology coursework or other study would doubt there are gaps, and quality is not tion process, and sundry other practical
be required for the student to approach uniform. For instance, there are few skills.
problems with the expected degree of suitable resources in important areas such Thus, a useful adjunct to online learning
domain sophistication, so that interpreta- as neuroscience and structural biology, in bioinformatics might be a portfolio of
tions of data are placed in an appropriate and several other areas are thin. But the suggested projects based on real-world
biological context. Ideally this would offerings are only getting better and more datasets that would help exercise the skills
include exposure to laboratory science, numerous, and so any imperfections in the of trainees, perhaps in the context of an
which of course is unlikely in the case of current collection should be increasingly online community of peers. One can even
online learners. However, it is expected easy to correct with the passage of time. A imagine a future in which the use of virtual
that many individuals embarking on this more pertinent question is whether an laboratories makes it possible for students
track would already be degreed biologists online education is an adequate substitute to undertake mixed wet/dry studies of
who are seeking additional training to do for what is termed a resident education, in their own. Just as the Amazon Cloud now
advanced analyses with their own data or general and in the particular case of makes large-scale computing accessible
that of others. To some degree the same bioinformatics. and economically feasible without the
References
1. Markoff J (18 Apr 2012) Online education 15. Aho AV, Ullman JD (1994) Foundations of bridge University Press. 640 p. Available: http://
venture lures cash infusion and deals with 5 top computer science. San Francisco, CA: W.H. www.inference.phy.cam.ac.uk/mackay/itila. Ac-
universities. The New York Times. Available: Freeman. 786 p. Available: http://i.Stanford. cessed 16 August 2012.
http://www.nytimes.com/2012/04/18/ edu/,ullman/focs.html. Accessed 16 August 30. Oppenheim AV, Willsky AS, Hamid S (1996)
t ec hn olo gy /co ur sera -pl an s-t o- an noun ce - 2012. Signals and systems (2nd edition). Englewood
university-partners-for-online-classes.html. Ac- 16. Sipser M (1997) Introduction to the theory of Cliffs, NJ: Prentice Hall.
cessed 16 August 2012. computation. Boston, MA: PWS Publishing. 31. Gray RM, Davisson LD (2010) Introduction to
2. Means B, Toyama Y, Murphy R, Bakia M, Jones 396 p. statistical signal processing. Cambridge, UK:
K (SRI International) (2009) Evaluation of 17. Gurari E (1989) An introduction to the theory of Cambridge University Press. 478 p. Available:
evidence-based practices in online learning: a computation. New York, NY: Computer Science http://ee.stanford.edu/,gray/sp.html. Accessed
meta-analysis and review of online learning Press. 314 p. Available: http://www.cse.ohio- 16 August 2012.
studies. Final Report September 2010. Washing- state.edu/,gurari/theory-bk/theory-bk.html. 32. Evans D (2011) Introduction to computing:
ton (D.C.): Department of Education. Contract Accessed 16 August 2012. explorations in language, logic, and machines.
number ED-04-CO-0040 Task 0006. 66 p. 18. Graham RL, Knuth DE, Patashnik O (1989) Charleston, SC: CreateSpace. 266 p. Available:
Available: http://www2.ed.gov/rschstat/eval/ Concrete mathematics. Reading, MA: Addison- http://www.computingbook.org. Accessed 16
tech/evidence-based-practices/finalreport.pdf. Wesley. 625 p. August 2012.
Accessed 16 August 2012. 19. Bender EA, Williamson SG (2004) A short course 33. Abelson H, Sussman GJ, Sussman J (1996)
3. Mayer RE (2001) Multimedia learning. New in discrete mathematics. New York: Dover. 256 p. Structure and interpretation of computer pro-
York, NY: Cambridge University Press. Available: http://cseweb.ucsd.edu/,gill/ grams. 2nd edition. Cambridge, MA: MIT Press.
4. Garrett RH, Grisham CM (2004) Biochemistry. BWLectSite. Accessed 16 August 2012. Available: http://mitpress.mit.edu/sicp/full-
3rd edition. St. Paul, MN: Brooks/Cole Publish- 20. Flagolet P, Sedgewick R (2012) Analytic combi- text/book/book.html. Accessed 16 August 2012.
ing. Available: http://www.web.virginia.edu/ natorics. Cambridge: Cambridge University. 824 34. Bates B, Sierra K (2003) Head first java: your
Heidi/home.htm p. Available: http://ac.cs.princeton.edu/home. brain on java - a learners guide. Sebastopol, CA:
5. Strachan T, Reed A (2010) Human molecular Accessed 16 August 2012. OReilly Media.
genetics. 4th edition. New York: Garland Science. 21. Bender EA, Williamson SG (2006) Foundations of 35. Cormen TH, Leiserson CE, Rivest RL, Stein C
807 p. combinatorics with applications. New York: (2009) Introduction to algorithms. 3rd edition.
6. Strang G (1991) Calculus. Wellesley, MA: Well- Dover. 480 p. Available: http://cseweb.ucsd. Cambridge, MA: MIT Press.
esley-Cambridge Press. 615 p. Available: http:// edu/,gill/FoundCombSite. Accessed 16 August
36. Gusfield D (1997) Algorithms on strings, trees and
ocw.mit.edu/resources/res-18-001-calculus- 2012.
sequences: computer science and computational
online-textbook-spring-2005/textbook. Accessed 22. Wilf HS (2005) generatingfunctionology. 3rd
biology. Cambridge, UK: Cambridge University
16 August 2012. edition. Natick, MA: A.K Peters/CRC Press.
Press. 556 p.
7. Williamson SG (1987) Top-down calculus. Rock- 245 p. Available: http://www.math.upenn.edu/
37. Jones NC, Pevzner PA (2004) An introduction to
ville, MD: Computer Science Press. 429 p. ,wilf/DownldGF.html. Accessed 16 August
bioinformatics algorithms. Cambridge, MA: MIT
Available: http://cseweb.ucsd.edu/,gill/ 2012.
Press.
TopDownCalcSite. Accessed 16 August 2012. 23. Easley D, Kleinberg J (2010) Networks, crowds
8. Kaw A, Kalu EE (2011) Numerical methods with and markets: reasoning about a highly connected 38. Russell S, Norvig P (2009) Artificial intelligence: a
applications. Raleigh, NC: Lulu. 740 p. Available: world. Cambridge, UK: Cambridge University modern approach. 3rd edition. Englewood Cliffs,
http://numericalmethods.eng.usf.edu/topics/ Press. 744 p. Available: http://www.cs.cornell. NJ: Prentice Hall. 1152 p.
textbook_index.html. Accessed 16 August 2012. edu/home/kleinber/networks%2Dbook. Ac- 39. Rowe NC (1988) Artificial intelligence through
9. Strang G (2007) Computational science and cessed 16 August 2012. prolog. 2nd edition. Englewood Cliffs, NJ:
engineering. Wellesley, MA: Wellesley-Cam- 24. Boyd S, Vandenberghe L (2004) Convex optimi- Prentice Hall. 481 p. Available: http://faculty.
bridge Press. 713 p. zation. Cambridge, UK: Cambridge University nps.edu/ncrowe/book/book.html. Accessed 16
10. Krijnen WP (2009) Applied statistics for bioinfor- Press. 730 p. Available: http://www.stanford. August 2012.
matics using R. Available: http://cran.r-project. edu/,boyd/cvxbook. Accessed 16 August 2012. 40. Sowa JF (2000) Knowledge representation. Pacific
org/doc/contrib/Krijnen-IntroBioInfStatistics. 25. Luke S (2009) Essentials of metaheuristics. Grove, CA: Brooks Cole Publishing. 594 p.
pdf.Accessed 16 August 2012. Raleigh, NC: Lulu. 230 p. Available: http://cs. 41. Abu-Mostafa YS, Magdon-Ismail M, Lin H-T
11. Grinstead CM, Snell JL (1997) Introduction to gmu.edu/,sean/book/metaheuristics. Accessed (2012) Learning from data. Pasadena: AMLBook.
probability. New York: American Mathematical 16 August 2012. 42. Hastie T, Tibshirani R, Friedman J (2009) The
Society. 510 p. Available: http://www. 26. Poli R, Langdon WB, McPhee NF (2008) A field elements of statistical learning: data mining,
dartmouth.edu/,chance/teaching_aids/books_ guide to genetic programming. Raleigh, NC: inference, and prediction. 2nd edition. New York:
articles/probability_book/book.html. Accessed Lulu. 252 p. Available: http://www.gp-field- : Springer. 768 p. Available: http://www-stat.
16 August 2012. guide.org.uk. Accessed 16 August 2012. stanford.edu/,tibs/ElemStatLearn. Accessed 16
12. Wasserman L (2003) All of statistics. New York: 27. Cover TM, Thomas JA (1991) Elements of August 2012.
Springer. 461 p. information theory. New York: Wiley. 748 p. 43. Bird S, Klein E, Loper E (2009) Natural language
13. Ewens WJ, Grant GR (2001) Statistical methods 28. Gray RM (2011) Entropy and information. 2nd processing with python. Sebastopol, CA: OReilly
in bioinformatics. New York: Springer. 476 p. edition. New York: Springer. 436 p. Available: Media. Available: http://www.nltk.org/book.
14. Gray RM (2010) Probability, random processes, http://ee.stanford.edu/,gray/it.html. Accessed Accessed 16 August 2012.
and ergodic properties. 2nd edition. New York: 16 August 2012. 44. Searls DB (2012) Ten simple rules for online
Springer. 357 p. Available: http://ee.stanford. 29. MacKay D (2003) Information theory, inference, learning. PLoS Comp Biol 8: e1002631.
edu/,gray/arp.html. Accessed 16 August 2012. and learning algorithms. Cambridge, UK: Cam- doi:10.1371/journal.pcbi.1002631