85308
85308
85308
com
OR CLICK HERE
DOWLOAD NOW
Making
Presentation
Math Computable
A Context-Sensitive Approach
for Translating LaTeX to Computer
Algebra Systems
Making Presentation Math Computable
André Greiner-Petter
Making Presentation
Math Computable
A Context-Sensitive Approach for
Translating LaTeX to Computer
Algebra Systems
André Greiner-Petter
Berlin, Germany
© The Editor(s) (if applicable) and The Author(s) 2023. This book is an open access publication.
Open Access This book is licensed under the terms of the Creative Commons Attribution 4.0
International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing,
adaptation, distribution and reproduction in any medium or format, as long as you give appropriate
credit to the original author(s) and the source, provide a link to the Creative Commons license and
indicate if changes were made.
The images or other third party material in this book are included in the book’s Creative Commons
license, unless indicated otherwise in a credit line to the material. If material is not included in the
book’s Creative Commons license and your intended use is not permitted by statutory regulation or
exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this
publication does not imply, even in the absence of a specific statement, that such names are exempt
from the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors, and the editors are safe to assume that the advice and information in this
book are believed to be true and accurate at the date of publication. Neither the publisher nor the
authors or the editors give a warranty, expressed or implied, with respect to the material contained
herein or for any errors or omissions that may have been made. The publisher remains neutral with
regard to jurisdictional claims in published maps and institutional affiliations.
This Springer Vieweg imprint is published by the registered company Springer Fachmedien
Wiesbaden GmbH, part of Springer Nature.
The registered company address is: Abraham-Lincoln-Str. 46, 65189 Wiesbaden, Germany
Front Matter
Contents
FRONT MATTER . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i
List of Figures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
List of Tables . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Abstract . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Zusammenfassung . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Acknowledgements . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
CHAPTER 1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Motivation & Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.2 Research Gap . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
1.3 Research Objective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.4 Thesis Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.4.1 Publications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.4.2 Research Path . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
CHAPTER 2
Mathematical Information Retrieval . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17
2.1 Background and Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
2.2 Mathematical Formats and Their Conversions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.2.1 Web Formats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
2.2.2 Word Processor Formats. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
2.2.3 Computable Formats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
2.2.4 Images and Tree Representations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
2.2.5 Math Embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
2.3 From Presentation to Content Languages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
2.3.1 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
2.3.2 Benchmarking MathML . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
2.3.3 Evaluation of Context-Agnostic Conversion Tools . . . . . . . . . . . . . . . . . . . . . . . 48
2.3.4 Summary of MathML Converters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
2.4 Mathematical Information Retrieval for LaTeX Translations . . . . . . . . . . . . . . . . . . . . . 51
v
CHAPTER 3
Semantification of Mathematical LaTeX . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
3.1 Semantification via Math-Word Embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59
3.1.1 Foundations and Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
3.1.2 Semantic Knowledge Extraction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
3.1.3 On Overcoming the Issues of Knowledge Extraction Approaches . . . . . . . 68
3.1.4 The Future of Math Embeddings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
3.2 Semantification with Mathematical Objects of Interest . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
3.2.1 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
3.2.2 Data Preparation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
3.2.3 Frequency Distributions of Mathematical Formulae . . . . . . . . . . . . . . . . . . . . . 76
3.2.4 Relevance Ranking for Formulae . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
3.2.5 Applications . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
3.2.6 Outlook . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3.3 Semantification with Textual Context Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3.3.1 Semantification, Translation & Evaluation Pipeline . . . . . . . . . . . . . . . . . . . . . . 91
CHAPTER 4
From LaTeX to Computer Algebra Systems . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
4.1 Context-Agnostic Neural Machine Translation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
4.1.1 Training Datasets & Preprocessing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 96
4.1.2 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
4.1.3 Evaluation of the Convolutional Network . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97
4.2 Context-Sensitive Translation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
4.2.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
4.2.2 Related Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
4.2.3 Formal Mathematical Language Translations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 104
4.2.4 Document Pre-Processing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
4.2.5 Annotated Dependency Graph Construction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
4.2.6 Semantic Macro Replacement Patterns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
CHAPTER 5
Qualitative and Quantitative Evaluations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113
5.1 Evaluations on the Digital Library of Mathematical Functions . . . . . . . . . . . . . . . . . . . 114
5.1.1 The DLMF dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
5.1.2 Semantic LaTeX to CAS translation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
5.1.3 Evaluation of the DLMF using CAS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
5.1.4 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 128
5.1.5 Conclude Quantitative Evaluations on the DLMF . . . . . . . . . . . . . . . . . . . . . . . . 131
5.2 Evaluations on Wikipedia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
5.2.1 Symbolic and Numeric Testing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
5.2.2 Benchmark Testing . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
5.2.3 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 134
5.2.4 Error Analysis & Discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137
5.2.5 Conclude Qualitative Evaluations on Wikipedia . . . . . . . . . . . . . . . . . . . . . . . . . 139
vi Contents
CHAPTER 6
Conclusion and Future Work. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
6.1 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 141
6.2 Contributions and Impact of the Thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150
6.3 Future Work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 153
6.3.1 Improved Translation Pipeline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154
6.3.2 Improve LaTeX to MathML Converters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 155
6.3.3 Enhanced Formulae in Wikipedia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156
6.3.4 Language Independence . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
Contents vii
Front Matter
List of Figures
ix
Front Matter
List of Tables
xi
FRONT MATTER
Abstract
This thesis addresses the issue of translating mathematical expressions from LATEX to the syntax
of Computer Algebra Systems (CAS). Over the past decades, especially in the domain of Science,
Technology, Engineering, and Mathematics (STEM), LATEX has become the de-facto standard
to typeset mathematical formulae in publications. Since scientists are generally required to
publish their work, LATEX has become an integral part of today’s publishing workflow. On the
other hand, modern research increasingly relies on CAS to simplify, manipulate, compute, and
visualize mathematics. However, existing LATEX import functions in CAS are limited to simple
arithmetic expressions and are, therefore, insufficient for most use cases. Consequently, the
workflow of experimenting and publishing in the Sciences often includes time-consuming and
error-prone manual conversions between presentational LATEX and computational CAS formats.
To address the lack of a reliable and comprehensive translation tool between LATEX and CAS,
this thesis makes the following three contributions.
First, it provides an approach to semantically enhance LATEX expressions with sufficient semantic
information for translations into CAS syntaxes. This, so called, semantification process analyzes
the structure of the formula and its textual context to conclude semantic information. The
research for this semantification process additionally contributes towards related Mathematical
Information Retrieval (MathIR) tasks, such as mathematical education assistance, math recom-
mendation and question answering systems, search engines, automatic plagiarism detection,
and math type assistance systems.
Second, this thesis demonstrates the first context-aware LATEX to CAS translation framework
LACAST. LACAST uses the developed semantification approach to transform LATEX expressions
into an intermediate semantic LATEX format, which is then further translated to CAS based
on translation patterns. These patterns were manually crafted by mathematicians to assure
accurate and reliable translations. In comparison, this thesis additionally elaborates a non-
context aware neural machine translation approach trained on a mathematical library generated
by Mathematica.
Third, the thesis provides a novel approach to evaluate the performance for LATEX to CAS
translations on large-scaled datasets with an automatic verification of equations in digital math-
ematical libraries. This evaluation approach is based on the assumption that equations in digital
mathematical libraries can be computationally verified by CAS, if a translation between both
systems exists. In addition, the thesis provides an in-depth manual evaluation on mathematical
articles from the English Wikipedia.
The presented context-aware translation framework LACAST increases the efficiency and reliability
of translations to CAS. Via LACAST, we strengthened the Digital Library of Mathematical Functions
(DLMF) by identifying numerous of issues, from missing or wrong semantic annotations to sign
errors. Further, via LACAST, we were able to discover several issues with the commercial CAS
Maple and Mathematica. The fundamental approaches to semantically enhance mathematics
developed in this thesis additionally contributed towards several related MathIR tasks. For
xiii
instance, the large-scale analysis of mathematical notations and the studies on math-embeddings
motivated new approaches for math plagiarism detection systems, search engines, and allow
typing assistance for mathematical inputs. Finally, LACAST translations will have a direct real-
world impact, as they are scheduled to be integrated into upcoming versions of the DLMF and
Wikipedia.
xiv Abstract
FRONT MATTER
Zusammenfassung
Diese Dissertation befasst sich mit der Problematik von Übersetzungen mathematischer For-
meln zwischen LATEX und Computeralgebrasystemen (CAS). Im Laufe des digitalen Zeitalters
wurde LATEX zum Quasistandard für das Schreiben mathematischer Formeln auf dem Computer,
insbesondere in den Disziplinen Mathematik, Informatik, Naturwissenschaften und Technik
(MINT). Da Wissenschaftler gemeinhin ihre Arbeit publizieren, ist LATEX zu einem integralen
Bestandteil moderner Forschung geworden. Gleichermaßen verlassen sich Wissenschaftler
immer mehr auf die Möglichkeiten moderner CAS, um effektiv mit mathematischen Formeln
zu arbeiten, zum Beispiel, indem sie diese umformen, lösen oder auch visualisieren. Die mo-
mentanen Ansätze, welche eine Übersetzung von LATEX zu CAS erlauben, wie beispielsweise
interne Import-Funktionen einiger CAS, sind jedoch häufig auf einfache arithmetische Aus-
drücke beschränkt und daher nur wenig hilfreich im realen Arbeitsalltag. Infolgedessen ist die
Arbeit moderner Wissenschaftler in den MINT Disziplinen häufig geprägt von zeitraubenden
und fehleranfälligen manuellen Übersetzungen zwischen LATEX und CAS.
Die vorliegende Dissertation leistet die folgenden Beiträge, um das Problem des Übersetzens
von mathematischen Ausdrücken zwischen LATEX und CAS zu lösen.
Zunächst ist LATEX ein Format, welches lediglich die visuelle Präsentation mathematischer Aus-
drücke kodiert, nicht jedoch deren semantische Informationen. Die semantischen Informationen
sind jedoch notwendig für CAS, welche keine mehrdeutigen Eingaben erlauben. Daher führt
die vorliegende Arbeit als ersten Schritt für eine Übersetzung eine sogenannte Semantifizierung
mathematischer Ausdrücke ein. Diese Semantifizierung extrahiert semantische Informationen
aus dem Kontext und den Bestandteilen der Formel, um Rückschlüsse auf ihre Bedeutung zu
ziehen. Da die Semantifizierung eine klassische Aufgabe auf dem Gebiet der mathematischen
Informationsgewinnung darstellt, leistet dieser Teil der Dissertation auch Beiträge zu verwand-
ten Themengebieten. So sind die hier vorgestellten Ansätze auch nützlich für pädagogische
Programme, Frage-Antwort Systeme, Suchmaschinen und die digitale Plagiatserkennung.
Als zweiten Beitrag, stellt die vorliegende Dissertation das erste kontextbezogene LATEX zu
CAS Übersetzungsprogramm vor, genannt LACAST. LACAST nutzt die zuvor eingeführte Seman-
tifizierung, um LATEX in ein Zwischenformat zu transformieren, welches die semantischen
Informationen explizit darstellt. Dieses Format wird semantisches LATEX genannt, da es eine
technische Erweiterung von LATEX ist. Die weitere Übersetzung zu CAS wird durch heuristi-
sche Übersetzungsmuster für mathematische Funktionen realisiert. Diese Übersetzungsmuster
wurden in Zusammenarbeit mit Mathematikern definiert, um eine korrekte Übersetzung in
diesem letzten Schritt zu gewährleisten. Um die Vorzüge einer kontextbezogenen Übersetzung
besser zu verstehen, stellt diese Arbeit zum Vergleich auch eine Maschinenübersetzung auf
neuronalen Netzen vor, welche den Kontext einer Formel nicht berücksichtigt.
Der dritte Beitrag dieser Dissertation führt eine neue Methode zur Evaluierung von mathe-
matischen Übersetzungen ein, welche es erlaubt, auch eine große Anzahl an Übersetzungen
auf ihre Korrektheit hin zu überprüfen. Diese Methode folgt dem Ansatz, dass Gleichungen
xv
in mathematischen Bibliotheken auch nach der Übersetzung in ein CAS noch korrekt sein
müssten. Ist dies nicht der Fall, ist entweder die Ausgangsgleichung, die Übersetzung, oder
das CAS fehlerhaft. Hierbei ist zu beachten, dass jede Fehlerquelle einen Mehrwert für das
jeweilige System darstellt. Zusätzlich zu dieser automatischen Evaluierung, erfolgt noch eine
manuelle Analyse von Übersetzungen auf Basis englischer Wikipedia Artikel.
Zusammenfassend ermöglicht das kontextbezogene Übersetzungsprogramm LACAST eine effizi-
entere Arbeitsweise mit CAS. Mit Hilfe dieser Übersetzungen konnten auch mehrere Probleme,
wie falsche Informationen oder Vorzeichenfehler, in der Digital Library of Mathematical Func-
tions (DLMF) sowie Fehler in den kommerziell vertriebenen CAS Maple und Mathematica
automatisch aufgedeckt und behoben werden.
Die hier vorgestellte Grundlagenforschung zum semantischen Anreichern mathematischer
Ausdrücke, hat zudem etliche Beiträge zu verwandten Forschungsthemen geleistet. Zum Bei-
spiel hat die Analyse der Verteilung von mathematischen Notationen in großen Datensätzen
neue Ansätze in der digitalen Plagiatserkennung ermöglicht. Des Weiteren wird zurzeit daran
gearbeitet, die Übersetzungen von LACAST in kommende Versionen von Wikipedia und der DLMF
zu integrieren.
xvi Zusammenfassung
FRONT MATTER
Acknowledgements
This thesis would not have been possible without the tremendous help and support from nu-
merous family members, friends, colleagues, supervisors, and several international institutions.
In the following, I want to take the opportunity to thank all the individuals and organizations
that helped me along the way to make this work possible.
My first sincere wishes go to my prodigious doctoral advisers Bela Gipp and Akiko Aizawa.
Their continuous support and counsel enabled me to realize this thesis at marvelous places and
together with numerous wonderful people from all over the world. Their enduring encourage-
ment and assistance, Bela’s abiding and infectious positivity, and Akiko’s steadfast and kind
endorsement empowered my personal and professional life. Both of their competent and sincere
guidance helped me to find my way in the intricate maze of research and career decisions and
turned my often onerous time into a joyful and memorable experience.
Moreover, I am very grateful to my adviser and friend Moritz Schubotz, who supported and
guided me throughout the entire time of my doctoral thesis and even beyond. Our fruitful and
always engaging discussions, even when exhausting, enriched and positively affected most, if
not all, of my work. It is not an exaggeration to admit that my career, including my Master’s
thesis and this doctoral thesis, would not have been possible and nearly as successful and joyful
as it has been without his continuous and sincere support over the years. I am wholeheartedly
thankful for all the years we worked together.
I further wish to gratefully acknowledge my friends, colleagues, and advisers Howard Cohl,
Abdou Youssef, and Bruce Miller at the National Institute of Standards and Technology (NIST) for
their valuable advice, continuous drive to perfection, and our rewarding collaborations. I thank
Jürgen Gerhard at Maplesoft, who kindly provided me access and support for Maple on several
occasions. I am just as thankful for the assistance and support from Norman Meuschke, who
always helped me to overcome governmental and organizational hurdles, Corinna Breitinger,
who never failed to refit my gibberish, and my colleagues and friends Terry Lima Ruas and
Philipp Scharpf for many visionary discussions. I also thank all my collaborators and colleagues
with whom I had the distinct opportunity to work together, including Takuto Asakura, Fabian
Müller, Olaf Teschke, William Grosky, Marjorie McClain, Yusuke Miyao, Malte Ostendorff,
Bonita Saunders, Kenichi Iwatsuki, Takuma Udagawa, Anastasia Zhukova, and Felix Hamborg.
I further want to thank the students I worked with, including Avi Trost, Rajen Dey, Joon Bang,
Kevin Chen, and Felix Petersen. I especially appreciate the help and assistance from people
at the National Institute of Informatics (NII) to overcome governmental and daily life issues. I
wish to especially thank Rie Ayuzawa, Noriko Katsu, Akiko Takenaka, and Goran Topic.
My genuine gratitude also goes to my host organizations and those that provided financial
support for my research. I am thankful for the German Academic Exchange Program (DAAD)
for enabling two research stays at the NII in Tokyo, the NII for providing me a wonderful
work environment, the German Research Foundation (DFG) to financially support many of
my projects, the NIST for hosting me as a guest researcher, and Maplesoft for offering me
xvii
an internship during my preliminary research project on the Digital Library of Mathematical
Functions (DLMF). I finally thank the ACM Special Interest Group on Information Retrieval
(SIGIR), the University of Konstanz, the University of Wuppertal, and Maplesoft for supporting
several conference participations.
My last and most crucial gratitude goes to my family and friends, who always cheered me in
good and bad times and constantly backed and supported me so that I could selfishly pursue my
dreams. I am deeply grateful for my lovely parents Rolf & Regina, who have always been on
my side and make all this possible behind the scenes. I am also tremendously thankful for the
enduring personal support from my dear friends Kevin, Lena, Vici, Dong, Peter, Vitor, Ayuko,
and uncountably more. Finally, I thank my lovely partner Aimi for brightening even the darkest
times and pushing every possible obstacle aside. I dedicate this thesis to my lovely parents, my
dear friends, and my enchanting girlfriend.
xviii Acknowledgements
Other documents randomly have
different content
feeling of a cavern wall will grow upon him, of a cavern deep, below
roaring seas, in which the waves are there, though they do not enter
in upon him; or rather not the waves, but the very bowels of the
ocean. He will feel as though the floods surrounded him, coming and
going with their wild sounds, and he will hardly recognize that
though among them he is not in them. And they, as they fall with a
continual roar, not hurting the ear, but musical withal, will seem to
move as the vast ocean waters may perhaps move in their internal
currents. He will lose the sense of one continued descent, and think
that they are passing round him in their appointed courses. The
broken spray that rises from the depth below, rises so strongly, so
palpably, so rapidly, that the motion in every direction will seem
equal. And, as he looks on, strange colours will show themselves
through the mist; the shades of grey will become green or blue, with
ever and anon a flash of white; and then, when some gust of wind
blows in with greater violence, the sea-girt cavern will become all
dark and black. Oh, my friend, let there be no one there to speak to
thee then; no, not even a brother. As you stand there speak only to
the waters.
North America (London, 1862).
NIAGARA FALLS
(NORTH AMERICA)
CHARLES DICKENS
We called at the town of Erie, at eight o’clock that night, and lay
there an hour. Between five and six next morning, we arrived at
Buffalo, where we breakfasted; and being too near the Great Falls to
wait patiently anywhere else, we set off by the train, the same
morning at nine o’clock, to Niagara.
It was a miserable day; chilly and raw; a damp mist falling; and the
trees in that northern region quite bare and wintry. Whenever the
train halted, I listened for the roar; and was constantly straining my
eyes in the direction where I knew the Falls must be, from seeing
the river rolling on towards them; every moment expecting to behold
the spray. Within a few moments of our stopping, not before, I saw
two great white clouds rising up slowly and majestically from the
depths of the earth. That was all. At length we alighted; and then
for the first time, I heard the mighty rush of water, and felt the
ground tremble underneath my feet.
The bank is very steep, and was slippery with rain, and half-melted
ice. I hardly know how I got down, but I was soon at the bottom,
and climbing, with two English officers who were crossing and had
joined me, over some broken rocks, deafened by the noise, half-
blinded by the spray, and wet to the skin. We were at the foot of the
American Fall. I could see an immense torrent of water tearing
headlong down from some great height, but had no idea of shape,
or situation, or anything but vague immensity.
When we were seated in the little ferry-boat, and were crossing the
swollen river immediately before both cataracts, I began to feel what
it was—but I was in a manner stunned, and unable to comprehend
the vastness of the scene. It was not until I came on Table Rock,
and looked—Great Heaven, on what a fall of bright green water!—
that it came upon me in its full might and majesty.
Then, when I felt how near to my Creator I was standing, the first
effect, and the enduring one—instant and lasting—of the
tremendous spectacle, was Peace. Peace of Mind, tranquillity, calm
recollections of the Dead, great thoughts of Eternal Rest and
Happiness: nothing of gloom or terror. Niagara was at once stamped
upon my heart, an Image of Beauty; to remain there, changeless
and indelible, until its pulses cease to beat, for ever.
Oh, how the strife and trouble of daily life receded from my view,
and lessened in the distance, during the ten memorable days we
passed on that Enchanted Ground! What voices spoke from out the
thundering water; what faces, faded from the earth, looked out
upon me from its gleaming depths; what Heavenly promise glistened
in those angels’ tears, the drops of many hues, that showered
around, and twined themselves about the gorgeous arches which
the changing rainbows made!
I never stirred in all that time from the Canadian side, whither I had
gone at first. I never crossed the river again; for I knew there were
people on the other shore, and in such a place it is natural to shun
strange company. To wander to and fro all day, and see the cataracts
from all points of view; to stand upon the edge of the great Horse-
Shoe Fall, marking the hurried water gathering strength as it
approached the verge, yet seeming too, to pause before it shot into
the gulf below; to gaze from the river’s level up at the torrent as it
came streaming down; to climb the neighbouring heights and watch
it through the trees, and see the wreathing water in the rapids
hurrying on to take its fearful plunge; to linger in the shadow of the
solemn rocks three miles below; watching the river as, stirred by no
visible cause, it heaved and eddied and awoke the echoes, being
troubled yet, far down beneath its surface, by its giant leap; to have
Niagara before me, lighted by the sun and by the moon, red in the
day’s decline, and grey as evening slowly fell upon it; to look upon it
every day, and wake up in the night and hear its ceaseless voice:
this was enough.
FUJI SAN.
It is a circuit of 120 miles to go all round the base of Fuji-San. If you
could cut a tunnel through her from Yoshiwara to Kawaguchi, it
would be forty miles long. Generally speaking, the lower portion of
the mountain is cultivated to a height of 1,500 feet, and it is a whole
province which thus climbs round her. From the border of the farms
there begins a rough and wild, but flowery moorland, which
stretches round the hill to an elevation of 4,000 feet, where there
the thick forest-belt commences. This girdles the volcano up to 7,000
feet on the Subashiri side and 8,000 on the Murayama fall, but is
lower to the eastward. Above the forest extends a narrow zone of
thicket and bush, chiefly dwarfed larch, juniper and a vaccinium;
after which comes the bare, burnt, and terribly majestic peak itself,
where the only living thing is a little yellow lichen which grows in the
fissures of the lava blocks, for no eagle or hawk ventures so high,
and the boldest or most bewildered butterfly will not be seen above
the bushes half-way down.
The best—indeed, the only—time for the ascent of the mountain is
between July 15th and September 5th. During this brief season the
snow will be melted from the cone, the huts upon the path will be
opened for pilgrims, and there will be only the danger of getting
caught by a typhoon, or reaching the summit to find it swathed day
after day in clouds, and no view obtainable. Our party of three
started for the ascent on August 25th, taking that one of the many
roads by which Fuji is approached that goes by Subashiri. Such an
expedition may be divided into a series of stages. You have first to
approach the foot of the mountain by train or otherwise, then to ride
through the long slope of cultivated region. Then, abandoning
horses or vehicles, to traverse on foot the sharper slopes of the
forest belt. At the confines of this you will reach the first station,
called Sho or Go; for Japanese fancy has likened the mountain to a
heap of dry rice and the stations are named by rice-measure. From
the first station to the ninth, whatever road you take, all will be
hard, hot, continuous climbing. You must go by narrow, bad paths,
such as a goat might make, in loose volcanic dust, gritty pumice, or
over the sharp edges of lava dykes, which cut boots and sandals to
shreds....
At daybreak the horses are brought, and the six coolies, two by two,
bind upon their backs the futons and the food. We start, a long
procession, through a broad avenue in the forest, riding for five
miles, under a lovely dawn, the sun shining gloriously on the
forehead of Fuji, who seems further off and more immensely lofty
the nearer we approach. The woodland is full of wild strawberries
and flowers; including tiger-lilies, clematis, Canterbury bells, and the
blue hotari-no hana, or fire-fly blossom. At 6:30 a. m., we reach
Uma-Gayeshi, or “turn-the-horses-back”; and hence to the mountain
top there is nothing for it but to walk every step of the long, steep,
and difficult path. Two of the men with the lightest loads led the way
along the narrow path, in a wood so thick that we shall not see Fuji
again till we have passed through it. It takes us every now and then
through the gates and precincts of little Shinto temples, where the
priests offer us tea or mountain water. In one of them, at Ko-mitake,
we are invited to ring the brass gong in order that the Deity may
make our limbs strong for the task before us. And this is solemnly
done by all hands, the ninsoku slapping their brown thighs piously
after sounding the bell....
The shortest time in which the ascent has been made is six hours
and a half. We, taking it more easily, made no attempt to beat the
record, and stopped frequently to botanize, geologize, etc. The
rarefaction of the air gave our Japanese companion, Takaji San, a
slight headache, which soon passed as the circulation became
accustomed to the atmosphere; but Captain Ingles and I, being I
suppose, both in excellent health and strength, experienced no
inconvenience worth mentioning.
At half-past four next morning, while I was dreaming under my thick
coverings, a hand touched me and a voice said softly, “Danna Sama,
hi no de!” “Master, here is the sun!” The shoji at my feet were
thrown open. I looked out, almost as you might from the moon, over
a prodigious abyss of space, beyond which the eastern rim of all the
world seemed to be on fire with flaming light. A belt of splendid rose
and gold illumined all the horizon, darting long spears of glory into
the dark sky overhead, gilding the tops of a thousand hills, scattered
over the purple plains below, and casting on the unbroken
background of clouds beyond an enormous shadow of Fuji. The
spectacle was of unparalleled splendour, recalling Lord Tennyson’s
line—
We made two long stays at the Glacier House, and I never enjoyed
anything more in my life than the effect of the snug little châlet, with
its velvety lawn, in the stronghold of the giant mountains, brought
into touch with the great world twice a day by the trains east and
west, which echoed their approach and departure miles on miles
through the ranges.
On the Cars and Off (London, 1895).
MAUNA LOA
(HAWAII)
LADY BRASSEY
At 6:30 a. m., we made the island of Hawaii, rather too much to
leeward, as we had been carried by the strong current at least
eighteen miles out of our course. We were therefore obliged to beat
up to windward, in the course of which operation we passed a large
bark running before the wind—the first ship we had seen since
leaving Tahiti—and also a fine whale, blowing close to us. We could
not see the high land in the centre of the island, owing to the mist in
which it was enveloped, and there was great excitement and much
speculation on board as to the principal points which were visible. At
noon the observations taken proved that Tom was right in his
opinion as to our exact position. The wind dropped as we
approached the coast, where we could see the heavy surf dashing
against the black lava cliffs, rushing up the little creeks, and
throwing its spray in huge fountain-like jets high above the tall
cocoanut-trees far inland.
We sailed along close to the shore, and by two o’clock were near the
entrance to the Bay of Hilo. In answer to our signal for a pilot, a
boat came off with a man who said he knew the entrance to the
harbour, but informed us that the proper pilot had gone to Honolulu
on a pleasure trip.
It was a clear afternoon. The mountains, Mauna Kea and Mauna
Loa, could be plainly seen from top to bottom, their giant crests
rising nearly 14,000 feet above our heads, their tree and fern clad
slopes seamed with deep gulches or ravines, down each of which a
fertilizing river ran into the sea. Inside the reef, the white coral
shore, on which the waves seemed too lazy to break, is fringed with