The International Math Union's Electronic Initiatives
Jonathan Michael Borwein, FRSC
Shrum Professor and Canada Research Chair
Director, Centre for Experimental and
Simon Fraser University, Burnaby, V5A 1S6, BC
Great changes are underway in the way we compute and communicate. This extended and annotated abstract discusses a few of the changes that impinge especially on the mathematical sciences, concentrating on the initiatives of the IMU. Other threads are explored in Terry Stanway's contribution "From G.H.H. and Maple to Littlewood and XML: Changing Needs and Expectations in Mathematical Knowledge Management." This is a complicated world, for example, the CMS adheres to the IMU within the International Council of Scientific Unions (ICSU, www.icsu.org), via the Canadian National Research Council (NRC)!
As evidence of the changes, I refer to two recent issues of Computers in Science and Engineering, a journal that is jointly published by the ACM and the IEEE. The first, in February 2000, was on the Top Ten Algorithms having "the greatest influence on the development and practice of science and engineering in the 20th century", see also the Science News coverage at (www.sciencenews.org/20000212/mathtrek.asp) entitled `Algorithms for the Ages'.
A follow up issue in May 2001 was on Tomorrows Hardest Problems, (http://computer.org/cise/cs2001/c3toc.htm). It includes my article with P. Borwein, discussing "Challenges in Mathematical Computing" for which a true preprint -- the editor of the article actually edited -- is available at www.cecm.sfu.ca/preprints/2001pp.html#01:160. Therein, we make the claim that
"Almost all interesting mathematical algorithmic questions relate to NP-hard questions and such computation is prone to explode exponentially. More space, more speed and processors, and even say massive parallelism will have an impact but it will be largely at a `micro not macro' level. We anticipate the greatest benefit accruing from mathematical platforms that allow for highly computer assisted insight generation (more `aha's' per cycle), not from solution of grand challenge problems."
While I shall talk primarily from the perspective of a researcher, it is important to remember how many `players' there are and how small a part of the scheme mathematical science publishing and computing really is. Though we, like Tom Lehrer, know math is everywhere, in reality even the bioscience disciplines are likely to tread over us. For example, Nobel Laureate Richard J.Roberts recently wrote in "PubMed Central: The GenBank of the published literature", Proc. Natl. Acad. Sci. USA, Vol. 98, Issue 2, 381--382, January 16, 2001 (http://www.pnas.org/cgi/content/full/041601398v1)
"What is a reasonable delay? I would argue that 6 months seems a reasonable time for a journal to monopolize the content. Most of us would not dream of scanning the contents of a journal published 6 months ago unless we were searching for a specific article. Thus it seems unlikely that a large number of subscriptions would be lost if 6-month-old issues were made freely available. I think rather few worthwhile journals would be adversely affected if they were to institute such a policy."
Such an outcome would destroy most current mathematics society publishing. Despite that such issues are not high on mathematicians agendas. For example, only a handful out of over ten thousand mathematicians have chosen to grant the AMS permission to publish rather than transfer copyright, since they were offered the choice more than five years ago. Nor do they often distinguish public from commercial initiatives such as www.mathpreprints.com (which actively hides its Elsvier roots).
Neutrality (From Nature, Sep 6 , 2001)
"The OAI and CrossRef strategies therefore differ fundamentally from more centralized systems proposed by PubMed Central (PMC) (see Jo McEntyre & David Lipman, GenBank - a model community resource?), operated by the US National Library of Medicine, and E-Biosci, being developed by the European Molecular Biology Organization (see Frank Gannon, Boycott! & Les Grivell, E-Biosci: a European approach to handling biological information).
The question of whether to merge datasets or content in a single centralized repository - 'datawarehousing' - or in a federation of distributed resources is a common one facing scientific databases and international research collaborations. The former works well for large amounts of data that are in standard formats, and originate from a handful of large centres. This is the case for gene sequence data, which explains the success of Genbank. In many other areas, such as s ructural and functional genomics, data are more heterogeneous, and their source s more dispersed, and these communities tend to put the emphasis on federated databases curated by those closest to the data.
But PMC and E-Biosci highlight the urgent need to index the full text of papers and their metadata and not just abstracts, as is the practice of PubMed and other aggregators. Services that require publishers to deposit full text only for indexing and improving search are useful, and they can simplify some of the technical issues associated with federating distributed resources (see Matt Cocker ill, Distributed and centralized technologies: complementary tools to build a permanent digital archive). Unfortunately, PMC, unlike E-Biosci, confounds this primarily technological issue with an economic one, by requiring that all text be made available free after, at most, one year. It is regrettable that PMC has not in the first instance sought full-text indexing itself as a goal, as this in itself would be an immediate boon to researchers. It would also probably have been more successful in attracting publishers.
The reality is that all of those involved in scientific publishing are in a period of intense experimentation, the outcome of which is difficult to predict (see Tim O'Reilly, Information wants to be valuable). Getting there will require novel forms of collaboration between publishers, databases, digital libraries and other stakeholders. It would be unwise to put all of one's eggs in the basket of any one economic or technological 'solution'. Diversity is the best bet"
To set the stage:
``The work then proceeds in a manner unique to science. Because practitioners publish their work electronically, through the e-print archives at the Los Alamos National Laboratory in New Mexico, the entire community can read a paper hours after its authors finish typing the last footnote. As a result, no one theorist or even a collaboration does definitive work. Instead, the field progresses like a jazz performance: A few theorists develop a theme, which others quickly take up and elaborate. By the time it's fully developed, a few dozen physicists, working anywhere from Princeton to Bombay to the beaches of Santa Barbara, may have played important parts.''
Gary Taubes, ``String Theorists Find a Rosetta Stone," Science July 23 (1999) 513.
The International Mathematical Union's (www.mathunion.org/) Committee on Electronic Information and Communication, CEIC, (www.ceic.math.ca/) was established in 1998 and given a very broad mandate (see Appendix A) to advise the IMU on electronic matters.
The Committee met face-to-face in Berlin (November 13-14, 1998), at Berkeley (December 1-5, 1999), in Vienna (October 5-7, 2000), and in Princeton (May 12-14, 2001) at which time it reported directly to the Executive of the IMU. It meets twice in 2002 -- in Vancouver (February 15-17, 2002) and at the next ICM in Beijing (August 22-27, 2002), where various activities are planned.
Each meeting has comprised a working meeting along with a workshop or public presentations. The most ambitious workshop, which had over a hundred participants, was that in Berkeley on The Future of Mathematical Communication, 1999: www.msri.org/calendar/workshops/9900/Future_of_Math_Communications/. The presentations can be viewed on line in streaming video through the URL above.
As a member of the CEIC since its inception, and more recently deputy chair, I am fairly well placed to describe its present activities and future plans. Mathematics is perhaps the most international and transportable of the sciences, and hence has a great deal to gain from global mathematical knowledge management initiatives such as the current MKM meeting.
In particular, I draw attention to
I also note the growing attempts to collect mathematical data in ``global data-bases" such as arXiv: http://front.math.ucdavis.edu/. The arXiv has grown out of the Los Alamos preprint server which is now moving to Cornell (www.nature.com/nature/debates/e-access/Articles/ginsparg.html) and so should mesh better with the Digital Library Initiative, Euclid and like projects. I especially mention CEIC's own involvement with:
These are described in detail in Swanzl and Sperber's contribution to this meeting: "MathNet: Semantics, Visualization and Internalization."
I conclude by discussing our hopes to further develop tools for coordinating these efforts --and the issues and obstacles involved. For example, in our mandate we were asked to "consider transferring the World Directory of Mathematicians to an electronic freely accessible form". For logistic and IP reasons, we have recommended not now doing so.
Whatever the obstacles, the rewards can be amazing:
``Three years ago, said Dr. Zanelli in Chile, one of his own students posted a paper and the next day received an e-mail message from Dr. Witten. The student was at first so shocked that he accused Dr. Zanelli of playing an elaborate practical joke.
We learned that great physicists do read the archives daily and they browse through all the preprints,' Dr. Zanelli said, `even if they come from an obscure place in the end of the world.'
Dr. Witten's instant response, Dr. Zanelli said, `was like having the pope drop by for tea.' ''
James Glanz, ``Web Archive Opens a New Realm of Research,'' New York Times, April 30, 2001.
Building on the enabling resolution passed by the General Assembly in Dresden on August 16, 1998, the Executive Committee of the International Mathematical Union establishes a
Committee on Electronic Information and Communication (CEIC) of the International Mathematical Union (IMU)
Committee on Electronic Information and Communication (CEIC) of the International Mathematical Union (IMU)
Terms of Reference.
a) The CEIC shall be a
- standing committee of the Executive Committee (EC) of the IMU,
- to be reviewed every four years by the EC at its meeting preceding that of the GA.
- Members will be appointed for four year terms by procedures identical to those for its Commissions. The Executive Committee will appoint one of its members to serve on the CEIC.
b) The CEIC may
- meet as necessary in each four year period,
- review the development of Electronic Information and Communication as it impacts the international mathematical community and
- submit a report to EC.
c) The CEIC may
- organize or sponsor international meetings or forums to bring together representatives of all interested parties, including
+ libraries, and
- publish and otherwise disseminate
+ reviews of recent developments, and
+ technical surveys for the use of the mathematical community.
d) The CEIC may
- recommend international standards on issues related to electronic communication. Such recommendations should be reviewed by the EC and, if approved, may be published and promoted in the name of the IMU.
e) During its first 4 year term, the CEIC is specifically asked to address
- the coordination of world-wide efforts to establish web-based servers for
+ mathematical papers,
+ journals and
- This includes issues of uniformizing
+ document identifiers and
+ supported formats,
+ mirroring and
+ the development of search engines for mathematical material and
+ coordination of existing servers.
- It should publish its findings with the goal of making
+ the use of these servers universally understood and
+ usable by the whole mathematical community.
- It is also asked to consider transferring the World Directory of Mathematicians to an electronic freely accessible form.
f) Membership (in 2000):
Endorsed by the Executive Committee of the IMU in its 68th's session in Princeton, NJ, May 14-15, 2001
The number of mathematical papers that are stored or circulated as electronic files is increasing steadily. It is important that copyright agreements should keep in step with this development, and not inhibit mathematical authors or their publishers from making best use of the electronic medium together with more traditional media. While most mathematicians have no desire to learn the subtleties of copyright law, there are some general principles that they should keep in mind when discussing copyright for research papers with their publishers.
One of the main purposes of your copyright agreement is to control how your publisher or you make the paper available to this third group. Publishers will hardly allow individual authors to dictate agreements with libraries. But if you know that a certain journal publisher makes life hard for libraries, you can take this into account when choosing where to submit your paper.
The complete copyright checklist was written by Wilfrid Hodges. It was approved and is recommended by the Committee on Electronic Information and Communication of the International Mathematical Union (IMU). A final version will be posted in the near future (http://www.maths.qmw.ac.uk/~wilfrid/copyright.html).
Endorsed by the IMU Executive Committee on May 15, 2001 in its 68th's session in Princeton, NJ.
Open access to the mathematical literature is an important goal. Each of us can contribute to that goal by making available electronically as much of our own work as feasible.
Our recent work is likely already in computer readable form and should be made available variously in TeX source, dvi, pdf (Adobe Acrobat), or PostScript form. Publications from the pre-TeX era can be scanned and/or digitally photographed. Retyping in TeX is not as unthinkable as first appears.
Our action will have greatly enlarged the reservoir of freely available primary mathematical material, particularly helping scientists working without adequate library access.
I shall finish by briefly touching on some of the following where my interests overlap:
For example, we aim to build low-end desk-top GridStations and 3 to 6 person GridRooms which will facilitate use of the WestGrid. Each of the four main Universities (UBC, SFU, Calgary), Alberta) has at least one group engaged in collaborative or visualization research. Thus, under the aegis of my recently awarded Canada Research Chair, we are presently establishing the SFU CoLaboratory modeled on the Stanford iRoom (www.cecm.sfu.ca/personal/jborwein/colab.html) to explore mathematical collaboration within heterogeneous and ubiquitous computing environments.
CISTI like many organizations has a somewhat over-constrained mandate: (i) to serve Canadians (its Journals are free on line to .ca suffixes thanks to the Depository Services Program (DSP)of Public Works and Government Services Canada), (ii) to recover costs, but (iii) not to compete with the private sector.
The Canadian Association of Research Libraries (CARL) (www.createchange.org) has recently made a very good and strong statement about open publishing. This link also leads to the more general Public Library of Science movement spear-headed by ex NIH director Harold Varmus and others.