The OpenArXiv Project Overview
The arXiv (http://arxiv.org) is one of the popular scientific digital libraries, and has been the major forum for dissemination of scientific results in disciplines such as Physics, Mathematics, Nonlinear Sciences, Computer Science, and Quantitative Biology. It currently contains about 300,000 scientific publications in various formats (e.g., ps, pdf, doc, tex).
The OpenArXiv project aims
to significantly improve this arXiv digital library in two ways: (1) By
exploiting the state-of-the-art database techniques available in Microsoft SQL
Server, we will build a large-scale scientific digital library solely using an
RDBMS; and (2) By utilizing the standard XML-based Web Services paradigm and
Microsoft .NET framework, we will build a programmable interface to arXiv so
that not only human users but also software agents can freely access the
contents of arXiv in many applications.