The Open Text Mining Interface (OTMI) is a proposed method for making the text of journal articles available for indexing and analysis while preserving any subscription model that funds the journals. This approach, presented in a Web 2.0 session at the Bio-IT World conference earlier this week, uses an Atom XML version of each article, with OTMI-namespaced extensions, to provide all the sentences of the article in alphabetical order. Some extra information, such as word frequency, is also included, but this could presumably be derived from the sentence text anyway.
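To illustrate why the word-frequency data looks derivable, here is a minimal Python sketch that counts words from a list of sentences. It does not read a real OTMI file: the sample sentences, the `word_frequencies` helper and the simple tokenising pattern are all assumptions made for illustration, and OTMI's own counting rules may well differ.

```python
import re
from collections import Counter


def word_frequencies(sentences):
    """Count lower-cased word tokens across all sentences (hypothetical helper)."""
    counts = Counter()
    for sentence in sentences:
        # Crude tokeniser: runs of letters, digits and apostrophes.
        counts.update(re.findall(r"[a-z0-9']+", sentence.lower()))
    return counts


# Made-up sentences, sorted alphabetically the way OTMI presents them.
sentences = sorted([
    "Text mining needs text.",
    "Publishers fund journals through subscriptions.",
    "Mining sentences in any order gives the same counts.",
])

freq = word_frequencies(sentences)
print(freq.most_common(3))
```

Because counting ignores order, the alphabetical shuffle makes no difference to the result.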
All the articles in the 2020 Computing issue of Nature have OTMI files linked using <link rel="OTMI" type="application/atom+xml" href=""/> - here's an example file.
Comments

In its present form, OTMI is flawed: it makes assumptions about the nature of the text mining that will be applied. By listing the sentences in alphabetical order rather than article order, and by omitting paragraph and other document structure, it rules out techniques that take advantage of the information clustering and flow that an article's structure provides, which are among the newer and more effective analysis techniques. Even fairly traditional things like proximity search will not work on an OTMI source unless the two words of interest are in the same sentence.
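A rough Python sketch of the proximity problem, under stated assumptions: the three-sentence "article" is invented, and `tokens` and `within_distance` are hypothetical helpers rather than part of any OTMI tooling. It checks whether two words occur within a few tokens of each other, first with the sentences in article order and then in alphabetical order.

```python
import re


def tokens(text):
    """Split text into lower-cased word tokens (crude, for illustration)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def within_distance(words, a, b, max_gap):
    """True if some occurrence of a lies within max_gap tokens of b."""
    pos_a = [i for i, w in enumerate(words) if w == a]
    pos_b = [i for i, w in enumerate(words) if w == b]
    return any(abs(i - j) <= max_gap for i in pos_a for j in pos_b)


# Invented example article, sentences in their original order.
article = [
    "We measured expression of the gene.",
    "Mutation carriers showed reduced levels.",
    "All samples were collected in 2005.",
]

# Article order: "gene" ends one sentence and "mutation" starts the next.
in_order = within_distance(tokens(" ".join(article)), "gene", "mutation", 3)

# Alphabetical order: the two sentences are no longer adjacent.
shuffled = within_distance(tokens(" ".join(sorted(article))), "gene", "mutation", 3)

print(in_order, shuffled)
```

In this toy case the match exists only when the sentences are in article order; once they are sorted, the two words end up far apart and the cross-sentence match is lost.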
The stopwords and term frequency are completely redundant, and the latter suggests a vector-space model view of text mining.
I do realize that publishers would be more reluctant to release this information if the sentences were in article order, but I feel that OTMI as it stands is too limiting for the real world.
I also understand that this is a proposal, open to input for changes/improvements.
I must confess that I have not been able to find a primary source of information on OTMI, only blog posts.
I've posted some more details of OTMI here:
http://blogs.nature.com/wp/nascent/2006/04/open_text_mining_interface_1.html