Issue351

Title: RDFa extraction only works for valid XML documents.
Priority: bug
Status: chatting
Project: DiscoverEd
Milestone:
Superseder:
Nosy List: mlinksva, nathan
Assigned To:
Keywords:

Created on 2009-05-04.17:13:18 by nathan, last changed 2009-05-27.22:55:37 by nathan.

Files
File name Uploaded Type
RDFa2RDf.xsl nathan, 2009-05-27.22:55:37 application/xml
RDFaExtractor.java nathan, 2009-05-27.22:55:13 text/x-java
Messages
msg1122 Author: nathan Date: 2009-05-27.22:55:37
Attaching XSLT based approach for reference.
msg1052 Author: nathan Date: 2009-05-04.17:13:17
There doesn't seem to be an RDFa parser for Java that handles HTML 4/5, so
extraction fails on any page that is not well-formed XML. Possibilities:

* Jython + pyRDFa [rdflib seems to need special attention to make it skip
  building extension modules]
* pass documents through JTidy [http://jtidy.sf.net] first, so the extractor
  always sees well-formed XHTML
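
For illustration, here is a minimal sketch of the failure mode, using only the JDK's XML parser. It is not the attached RDFaExtractor.java; the class name, method name, and sample markup are all made up for this example, and collecting `rel`/`href` pairs is only a stand-in for real RDFa extraction. The point is that a strict XML parser rejects ordinary HTML 4 tag soup outright, which is why a tidy step (or an HTML-aware parser) would be needed in front of any XML-based extractor.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical demo class; names and sample markup are assumptions.
class RdfaXmlOnlyDemo {

    // Well-formed XHTML fragment: quoted attributes, every tag closed.
    static final String WELL_FORMED =
        "<html><body><p>Licensed under "
        + "<a rel=\"license\" href=\"http://creativecommons.org/licenses/by/3.0/\">CC BY</a>"
        + "</p></body></html>";

    // Typical HTML 4 tag soup: unquoted attributes, unclosed <p> and <br>.
    static final String TAG_SOUP =
        "<html><body><p>Licensed under "
        + "<a rel=license href=http://creativecommons.org/licenses/by/3.0/>CC BY</a><br>"
        + "</body></html>";

    /**
     * Returns "rel -> href" for every element with a rel attribute,
     * or null when the markup is not well-formed XML.
     */
    static List<String> extractRelLinks(String markup) {
        try {
            DocumentBuilder builder =
                DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // DefaultHandler rethrows fatal errors without printing to stderr.
            builder.setErrorHandler(new DefaultHandler());
            Document doc = builder.parse(
                new ByteArrayInputStream(markup.getBytes(StandardCharsets.UTF_8)));

            List<String> links = new ArrayList<>();
            NodeList all = doc.getElementsByTagName("*");
            for (int i = 0; i < all.getLength(); i++) {
                Element el = (Element) all.item(i);
                if (el.hasAttribute("rel")) {
                    links.add(el.getAttribute("rel") + " -> " + el.getAttribute("href"));
                }
            }
            return links;
        } catch (SAXException e) {
            // The parser gave up: the input is not well-formed XML.
            return null;
        } catch (ParserConfigurationException | IOException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(extractRelLinks(WELL_FORMED));
        // prints [license -> http://creativecommons.org/licenses/by/3.0/]
        System.out.println(extractRelLinks(TAG_SOUP));
        // prints null
    }
}
```

With the JTidy option, the cleanup would happen before the `builder.parse` call, so that markup like `TAG_SOUP` reaches the parser already converted to XHTML.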
History
Date User Action Args
2009-05-27 22:55:37 nathan set files: + RDFa2RDf.xsl
                               status: unread -> chatting
                               messages: + msg1122
2009-05-27 22:55:14 nathan set files: + RDFaExtractor.java
2009-05-04 17:13:18 nathan create