Even though I’ve been a member of Github since 2010, I never had any repos of my own up. But now, I have my very own project! TEI to PDF is the code I wrote to turn the 2013 Digital Humanities abstracts into a nice looking book/PDF. The code is really just a bunch of XSLT scripts (and a bash script supplied by Keith Nickum – thanks Keith!) that turn a pile of TEI into a TEI corpus file (useful for all your text analysis needs), and then turns that corpus file into a .fo file. Finally, you just run that .fo through the Apache FOP (Formatting Objects Processor) to get your final PDF.
Right now, the code is really only for Digital Humanities conference abstract files–it is hard coded to divide the book into Workshops/Panels/Papers/Posters. Keith and I would like to further develop the script so that you can give it any TEI, set a few config options (what sections do you want, where in the TEI will those sections come from, how to order, etc.) and it will generate a nice looking PDF of your content. In the age of digital everything it seems silly to work on such a script, but I think it would be really nice to be able to take a PDF to one of those book printing machines and get a hard copy. If nothing else, library created TEI websites might want a “critical edition” to put in the stacks. The TEI website has a list of other ways to accomplish the TEI to book process, but all seem to have drawbacks (or are really expensive).
If you’d like to contribute, you can just push changes to the Github project, but if you’d like to be added as a contributor or have any other ideas, feel free to contact me through Github or here. Github has a handy wiki and issue tracker too, so you could add ideas/issues there.