José Calvo Tello's Website – Digital Humanities and Spanish Philology

Clásicos Hispánicos

The most important project besides my activity at the University has taken place in the independent collection of eBooks Clásicos Hispánicos. This collection publishes Spanish classics in ePUB and mobi format with texts prepared by specialists and reviewed by a second specialist. We have developed our

Quijote I and II in CH

Some published texts

Coplas de Manrique

Bécquer in the XML-TEI of CH

Some contributors of CH

teiHeader of the Quijote

Corpus of the Spanish Novel from 1880-1940

As part of my work at the CLiGS research group, I have already published a small corpus of Spanish novels in XML-TEI called Corpus of Spanish Novels from 1880-1940. We have published in our GitHub repository different versions: XML-TEI, plain text, linguistic annotated XML, and PDF.

This corpus is only a teaser of the real corpus I am currently working on, which will be published at the end of the project.

Corpus of Spanish Novels used for Stylometry

Toolbox

As part of my work at the CLiGS I have also contributed to our repository of scripts in Python. My main contributions are related to the conversion from HTML to XML-TEI, the treatment and extraction of metadata and the work with stylometric matrixes.

Extraction of places from La Regente using regex

Extraction of places from La Regenta using regex

XML-TEI-Bible

I am currently editing chapter by chapter the Bible in Spanish, marking with identifiers people, places, groups, and direct speech (with the specification of who is talking to whom). After editing, I am also extract the information and visualise it as graphs.
Everything about this project is published on the GitHub repository.

Graph based on the Genesis of the Bible

Projects, corpora, and data

Clásicos Hispánicos

Corpus of the Spanish Novel from 1880-1940

Toolbox

XML-TEI-Bible

Stylometry on Political Text

Casa de Citas

Der-die-das

Web

Lately in the blog