HTML tables have become pervasive on the Web. Extracting their data automatically is difficult because finding relationships between cells not trivial due to many different layouts, encodings, and formats available. In this article, we introduce Melva, which an unsupervised domain-agnostic proposal extract from without requiring any external knowledge bases. It relies a clustering approach that...