!

versão em português                                                           Coordination: Prof. Dr. Mário Eduardo Viaro



GMHP Corpus


LISTAS DE PALAVRAS
Latim Inglês Galego Mirandês


OBRAS EM LATIM

versão da Vulgata de São Jerônimo
Antigo Testamento
Novo Testamento
Glossário Lista de palavras


OBRAS DA LITERATURA DE LÍNGUA PORTUGUESA POR SÉCULOS
Medievais XVI XVII XVIII XIX XX


IMPORTANT INFORMATION
a) This material was organized with the aim of helping, especially researches that have been done by members of the GHMP;

b) The GHMP, in turn, uses specific programs for handling data in order to work on the material on hand, in such a way that the text files were standardized according to the specifications of such programs, showing certain markings that did not belong to the original work, but will be used as patterns for such programs;

c) The markings are small headlines inserted throughout some works that show the following format:
\num [xxx]
\txt
in which \num indicates the beginning of the headline, [xxx] indicates any type of division in the work (chapter, scene, etc.), and \txt indicates the ending of the line of the headline and beginning of text;

d) For the reason already stated above, the files were stored in simple, compacted text format, making it unnecessary options of formatting using italics and bold characters, which does not interfere with the type of analysis which this material is aimed at, making it inappropriate, therefore, for publishing purposes;

e) The material that is available was prepared to be used strictly by electronic means and for academic use, not having any commercial purposes and also not being adequate for any type of printing;

f) The works were distributed by centuries, given the fact that the studies carried out by GHMP are focused on the historical-evolutionary feature of language. However, it should be made clear that the organization of files by centuries provides only a panoramic view of the works over time, since it is not possible to determine with certainty the date of production of all the texts;

g) Still on the subject of chronological organization of the works, it is important to bear in mind that there is a great number of authors whose literary production comprises different centuries. In such cases, each author’s periods of greater production were considered and the literary schools to which those authors are traditionally linked in order to situate them at a certain period. It is the responsibility of each member, therefore, to make sure that the given text – in case such a text eventually reveals itself relevant for his/her research – is squared into that period;

h) Besides the distribution by centuries, a division of the works into five major literary genres was done in the following way: (1) novel-novella, (2) short story-chronicle, (3), theater, (4) poetry and (5) prose (others). This division was done only with the aim at an attempt to group together texts with similar styles, with no intention to settle issues of literary nature as, for instance, the issue of the difference between a short story and a novella and other similar cases. In doubtful cases, some criteria were followed, as the recurring classification related to the work (mentioned in textbooks and other reliable sources), as well as its extension and general structure;

i) It should still be stressed that most of the content of this corpus was drawn from material that had been previously typed and available through other media and/or web sites, in such a way that the members of the GHMP should not be answerable for occasional mistakes found in these files, although many problems have already been detected and corrected;

j) The material shown on this page is constantly updated and all data are subject to corrections and changes. It is asked, then, that any mistake found, be it in the date and genre classification, be it in the files, that it is informed the GHMP for an enhancement of the corpus;

k) k) We suggest that the AntConc 3.2.2 software be used for the text processing.

UNIVERSIDADE DE SÃO PAULO

Faculdade de Filosofia, Letras e Ciências Humanas

Grupo de Morfologia Histórica do Português

gmhp@usp.br