Difference between revisions of "Help:AKC Transcription guidelines"
Line 212: | Line 212: | ||
<p>Servo indegno in <abbr>Chr.o</abbr></p> | <p>Servo indegno in <abbr>Chr.o</abbr></p> | ||
<p>Gaspare Schott</p> | <p>Gaspare Schott</p> | ||
− | || Molto Reverendo Padre in | + | || Molto Reverendo Padre in Christo<br /> |
Pax Christi<br /> | Pax Christi<br /> | ||
− | Quasi | + | Quasi nello stesso tempo ho avuto il desiderato avviso dal Nostro Padre, <br /> |
e da Vostra Reverenza, e dal nostro Padre Provinciale, e Padre Rettore di questo <br /> | e da Vostra Reverenza, e dal nostro Padre Provinciale, e Padre Rettore di questo <br /> | ||
Collegio, che alla rinfrescata vada a Roma, per essere Compagno <br /> | Collegio, che alla rinfrescata vada a Roma, per essere Compagno <br /> | ||
di Vostra Reverenza; il che farò con gusto mio indicibile, per potere servire <br /> | di Vostra Reverenza; il che farò con gusto mio indicibile, per potere servire <br /> | ||
− | + | a un tanto grande successo, e imparare da lui. L'inclusa <br /> | |
ho consegnato al Signore D. Carlo Ventimiglia, il quale l'ha ricevuto <br /> | ho consegnato al Signore D. Carlo Ventimiglia, il quale l'ha ricevuto <br /> | ||
con sentimento di allegrezza particolare, e me l'ha mostrato, <br /> | con sentimento di allegrezza particolare, e me l'ha mostrato, <br /> | ||
e letto. La sua andata a Roma è stata impedita da un <br /> | e letto. La sua andata a Roma è stata impedita da un <br /> | ||
− | tristissimo avvenimento, ed è il seguente. | + | tristissimo avvenimento, ed è il seguente. Al 4 di Agosto stava <br /> |
egli in una seggetta avanti della sua casa, per vedere <br /> | egli in una seggetta avanti della sua casa, per vedere <br /> | ||
− | gioco di | + | gioco di fuoco che per ordine suo si doveva fare avanti <br /> |
la Casa in onore della Beata Vergine Maria ad Nives; stava in mezzo <br /> | la Casa in onore della Beata Vergine Maria ad Nives; stava in mezzo <br /> | ||
di molta gente un cocchio di due mule, le quali cominciandosi <br /> | di molta gente un cocchio di due mule, le quali cominciandosi <br /> | ||
Line 230: | Line 230: | ||
sopra la seggetta del Signore Carlo, e gettandolo in terra, gli <br /> | sopra la seggetta del Signore Carlo, e gettandolo in terra, gli <br /> | ||
spezzarono con una ruota del cocchio l'osso della gamba in tre <br /> | spezzarono con una ruota del cocchio l'osso della gamba in tre <br /> | ||
− | pezzi; onde | + | pezzi; onde sta egli a letto, e starà per molti giorni, con <br /> |
grandissimi dolori, ma, come si spera, senza pericolo della vita. <br /> | grandissimi dolori, ma, come si spera, senza pericolo della vita. <br /> | ||
Io quasi ogni giorno vado a visitarlo. Ha pensiero, ricevuta <br /> | Io quasi ogni giorno vado a visitarlo. Ha pensiero, ricevuta <br /> | ||
Line 238: | Line 238: | ||
di Vostra Reverenza. Ho scritto al Signore Scafiti, e aspetto la risposta. La <br /> | di Vostra Reverenza. Ho scritto al Signore Scafiti, e aspetto la risposta. La <br /> | ||
spesa del mio viaggio sarà moderatissima, e spero di averla <br /> | spesa del mio viaggio sarà moderatissima, e spero di averla <br /> | ||
− | o tutta, o in parte | + | o tutta, o in parte dagli amici. Vostra Reverenza interima preghi il Signore, <br /> |
che prosperi il viaggio. Palermo 18 d'Agosto 1652.<br /> | che prosperi il viaggio. Palermo 18 d'Agosto 1652.<br /> | ||
Revision as of 09:10, 31 March 2017
These guidelines are valid for the transcription of the Athanasius Kircher Correspondence letters only. To read other projects guidelines, please visit their related pages. |
In these guidelines you will find not only the transcription prescriptions, but also descriptions of other tasks that users have to complete in order to guarantee a good functioning of GATE and to ease the work of other users. These simple and quick tasks must be done within the same page where you will transcribe the letters and their main function is to control and maintain updated the whole advancement of the work.
In any case remember that GATE is a collaborative project, so not all the tasks must be completed by the same user.
These transcription guidelines are available in the following languages:
Transcription guidelines
To transcribe the letters we are following several simple rules, chosen to produce standard transcriptions despite the number of people who will work within this project.
First of all we decided to do two kinds of transcription for each letter:
- a diplomatic transcription, to reproduce the original structure and words spelling of the original text: within this transcription a small group of TEI tags will be used;
- a reviewed transcription, where all the abbreviations and some other features of the original text will be regularized to ease the reading, the full text search and the semantic annotation too.
Diplomatic transcription
To do a diplomatic transcription of the letters, it has been implemented the TEI Tags extension, devoloped within the Transcribe Bentham Project by Richard Davis. This extension allows the use of a group of TEI tags to give a minimal logical structure to the text and to highlight some text phenomena.
The diplomatic transcription is performed within the Page namespace.
Following there is a brief guide to use the tags.
To use tags (see the video on the right):
|
- click on 'Edit' within the page you want to transcribe
- select the 'Annotation' toolbar
- select the text to mark
- click on the button you want to use
- save the page.
Please remind: to work correctly all tags, except for <lb/>
and <pb/>
, must contain the text you want to mark.
<head>
Head: "the <head>
element is used to identify a heading prefixed to the start of any textual division, at any level" (TEI P5).[1]
This tag must be used to mark the salutation at the beginning of the letters, including the first line only.[2]
Button to use in the 'Annotation' toolbar:
<p>
Paragraph: the <p>
tag "marks paragraphs in prose" (TEI P5).[3]
This tag is used to mark any text block except for the salutation at the beginning of the letter (see <head>
). Use this tag also to mark single lines of the final salutation.
Button to use in the 'Annotation' toolbar:
<lb/>
Line break: "<lb>
(line break) marks the start of a new (typographic) line in some edition or version of a text" (TEI P5).[4]
Use this tag to mark each line of the text; you don't have to use it to break the line after <head>
and <p>
. This tag can be indented within a <p>
tag, when a paragraph is composed of several lines.
Button to use in the 'Annotation' toolbar:
<pb/>
Page break: "<pb>
(page break) marks the start of a new page in a paginated document" (TEI P5).[5]
Use this tag at the end of the transcription of each page of a multiple pages letter; do not use it when the back page includes only the address.
Button to use in the 'Annotation' toolbar:
<abbr>
Abbreviation: "<abbr>
(abbreviation) contains an abbreviation of any sort." (TEI P5).[6]
Use this tag to mark abbreviated words, e.g. <abbr>R.do</abbr>
, <abbr>Ill.mo</abbr>
. In case of two or more consecutive abbreviated words, mark each of them with a separate tag, e.g. <abbr>V.</abbr> <abbr>R.</abbr>
.
When abbreviations show up as symbols, transcribe only the first letter of the abbreviated word followed by a dot and mark this text using <abbr>
. E.g. when the symbols for per
occurs, mark it as <abbr>p.</abbr>
.
Button to use in the 'Annotation' toolbar:
<add>
Addition: "<add>
(addition) contains letters, words, or phrases inserted in the source text by an author, scribe, or a previous annotator or corrector" (TEI P5).[7]
Use this tag to mark, e.g., additions of words between two lines.
Button to use in the 'Annotation' toolbar:
<del>
Deletion: "<del>
(deletion) contains a letter, word, or passage deleted, marked as deleted, or otherwise indicated as superfluous or spurious in the copy text by an author, scribe, or a previous annotator or corrector" (TEI P5).[8]
Button to use in the 'Annotation' toolbar:
<gap/>
Gap: "<gap>
indicates a point where material has been omitted in a transcription, whether for editorial reasons described in the TEI header, as part of sampling practice, or because the material is illegible, invisible, or inaudible" (TEI P5).[9]
Use this tag when you can't read the text for any of the reasons mentioned above.
Button to use in the 'Annotation' toolbar:
<unclear>
Unclear: "<unclear>
contains a word, phrase, or passage which cannot be transcribed with certainty because it is illegible or inaudible in the source" (TEI P5).[10]
Button to use in the 'Annotation' toolbar:
<sic>
Sic: "<sic>
(Latin for thus or so) contains text reproduced although apparently incorrect or inaccurate" (TEI P5).[11]
Button to use in the 'Annotation' toolbar:
<underline>
Underline: marks underlined text.[12]
Button to use in the 'Annotation' toolbar:
<superscript>
Superscript: marks a superscripted text.[13]
Use this tag to represent the style of abbreviations, e.g. <abbr>R.<hi rend="superscript">do</hi></abbr>
gives R.do
; do not use this tag as substitute of addition (<add>
).
Button to use in the 'Annotation' toolbar:
<note>
Note: "<note>
contains a note or annotation" (TEI P5).[14]
Within manuscript letters this tag shouldn't be necessary, since it is more suitable for printed texts; however sometimes it could be useful, e.g. to mark the text of an asterisk which refers to an addition on the margin of the letter: in such cases you have to transcribe the text of the note within the whole text of the letter and mark all of it with <note>
.
Button to use in the 'Annotation' toolbar:
Other rules
- Diphthongs: you must insert diphthongs using the 'Special characters' toolbar and choosing the 'Latin' set of characters.
- Do not use spaces or start a new lines more than necessary.
- Words on two lines (at end of one line and at the beginning of the following line) must be transcribed as they appear, transcribing the dash to start a new line, followed by the
<lb/>
tag, e.g.:
What you type | What you get |
---|---|
philo-<lb/>
|
philo-
|
- The same rule applies to words on two pages, where the first part of the word must be transcribed in the page where it begins (followed by a dash) and the second part in the following page.
- In some cases the reading of the encoding could be really difficult; to check if everything is fine, use the 'Show preview' button at the bottom of the page (close to the 'Save page' button).
Example of diplomatic transcription
What you type | What you get |
---|---|
<head>Lorem ipsum dolor sit amet,</head> <p>consectetur <abbr>adipiscing</abbr> elit. Maecenas <add>accumsan</add> fermentum purus,<lb/> |
Lorem ipsum dolor sit amet,
consectetur adipiscing elit. Maecenas accumsan fermentum purus,
|
Reviewed transcription
The reviewed transcription is performed within 'Main' namespace. This transcription does not use TEI tags, but just some elements of Wiki markup.
Accent marks
When accent marks are absent in the original text, they must be reported within the reviewed transcription, e.g.:
“virtu” → “virtù”
;“umanita” → “umanità”
- etc.
The same style is valid for other languages too (French, Latin etc.). This choice implies the deletion of accent marks no longer used in modern languages, e.g. "mandarla à me" → "mandarla a me"
.
Abbreviations
All abbreviations must be extended. This rule is valid for honorary titles, salutations and any other kind of abbreviated word. If in doubt, leave abbreviated words as they appear in the original text.
This rule applies to any language.
Spaces
Diacrital marks (including punctuation) are always followed by a space; this rule doesn't applies to apostrophes used in articles or prepositions, e.g.:
“ill.mo” NOT “ill. mo”
;“V. S.” NOT “V.S.”
;“l’altra” NOT “l’ altra”
;“chiesa, casa” NOT “chiesa,casa”
;“dell’altra” NOT “dell’ altra”
.
Other rules
- Any other integrations are allowed ONLY to return a more correct text, e.g. restoring concurrence between articles and words, e.g.
“dei numero” → “del numero”
. - Outdated forms must be returned to the current use, e.g.
“haver” → "aver"
,“humanità” → "umanità"
,“Venetia” → "Venezia"
,“doppo” → "dopo"
,“colegio” → "collegio"
etc.. - Words on two lines (at end of one line and at the beginning of the following line) must be integrally transcribed, without any breaking sign; words on two pages must be integrally transcribed in the page where they begin.
Example of reviewed transcription
Diplomatic transcription | Reviewed transcription |
---|---|
Molto R.do P. in Chr.o
P. C. Quasi nell'istesso tempo ho havuto il desiderato avviso dal Nostro Padre, Di V.a Re.a Servo indegno in Chr.o Gaspare Schott |
Molto Reverendo Padre in Christo Pax Christi Di Vostra Reverenza Servo indegno in Cristo Gaspare Schott |
Please note that it is not necessary that the reviewed transcription respects the length of the lines of the original text: insert a <br />
only when there is a new paragraph and to separate single text blocks.
Management tasks
To maintain work progress up to date, users must change the transcription status when they finish to proofread the texts.
Further information about proofread tools, see Wiki help#Proofread.
Change transcription status
- when you visit a page not yet proofread, chosen in the not proofread letters list, before the text beginning it contains the indication This page has not been proofread;
- this indication informs about the transcription status of that letter; this status is set by users using a toolbar at the bottom of the page in 'Edit' mode;
- when users end their corrections, before saving, they must assign to the page the Proofread status (marked in yellow);
- after setting the new status and saving the page, before the text beginning you can see the new indication This page has been proofread.
Change transcription status categories
When you are in 'Edit' mode within a not proofread page, before the text beginning (or at the end of it), you can read the code [[Category:AKC Not proofread]]
. This code represents a category assigned to that page to group it with similar pages, in this case pages containing letters not yet proofread within the Athanasius Kircher Correspondence; as any other category, it is shown at bottom of the page (see Wiki help#Categories for further information).
Once users ended their revisions, before saving, they must change this category to [[Category:AKC Proofread]]
: this will allow to maintain the not proofread letters list up to date.
Before leaving the page, check if the new category is shown at the bottom of the page and if it is in blue, not red. If it is red, it means the code used is not correct. It is enough to write [[Category:AKC proofread]]
(with lowercase p
) to generate the error.
References
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-head.html
- ↑ At the moment, due to a technical issue, other tags inserted within
<head>
are not properly parsed; if you use other tags within<head>
, after saving you will notice a parsing error. You can ignore it and leave the code you used, since in the future this bug will be fixed, or you can use<big></big>
tag instead (you will find it in the 'Advanced' toolbar, button ). - ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-p.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-lb.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-pb.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-abbr.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-add.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-del.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-gap.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-unclear.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-sic.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-hi.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-hi.html
- ↑ http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-note.html