Text Encoding Initiative

The Text Encoding Initiative (TEI) is a consortium of institutions and research projects that collaborate to develop and maintain a standard for representing texts in digital form. Its most important result is a set of guidelines that catalogue text elements semantically and are expressed as XML markup. These guidelines have been widely used as a standard since 1994.

Short example

s = sentence; cl = clause
<s>
  <cl>It was about the beginning of September, 1664,
    <cl>that I, among the rest of my neighbours,
      heard in ordinary discourse
      <cl>that the plague was returned again to Holland; </cl>
    </cl>
  </cl>
  <cl>for it had been very violent there, and particularly at
    Amsterdam and Rotterdam, in the year 1663, </cl>
  <cl>whither, <cl>they say,</cl> it was brought,
    <cl>some said</cl> from Italy, others from the Levant, among some goods
    <cl>which were brought home by their Turkey fleet;</cl>
  </cl>
  <cl>others said it was brought from Candia;
    others from Cyprus. </cl>
</s>
<s>
  <cl>It mattered not <cl>from whence it came;</cl>
  </cl>
  <cl>but all agreed <cl>it was come into Holland again.</cl>
  </cl>
</s>
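
Because the markup is ordinary XML, the sentence and clause structure can be read back with any XML parser. The following is a minimal sketch using Python's standard xml.etree.ElementTree module, not an official TEI tool. It assumes a shortened copy of the second sentence above, wrapped in a hypothetical <body> root element so that the fragment is well-formed, and it assumes the elements carry no namespace. The helper name clause_depths is likewise made up for illustration.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the second <s> from the example, wrapped in a
# hypothetical <body> root (an assumption) so the fragment parses.
# Assumption: no XML namespace. Real TEI files normally declare
# http://www.tei-c.org/ns/1.0, and tag names must then be qualified.
SAMPLE = """<body>
<s>
  <cl>It mattered not <cl>from whence it came;</cl></cl>
  <cl>but all agreed <cl>it was come into Holland again.</cl></cl>
</s>
</body>"""


def clause_depths(element, depth=0):
    """Yield (depth, text) for every <cl> below element, in document order."""
    for child in element:
        if child.tag == "cl":
            # Full text of the clause, nested clauses included,
            # with runs of whitespace collapsed to single spaces.
            text = " ".join("".join(child.itertext()).split())
            yield depth + 1, text
            yield from clause_depths(child, depth + 1)
        else:
            yield from clause_depths(child, depth)


root = ET.fromstring(SAMPLE)
sentences = root.findall("s")
clauses = list(clause_depths(root))

print(len(sentences), "sentence(s),", len(clauses), "clause(s)")
for depth, text in clauses:
    # Indent each clause by its nesting depth.
    print("  " * depth + text)
```

The recursion mirrors the nesting in the markup: a clause that contains another clause is reported first, followed by its embedded clause one level deeper.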

There are roughly 500 tags, although not all of them are intended to represent semantic text elements directly.
