Resources for the automatic segmentation of corpora.

The goal is to segment a corpus into homogeneous sub-parts using two notions: vocabulary growth and variations in vocabulary diversity. For more information, see:

Cyril Labbé, Dominique Labbé, Pierre Hubert, "Automatic Segmentation of Texts and Corpora", Journal of Quantitative Linguistics, 2004, Vol. 11, No. 3, pp. 193-213.
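
As a rough illustration of the notion of vocabulary growth mentioned above, the sketch below counts how many distinct word types have been seen after each token, and how many new types appear in each fixed-size window. It is not the method of the cited paper or of the segmenter program; the function names, the whitespace tokenization, and the window size are all assumptions made for this example.

```python
def vocabulary_growth(tokens):
    """Return V(n) for n = 1..N: the number of distinct word types
    among the first n tokens."""
    seen = set()
    growth = []
    for token in tokens:
        seen.add(token)
        growth.append(len(seen))
    return growth


def new_types_per_window(tokens, window):
    """Count the word types that appear for the first time in each
    consecutive window of `window` tokens (hypothetical helper;
    trailing tokens that do not fill a window are ignored)."""
    growth = vocabulary_growth(tokens)
    counts = []
    previous = 0
    for end in range(window, len(tokens) + 1, window):
        counts.append(growth[end - 1] - previous)
        previous = growth[end - 1]
    return counts


# Toy corpus: two repetitive passages followed by one with new vocabulary.
text = "the cat sat on the mat the cat sat on the mat a dog ran in a park"
tokens = text.split()  # naive whitespace tokenization (assumption)

print(vocabulary_growth(tokens))
print(new_types_per_window(tokens, 6))  # [5, 0, 5]
```

In this toy corpus the second window adds no new word types and the third adds five. A sudden rise of this kind in the rate of new vocabulary is the sort of signal a segmentation procedure could use to propose a boundary.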

To use the segmentation program, see the file "PrgSegmentationlisez_moi.txt".

Thanks to Gaetan Peaquin.

 Name                    Last modified       Size  Description

 DCDFLIB.C               09-Feb-2005 13:29    82k  
 DCDFLIB.H               09-Feb-2005 13:29     3k  
 DivTBTotal              09-Feb-2005 13:29    10k  
 Doxyfile                09-Feb-2005 13:29    39k  
 FILEID.DAT              09-Feb-2005 13:29     1k  
 FINDER.DAT              09-Feb-2005 13:29     3k  
 IPMPAR.C                09-Feb-2005 13:29     9k  
 IPMPAR.H                09-Feb-2005 13:29     1k  
 LISTE.C                 09-Feb-2005 13:29     1k  
 LISTE.H                 09-Feb-2005 13:29     2k  
 LabbeHubert2002.pdf     09-Feb-2005 13:29   140k  
 Makefile                09-Feb-2005 13:29     1k  
 OPTION.H                09-Feb-2005 13:29     2k  
 PrgSegmentationlisez..> 09-Feb-2005 13:29     5k  
 ProgSegmen.txt          09-Feb-2005 13:29     1k  
 ProgSegmenter.exe       09-Feb-2005 13:29   388k  
 ProgSegmenter.mcp       09-Feb-2005 13:29   112k  
 ProgSegmenter.old.mcp   09-Feb-2005 13:29   112k  
 TBOutput.txt            09-Feb-2005 13:29   126k  
 outputfile.tmp          09-Feb-2005 13:29   108k  
 segmenter.c             09-Feb-2005 13:29    28k  
 segmenter.h             09-Feb-2005 13:29     6k  
 segmenter.man           09-Feb-2005 13:29     6k
