About CRIM

PACTE - Collaborative electronic text annotation platform


PACTE is a collaborative annotation web platform for text content that integrates an array of practical tools for research groups. It offers two annotation modes, manual and automatic, and is available in English and French to facilitate research and collaboration. 

Manual annotation is carried out by means of an interface optimized to allow rapid entry of the data enriching a text. The automatic mode is composed of all the specialized and configurable annotation services (named entities, disambiguated terminology, etc.).

Users can interact with PACTE using the online user interface or the application programming interface API (REST API) to access all their data and facilitate algorithm development based on rich resources. 

With this collaborative web platform, analyses and annotations can be shared between users, thereby facilitating cooperation and opening the door to large-scale multi-partner studies. Using PACTE generates considerable gains in productivity by significantly reducing annotation time while facilitating the analysis of annotated data.
 

Services

 
Morphosyntactic analysis

Parse each word’s morphosyntactic function, grammatical gender, number, lemma, etc.

 
 
Document profile

Identify a document’s relevant domains through a specialized terminology lexicon.


 
Terminology disambiguation

Annotate texts using lexicon's terms that are directly related to a given field.


 
Recognize a named entity

Isolate expressions designating named entities: people, places, organizations, geopolitical entities...


 
Active learning (in development)

Train a prediction model with minimal annotation effort thanks to new algorithms selecting the examples, thus providing the best information for the model. 

 

Main features

  • Create, import and access your corpora. 
  • Import text documents in various formats (doc, docx, pdf, html, etc.).
  • Create and modify your annotations.
  • Consult an existing lexicon or built your own!
  • Assign annotation tasks to team members.
  • Manage your contacts and share your corpora and annotation schemas.
  • Launch one of our linguistic, lexical or semantic annotation tools.
  • Work in English or in French, depending on team members' preferences.
  • Structure the information you need to enrich your documents.
  • Find the relevant documents according to annotation type.
  • No service matches your needs? Train your own, customized annotator! (in development)

 

Discover and try PACTE here!

 

 

Read the white paper

Scientific publications

Ménard, P. A. and Barrière, C. "PACTE: a collaborative platform for textual annotation" in Proc of the 12th International Conference on Computational Semantics (IWCS 2017). Montpellier, France, September 19 to 22, 2017 
 
Bernier-Colborne, G., Barrière, C. and Ménard, P. A. "Fine-grained domain classification of text using TERMIUM Plus" in Proc of the 12th International Conference on Computational Semantics (IWCS 2017). Montpellier (France), September 19 to 22, 2017 

 

Reasearch platforms

Created as part of CANARIE-funded projects, the research platforms developed by CRIM are available on the CANARIE's Research Software Portal, which gives access to a range of platforms and software services to researchers in order to accelerate discovery. 

 Discover VESTA: a collaborative multimedia content annotation platform

 

Teams

Upcoming event

  • Séminaire R-D : L'art de la cyberrésilience: les leçons du secteur bancaire
    30 January 2020 11:00
    CRIM (405, avenue Ogilvy, bureau 101, Montréal)
    Une conférence du Prof. Benoît Dupont, Titulaire de la Chaire de recherche du Canada en cybersécurité,Titulaire de la Chaire de recherche en prévention de la cybercriminalité, Directeur scientifique – Réseau intégré sur la cybersécurité (SERENE-RISC), Université de Montréal.
    +
  • #CRIMtravels ????Joao Monteiro, automatic speech processing intern supervised by Jahangir Alam, presents an article at… https://t.co/7bsFunP7na
  • Antoine Normand, PDG de BlueBear a discuté d'un sujet sensible lors de la Journée Techno sur la Désinformation à l’… https://t.co/FlBn1vdEaE

Recent Publications

  • Deep learning speech recognition for low-resource languages

    +
  • Forage de données géospatiales, quelques applications

    +