Metadata-Version: 2.1
Name: texta-crf-extractor
Version: 2.2.0
Summary: texta-crf-extractor
Home-page: https://git.texta.ee/texta/texta-crf-extractor-python
Author: TEXTA
Author-email: info@texta.ee
License: GPLv3
Description: # TEXTA CRF Extractor
        
        ![Py3.8](https://img.shields.io/badge/python-3.8-green.svg)
        ![Py3.9](https://img.shields.io/badge/python-3.9-green.svg)
        ![Py3.10](https://img.shields.io/badge/python-3.10-green.svg)
        
        ## Requirements
        
        * Python >= 3.8
        * SciPy installation for scikit-learn (requires BLAS & LAPACK system libraries).
        
        ## Installation:
        
        ```
        # For debian based systems (ex: debian:buster) to install binary dependencies.
        
        apt-get update && apt-get install python3-scipy
        
        # Install without MLP
        
        pip install texta-crf-extractor
        
        # Install with MLP 
        
        pip install texta-crf-extractor[mlp]
        
        ```
        
        ## Usage:
        
        ```
        from texta_crf_extractor.crf_extractor import CRFExtractor
        from texta_mlp.mlp import MLP
        
        mlp = MLP(language_codes=["en"], default_language_code="en")
        
        # prepare data
        texts = ["foo", "bar"]
        mlp_docs = [mlp.process(text) for text in texts]
        
        # create extractor
        extractor = CRFExtractor(mlp=mlp)
        
        # train the CRF model
        extractor.train(mlp_docs)
        
        # tag something
        extractor.tag("Tere maailm!")
        
        ```
        
Platform: UNKNOWN
Description-Content-Type: text/markdown
Provides-Extra: mlp
