Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.
-
Updated
Nov 8, 2024 - Python
Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a document-keyphrase matrix.
A sentiment analysis classifier for short texts in Python
Repository for the lectures taught in the course named "Natural Language Processing" at the University of Guilan, Department of Computer Engineering.
A support vector machine based topic classifier for Nepali text
(1) Train large language models to help people with automatic essay scoring. (2) Extract essay features and train new tokenizer to build tree models for score prediction.
To find the top 20 features for a set of documents given.
Convert raster image to SVG, create icons and icon sheets.
Pandas dataframe easy inspection, filtering, transformation: Get label distribution metrics, visualize multilabel columns through Chord diagram, filter label occurring less than a threshold, one-liner text/monolabel/multilabel columns vectorization, and many more to come.
This is a Python-based spam detection system that uses machine learning to classify messages as spam or not spam (ham). The system connects to a MySQL database for training data, uses TF-IDF vectorization for text processing, and employs logistic regression for classification.
Email Spam Detector - Machine Learning Model (Dockerized) that classifies messages as spam or not spam using a trained Naive Bayes model. The model is built using scikit-learn and is packaged inside a Docker container for easy deployment and usage.
PNG, JPG -> SVG Converter(OpenCV): Transform your PNG and JPG images into clean, scalable vector graphics with intelligent color reduction and transparency support.
summarize paragraph into other languages
A Python tool for converting raster images into stylized, vector-like artwork using machine learning techniques.
Bridging raster & vector processing workflows without compromises - a lightweight, local, and non-destructive image-processing app
memprediksi kalimat positif atau negatif dan mengatur bobot tf-idf dengan model MultinomialNB
This repo contains code on study of a covid long-hauler group
CPU-only embedding service for Weaviate
Using Natural Language Processing (NLP) and pandas, numpy, scikit-learn for classification and applying logistic regression as it is a supervised model, lastly NLTK. Pickle library used for saving and running the model anywhere.
Suggest similar tracks based on vectorized semantic similarity.
A semantic applicant–job matching microservice built on gRPC. Send complete Job and Applicant messages defined in matcher.proto — no raw JSON parsing in clients. The server uses a hybrid semantic + heuristic scorer with explainable factor breakdowns (skills, experience, education, etc.).
Add a description, image, and links to the vectorizer topic page so that developers can more easily learn about it.
To associate your repository with the vectorizer topic, visit your repo's landing page and select "manage topics."