text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
-
Updated
May 1, 2025 - Python
text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。
搜索所有中文NLP数据集,附常用英文NLP数据集
中文文本分析工具包(包括- 文本分类 - 文本聚类 - 文本相似性 - 关键词抽取 - 关键短语抽取 - 情感分析 - 文本纠错 - 文本摘要 - 主题关键词-同义词、近义词-事件三元组抽取)
Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard
对四种句子/文本相似度计算方法进行实验与比较
⚛️ It is keras based implementation of siamese architecture using lstm encoders to compute text similarity
法研杯2019相似案例匹配第二名解决方案(附数据集和文档),CAIL2020/2021司法考试赛道冠军队伍
Machine Learning APIs for common use cases, include: General OCR (Simplified/Traditional Chinese), Custom OCR, Image Similarity, Object Recognition, Face Detection, Face Comparison, Human Image Segmentation, Human Attribute Recognition, Pornography Detection, Image Super Resolution, Text Similarity, Car License Plate, etc.
A PyTorch-based toolkit for natural language processing
Mimix: A Text Generation Tool and Pretrained Chinese Models
GTS Engine: A powerful NLU Training System。GTS引擎(GTS-Engine)是一款开箱即用且性能强大的自然语言理解引擎,聚焦于小样本任务,能够仅用小样本就能自动化生产NLP模型。
Expose a Top2Vec model with a REST API.
Arabic support for textblob
[ACM-CIKM] 2nd place solution at CIKM AnalytiCup 2018, a task for determining short text similarities.
文本相似度,语义向量,文本向量,text-similarity,similarity, sentence-similarity,BERT,SimCSE,BERT-Whitening,Sentence-BERT, PromCSE, SBERT
Generate custom detailed survey paper with topic clustered sections and proper citations, from just a single query in just under 30 mins !!
Text preprocessing, representation, similarity calculation, text search and classification. Let's go and play with text!
A python client for connecting to all the services provided by https://dandelion.eu
Add a description, image, and links to the text-similarity topic page so that developers can more easily learn about it.
To associate your repository with the text-similarity topic, visit your repo's landing page and select "manage topics."