Highlights
- Pro
-
word_cloud Public
Python word cloud library for use within Jupyter notebook and Python apps.
-
resources Public
Forked from opinosis-analytics/blog-articlesCurated List of Blog Posts From Opinosis Analytics
UpdatedAug 14, 2021 -
OpinRank Public
OpinRank Dataset. Dataset containing user reviews for entities namely cars and hotels. Full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews)
-
nlp-in-practice Public
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, …
-
data-science-blogs Public
Forked from rushter/data-science-blogsA curated list of data science blogs
-
stop-words Public
Stop word lists
-
ROUGE-2.0 Public
ROUGE automatic summarization evaluation toolkit. Support for ROUGE-[N, L, S, SU], stemming and stopwords in different languages, unicode text evaluation, CSV output.
-
opinosis-summarization Public
This repo contains code and dataset for the Opinosis Summarization Framework
-
-
phrase-at-scale Public
Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
-
-
SIF_mini_demo Public
Forked from PrincetonML/SIF_mini_demominimal example for sentence embedding by Smooth Inverse Frequency weighting scheme
Python MIT License UpdatedMar 13, 2018 -
text-mining-and-nlp-apis Public
Forked from RxNLP/nlp-cloud-apisAPIs for clustering sentences, extracting topics, counting words & n-grams, extracting text from html or URL, computing similarity between texts and more.
-
pyrxnlp Public
Forked from RxNLP/PyRXNLPSuper simple NLP tools. Cluster sentences, get multiple text similarity measures including cosine, jaccard and dice, generate topics, extract text from html and more
-
clinical-concepts Public
Discovering Related Clinical Concepts using Large Amounts of Clinical Notes. An unsupervised graphical approach to 10000 mine related concepts by leveraging the volume within large amounts of clinical no…
-
-
python-examples Public
Working examples in python
-
ROUGE-Utility Public
Utility tools to prepare and evaluate ROUGE scores. Perl script to convert perl output of ROUGE to CSV.
1 UpdatedDec 15, 2017 -
-
-
Dataset for Micropinion Generation. Dataset is based on user reviews from CNET. The reviews are on products from various categories like tv, cell phones, gps etc.
2 UpdatedJul 14, 2017 -
bootstrap Public
Forked from twbs/bootstrapThe most popular HTML, CSS, and JavaScript framework for developing responsive, mobile first projects on the web.
-
electron Public
Forked from electron/electronBuild cross platform desktop apps with JavaScript, HTML, and CSS
C++ MIT License UpdatedNov 4, 2016 -
GeoSpark Public
Forked from apache/sedonaA Cluster Computing System for Processing Large-Scale Spatial Data
Java MIT License UpdatedNov 3, 2016 -
spark-lucenerdd Public
Forked from zouzias/spark-lucenerddSpark RDD with Lucene's query capabilities
Scala Apache License 2.0 UpdatedNov 2, 2016 -
spectron Public
Forked from electron-userland/spectronTest Electron apps using ChromeDriver
JavaScript MIT License UpdatedOct 27, 2016 -
-
spark Public
Forked from apache/sparkMirror of Apache Spark
Scala Apache License 2.0 UpdatedOct 27, 2016 -
stanza Public
Forked from stanfordnlp/stanza-oldStanford NLP group's shared Python tools.
Python Apache License 2.0 UpdatedOct 26, 2016 -
CoreNLP Public
Forked from stanfordnlp/CoreNLPStanford CoreNLP: A Java suite of core NLP tools.
Java GNU General Public License v3.0 UpdatedOct 22, 2016