Bryn Mawr College
- NYC
- https://azpoliak.github.io
- @azpoliak
Stars
A free tutorial for Apache Spark.
Diverse Natural Language Inference Collection - NLI dataset that can be used to evaluate how well models perform distinct types of reasoning (EMNLP 2018)
Spanish NLI Corpus with Negation-based Adversarial Examples
Probe how GPT-n performs on statutory reasoning
Materials for the EAAI 23 paper "Exploring Social Biases of Large Language Models in a College Artificial Intelligence Course"
Homework for NLP course at University of Maryland
Homework assignments for Computational Linguistics I
LaTeX source for Think Java, 2nd edition, by Allen Downey and Chris Mayfield.
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Workshop "Analyzing Social Media Data" at the Big Data and Development Conference
A simple module to collect video, text, and metadata from TikTok.
Solve puzzles. Learn CUDA.
Exact paired permutation significance test for accuracy
Demos of Iggy Enrich (https://www.askiggy.com/)
Tools for collecting social media data around focal events
An evolving list of electronic media data sets used to model mental-health status.
A simple tool for splitting up an ebook into its chapters. Works well with Project Gutenberg texts. May also be used to clean up books for computational text analysis.
I wanted all of plaintext Project Gutenberg in an easy-to-use format, so I made this
Cornell INFO 3350: Text mining for history and literature, Fall 2020
A Python wrapper around the topic modeling functions of MALLET.
Live Python Notebooks with any Editor
Interactive Jupyter Notebooks for learning materials
The textbook Computational and Inferential Thinking: The Foundations of Data Science