codepujan

🎯

Focusing

Pujan codepujan

🎯

Focusing

PhD student at Boston University Security Lab (https://seclab.bu.edu/)

15 followers · 11 following

Boston
21:31 (UTC -12:00)

Achievements

Highlights

Stars

pb0316 / SpeedupLLM

Python 4 Updated May 27, 2025

donnemartin / system-design-primer

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 302,410 50,099 Updated May 21, 2025

astorfi / LLM-Alignment-Project

A comprehensive template for aligning large language models (LLMs) using Reinforcement Learning from Human Feedback (RLHF), transfer learning, and more. Build your own customizable LLM alignment so…

Python 33 2 Updated Dec 15, 2024

bltlab / query-ner

The QueryNER dataset, developed by Brandeis University and eBay.

Python 5 1 Updated May 16, 2024

seleniumbase / SeleniumBase

Python APIs for web automation, testing, and bypassing bot-detection.

Python 10,770 1,344 Updated May 27, 2025

zillow / fair-housing-guardrail

Fair Housing Guardrail

Python 31 5 Updated Nov 25, 2024

Seezo-io / llm-security-101

Delving into the Realm of LLM Security: An Exploration of Offensive and Defensive Tools, Unveiling Their Present Capabilities.

163 28 Updated Oct 13, 2023

frutik / awesome-search

Awesome Search - this is all about the (e-commerce, but not only) search and its awesomeness

HTML 1,452 124 Updated May 25, 2025

annis-souames / brand-ner

A brand tagging system in product titles and user generated text

Jupyter Notebook 35 3 Updated Jan 17, 2022

google-research-datasets / common-crawl-domain-names

Corpus of domain names scraped from Common Crawl and manually annotated to add word boundaries (e.g. "commoncrawl" to "common crawl").

17 1 Updated Oct 2, 2020

dair-ai / Transformers-Recipe

🧠 A study guide to learn about Transformers

1,592 156 Updated Jun 3, 2023

google-deepmind / long-form-factuality

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Python 610 73 Updated Apr 29, 2025

microsoft / generative-ai-for-beginners

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 84,553 44,008 Updated Jun 2, 2025

osanseviero / hackerllama

My personal site

Jupyter Notebook 75 5 Updated Aug 4, 2024

christianversloot / machine-learning-articles

🧠💬 Articles I wrote about machine learning, archived from MachineCurve.com.

3,622 768 Updated Jun 28, 2024

teacherpeterpan / Logic-LLM

The project page for "LOGIC-LM: Empowering Large Language Models with Symbolic Solvers for Faithful Logical Reasoning"

C 322 60 Updated Jun 13, 2024

pacman100 / LLM-Workshop

LLM Workshop by Sourab Mangrulkar

Jupyter Notebook 381 133 Updated Jun 16, 2024

jlgleason / google-image-scraper

python Google/Bing image scraper

Python 1 Updated Feb 19, 2024

taufeeque9 / codebook-features

Sparse and discrete interpretability tool for neural networks

Python 63 4 Updated Feb 12, 2024

rasbt / LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 50,492 7,323 Updated Apr 20, 2025

huggingface / alignment-handbook

Robust recipes to align language models with human and AI preferences

Python 5,203 445 Updated Apr 30, 2025

cxli233 / FriendsDontLetFriends

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,726 266 Updated Dec 10, 2024

gesiscss / awesome-computational-social-science

A list of awesome resources for Computational Social Science

R 695 85 Updated May 14, 2025

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 25,515 1,640 Updated May 30, 2025

eth-sri / language-model-arithmetic

Controlled Text Generation via Language Model Arithmetic

Python 221 14 Updated Sep 15, 2024

SystemsLab-Sapienza / TGDataset

A collection of over 120'000 Telegram Channels

Python 39 8 Updated Feb 22, 2024

facebookresearch / ImageBind

ImageBind One Embedding Space to Bind Them All

Python 8,667 812 Updated Jul 31, 2024

markowanga / stweet

Advanced python library to scrap Twitter (tweets, users) from unofficial API

Python 606 69 Updated Jul 25, 2023

allenanie / DisExtract

The library that uses dependency parsing to preprocess text to train DisSent model

Python 33 4 Updated Mar 26, 2020

r-three / t-few

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Python 451 62 Updated Sep 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pujan codepujan

Achievements

Achievements

Highlights

Block or report codepujan

Stars

pb0316 / SpeedupLLM

donnemartin / system-design-primer

astorfi / LLM-Alignment-Project

bltlab / query-ner

seleniumbase / SeleniumBase

zillow / fair-housing-guardrail

Seezo-io / llm-security-101

frutik / awesome-search

annis-souames / brand-ner

google-research-datasets / common-crawl-domain-names

dair-ai / Transformers-Recipe

google-deepmind / long-form-factuality

microsoft / generative-ai-for-beginners

osanseviero / hackerllama

christianversloot / machine-learning-articles

teacherpeterpan / Logic-LLM

pacman100 / LLM-Workshop

jlgleason / google-image-scraper

taufeeque9 / codebook-features

rasbt / LLMs-from-scratch

huggingface / alignment-handbook

cxli233 / FriendsDontLetFriends

gesiscss / awesome-computational-social-science

VikParuchuri / marker

eth-sri / language-model-arithmetic

SystemsLab-Sapienza / TGDataset

facebookresearch / ImageBind

markowanga / stweet

allenanie / DisExtract

r-three / t-few