Fuzzy Name Matcher
Gradio app to perform fuzzy name matching on entity names and merge financial datasets in the absence of unique keys. Allows for docker deployment.
A list of small data science and coding projects, TILs (Today I Learned), and ongoing research projects, alongside links to supplementary resources (e.g., code, models, online appendices, etc.).
Gradio app to perform fuzzy name matching on entity names and merge financial datasets in the absence of unique keys. Allows for docker deployment.
DreamBooth is a fine-tuning technique for large, pretrained text-to-image models (e.g., DALL-E2, Imagen, Stable Diffusion). Based on a small reference set of training images of a given subject or object (henceforth concept), the DreamBooth technique learns a custom identifier for the given concept and implants the concept embedding into the model’s output domain. It enables the model to synthesize images of the underlying concept in different contexts and settings with very high-quality.
This project utilizes OpenAI’s LLMs and publicly available data, including ESG reports, SEC 10-K filings, and earnings call transcripts, to build an app that searches and summarizes these data to empower users with ESG-related information needs to invest responsibly.
CLI tool for downloading various types of SEC filings from the EDGAR database.
A write-up that summarizes my personal learnings and experimentations with CLIP-guided image synthesis. It covers VQGAN, CLIP, Inference-by-Optimization, as well as various text-to-image and image-to-image experiments.