Julia Kruk

Researcher & Data Scientist

Hello, my name is Julia! I am a Graduate Researcher with the Stanford NLP Group under Prof. Diyi Yang, Research Affiliate at the Georgia Institute of Technology, and Data Scientist III at Bombora.

My research interests include Computational Social Science, Multimodality, Coded Language and Social Norms. I enjoy working at the intersection of Machine Learning, NLP and Computer Vision to study human phenomena and develop AI technologies that reflect human values.

On a personal note - I have a background in Physics and the Visual Arts. Whereas Physics naturally finds its way into my professional career, I practice analog photography and oil painting as a creative outlet! Ask me about my work :)

Projects & Publications

Semi-truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image Detectors

NeurIPS 2024

A novel method of augmenting images which provides control over what is changed and how, with limited human supervision. We present Semi-Truths: a dataset of ~1.5 million images for evaluation and training of AI-Generated Image Detectors.

Silent Signals, Loud Impact: LLMs for Word-Sense Disambiguation of Coded Dog Whistles

ACL 2024

Dog whistles are coded communication that is often weaponized for racial and socioeconomic discrimination. We present an approach for word-sense disambiguation of dog whistles using LLMs, and a dataset of 16,550 high-confidence coded examples of dog whistles.

Impressions: Visual Semiotics and Aesthetic Impact Understanding

EMNLP 2023

A novel dataset of photojournalism through which to investigate the semiotics of images, and how specific visual features and design choices can elicit specific emotions, thoughts and beliefs.

Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts

EMNLP 2019

Computing author intent from multimodal data requires modeling a complex relationship between text and image. Neither caption nor image is a mere transcript of the other. We present 3 orthogonal taxonomies that describe image-text relationships, and show that incorporating both image and text can improve document intent classification.

Environmental Monitoring: A Coupling Function Calculator

Caltech REU 2016

Developed software that detects ambient environmental noise in the interferometer and determines the affected area. The program has been shown to decrease false dismissals of gravitational waves by up to a factor of 10, and continues to be used in the vetting process.

A Modern Update and Usage of Historical Variable Star Catalogs

AAS 2015

We present an updated and modernized catalog of Cepheid Variable Stars in the Magellanic Clouds originally constructed by Henrietta Swan Leavitt during her tenure at the Harvard College Observatory (HCO) in the early 1900s.

Patents

Machine Learning Techniques for Internet Protocol Address to Domain Resolution Systems

2023 [ App: US20230252324A1 ]

Spatial-temporal anomaly and event detection using night vision sensors

2023 [ US20240212350A1 ]

Determining intent from multimodal content embedded in a common geometric space

2020 [ US20200134398A1 ]

Embedding multimodal content in a common non-euclidean geometric space

2019 [ US20190325342A1 ]