A girl biting on a pencil stressed about a quiz. There is text on the image. It reads: What data team member are you? Take the quiz to go find out!

Touch Base

Corporate for "I forgot what this is about but I need to make noise before someone notices".

Dashboard

An interactive report that executives will ignore until they ask for the same data… in an Excel sheet.

Fax Machine

Technologically frozen in 1995. Still thinks "the cloud" is for rain and refuses to click anything newer than Solitaire.

An ad for Secoda which says, experiencing metadata migraines? Ask your data engineer about Secoda.

Clustering

Grouping similar things together—useful for customer segmentation, but also how your closet naturally organizes itself into chaos.

Data Transformation

Because raw data is just too ugly.

Feedback Fountain

Spews directives like "make it intuitive" with all the specificity of a drunk fortune cookie.

One-Hot Encoding

Transforming categorical data into numerical form—because computers just don’t get words.

Let's Park That

Ignoring that data quality issue until it causes real problems.

EDA (Exploratory Data Analysis)

Poking around in your data to find trends, outliers, and problems before they ruin your model.

Identity & Access Management (IAM)

Keeping unauthorized users out - until someone shares a password.

Database

Where your data goes to sleep.

Cloud Computing

Someone else’s computer, but shinier.

XGBoost

A gradient boosting algorithm that wins Kaggle competitions—because sometimes brute force just works.

Penetration Testing

Hacking yourself before someone else does.

Low-Hanging Fruit

The easiest SQL query that someone still wants to call a "data-driven insight."

BI Developer

Makes dashboards for people who will ignore them and then ask you for the same numbers in a spreadsheet.

Data Quality

Because bad data leads to bad decisions and lots of excuses.

Boil the Ocean

“Can you analyze all our data from the last 10 years for a report we’ll ignore?”

Data Analysis

Sifting through data, hoping for something insightful.

Data Intelligence

Data’s glow-up into something actually useful.

Security Frameworks

Blueprints for security that companies try to follow.

Drinking from the Firehose

Getting access to the full raw data without documentation or guidance.

Calendar Necromancer

Schedules pre-meetings for the pre-meeting's pre-brief because they couldn't read an email to save their life.

Automated Testing

Because manually checking your code is for the weak.

HPPO (Highest Paid Person’s Opinion)

Corporate deity whose random breakfast thoughts outrank your entire research department.

Pipeline Poltergeist

Invisible data hero who's seen SQL horrors that would make junior devs cry.

Data Privacy

Protecting user info while secretly monetizing it.

Optimize

“We ran the same SQL query but indexed a column, so now it’s 2% faster.”

Apache Airflow

Workflow automation, so you don’t have to babysit data pipelines.

Operational Efficiency

Doing more work with fewer complaints—on a good day.

Deliverables

The dashboards and reports that will be outdated within a week.

Data Protection

A fancy term for “don’t let hackers steal our stuff.”

Predictive Analytics

Trying to guess the future based on past data—like a digital crystal ball, but with spreadsheets.

Data Augmentation

Artificially inflating your dataset so your model learns better—kind of like stretching the truth on a résumé.

Audit Trail

A digital breadcrumb trail for when things inevitably go wrong.

Compliance Frameworks

A checklist of rules to follow… until regulations change again.

Excel Goblin

Nesting IF statements like Russian dolls and defending their desktop spreadsheet hoard like a caffeinated dragon.

Latency

The thing everyone blames but nobody fixes.

Data Restoration

Because mistakes were made.

DBMS (Database Management System)

Because spreadsheets just don’t scale.

CI/CD (Continuous Integration/Continuous Deployment)

The reason your software updates faster than you can blink.

Personality Hire

Fluent in stakeholder management, and can turn vague requests into scarily accurate dashboards. Built half the team's workflows on vibes and somehow made it work.

Data Lineage

Because “I have no idea where this data came from” is not a great answer.

Data Mining

Digging through massive datasets, hoping to strike gold.

Data Validation

Double-checking data before it makes a fool of you.

Make Your Own Weather

A corporate delusion tactic to feign control, optimism, or progress in the face of complete chaos.

GA (Google Analytics)

A free tool for tracking website traffic—until privacy laws step in.

Paradigm Shift

Slapping AI on the same old nonsense.

Slide Sorcerer

Transforms your bullet point into 40 slides featuring at least two mountain-climbing metaphors.

Red Flags

All the missing data that everyone pretends doesn’t exist.

Data Retention

Holding onto data just long enough to avoid legal trouble.

Cross Tabulation

When you pivot data just to confirm what you already knew.

AI Whisperer

Fine-tunes LLMs like they’re sourdough starters. Has five GPU credits left and no intention of using them responsibly.

Statistical Data

The numbers that make up your analysis—sometimes useful, sometimes just noise.

Cybersecurity

The never-ending battle between hackers and IT teams running on coffee.

Random Forest

A bunch of decision trees working together to make better predictions—because one tree alone isn’t enough.

DML (Data Manipulation Language)

The reason your database admin hates you.

Pain Points

The frustrations of explaining, again, why two reports don’t match.

ELT (Extract, Load, Transform)

Load first, transform later—modern data integration in action.

Kanban Kool-Aid

Creates JIRA tickets to track their JIRA tickets while drowning in chaos.

Unstructured Data

Data that refuses to fit into neat tables—think text, images, and the chaos of the internet.

Dimensionality Reduction

Cutting down the number of variables in your dataset—because sometimes, less is more (especially in Excel).

Rubber Ducking

Talking to inanimate objects because humans are worse.

Data Replication

Keeping multiple copies of your data in sync.

Data Mesh

The buzzword architects love, but engineers fear.

It Depends

The universal answer to every data question, forever and always.

Scalable

We built it for five people and are praying it doesn’t break at ten.

Gradient Descent

The algorithm that helps machine learning models learn—think of it as slowly rolling downhill to the right answer.

Automation Solutions

Getting machines to do the boring stuff for you.

Seamless Integration

“This data connector technically works, but barely.”

Take It Offline

“This dashboard is broken, but let’s not discuss it in front of leadership.”

Indexing

The magic that makes your slow queries slightly less slow.

Time Series Analysis

Predicting trends over time—useful for stocks, weather, and figuring out when your Wi-Fi will crash again.

Access Control

The fine art of deciding who gets in and who gets a "403 Forbidden."

Econometrics

When economics meets statistics and things get extra nerdy.

Backpropagation

The magic behind neural networks—basically, trial and error on steroids until the model gets it right.

Bandwidth

“I have 10 dashboards to fix and zero time for your ad-hoc request.”

Rockstar/Ninja

A job posting for a data analyst who can also engineer pipelines and train AI models.

Fail Fast

“Throw some data models at the wall and see what sticks.”

Governance Guardian

Worships clean metadata and version control. Lives for data lineage and will fight you over naming conventions.

Data Storytelling

Turning numbers into narratives people might actually remember.

Overfitting

When your model is too smart for its own good and memorizes the training data instead of learning useful patterns.

Low Visibility

“I forgot to check the dashboard before this meeting.”

DQL (Data Query Language)

Because SQL SELECT wasn’t fancy enough.

Metadata Management

Making sure your data descriptions don’t live in someone’s forgotten spreadsheet.

Analytics

Turning raw data into fancy charts that people ignore.

Ping Me

A passive-aggressive way to say “this will be your problem soon.”

Supervised Learning

Teaching models with labeled data—kind of like school, but for algorithms.

Data Warehouse

The Costco of structured data.

Data Ingestion

Feeding your data pipeline a never-ending buffet.

Future-Proof

“This report is valid until next quarter, when everything changes.”

Relational Database

Where your data has commitment issues.

Data Hub

A central place for data that everyone fights over.

Holistic Approach

“We’ll consider all possible factors… except the ones that make us look bad.”

CDP (Customer Data Platform)

Stalking customers, but make it “data-driven.”

Auto Recovery

When your system crashes but pretends it never happened.

Data Lake

Where structured data goes to drown.

Outlier Detection

Spotting the oddballs in your data, because sometimes anomalies are fraud, and sometimes they’re just mistakes.

Continuous Integration

Automating code merges so your team doesn’t go crazy.

Machine Learning

Teaching computers to recognize patterns so they can pretend to be smart—until they overfit and fail.

Data Processing & Optimization

Making your inefficient queries slightly less embarrassing.

Right-Sizing

Cutting back on data storage costs until everything runs painfully slow.

Risk Management

Preparing for disasters that will still somehow surprise you.

URBAN DATA DICTIONARY IS WRITTEN WITH YOU

Submit a word