SOCIAL MEDIA EDITION

dbt™ Data
Modeling Challenge

📣 The challenge has ended—more coming soon! Stay tuned! 🔥
Social media logos icon image collage
Logo icon - ParadimeLogo icon - LightdashLogo icon - GithubLogo icon - Github
Movie poster photo collage

Why Join This Challenge?

  • Showcase your SQL, dbt™, and analytics expertise
  • Work with cutting-edge tools used by top tech companies
  • Analyze real-world social media data to showcase your creativity and analytical prowess
  • Win Amazon gift cards worth up to $3,000
  • Build your portfolio and career prospects
  • Network with industry professionals and peers
Start Your Data Adventure 🌟
Movie poster photo collage

Why Join This Challenge?

  • Showcase your SQL, dbt™, and analytics expertise
  • Work with cutting-edge data tools used by top tech companies
  • Analyze real-world social media data to showcase your creativity and analytical prowess
  • Win Amazon gift cards worth up to $3,000
  • Build your portfolio and career prospects
  • Network with industry professionals and peers
Start Your Challenge 🌟

Challenge Overview

Welcome to the dbt™ Data Modeling Challenge - Social Media Edition! This global competition invites data practitioners to transform raw social media data into powerful insights. Use Paradime, MotherDuck, and Hex to craft SQL queries, develop dbt™ models, and create compelling visualizations. Uncover trends and tell data-driven stories that will wow the judges for a chance to win $3,000..

Key dates:
  • Start: July 30th, 2024
  • September 9, 2024, at 11:59 PM PT
  • Winners Announced: September 12, 2024
Join Now 💡
Movie poster photo collage

Prizes

Show off your data skills and win big!
#1
$3,000
Amazon gift card
#2
$2,000
Amazon gift card
#3
$1,000
Amazon gift card
Compete for the Top Spot 🏆
Exciting winning movie scene Polaroid collage represents the prize for data modeling challenge winners
Movie poster photo collage

Prizes

Show off your data skills and win big!
#1
$3,000
Amazon gift card
#2
$2,000
Amazon gift card
#3
$1,000
Amazon gift card
Compete for the Top Spot 🏆

About the challenge

A whistle-blowing icon represents the start of the challenge.
Ready, Set, Analyze
July 30th, 2024
An alarming alarm clock icon represents the challenge timeline's deadline.
Submission deadline
September 9, 2024
A loudspeaker icon indicates the time of the challenge winner announcement.
Winners announced
September 12, 2024

How it works

  • 1. Sign up for the challenge. If you meet the entry requirements, you’ll receive all credentials to get started.
  • 2. Use MotherDuck for data warehousing and compute.
  • 3. Use Paradime to develop your dbt™ models.
  • 4. Use Hex to analyze the datasets and create visualizations
  • 5. Submit your GitHub repository with code and insights in README.md.
  • 6. Our expert judges independently score each submission.
  • 7. Winners announced!
Detailed instructions and resources provided upon registration. Check out the Challenge’s github repo for more details.
Start the Challenge 🗺️

Who should participate?

  • Data Analysts
  • Analytics Engineers
  • Data Engineers
  • Data Scientists
  • Anyone passionate about data and social media analytics!
This is a solo challenge to showcase your individual talents. Participants should have hands-on experience with SQL, dbt™, and Git.
Show Us What You've Got! 💪

Tools you'll use

  • Paradime: The AI-powered, one-stop-shop analytics engineering platform for dbt™.
  • MotherDuck: The ducking simple data warehouse without the overhead - join data and files between local dev environments and the cloud to get a bird’s eye view of your data
  • Hex: Hex brings the entire analytics workflow together in one collaborative, AI-powered workspace. Data science, self-serve reporting, and exploratory analysis all work together seamlessly.
  • GitHub: Version control and project submission
Master Industry-Leading Tools 🛠️

Judging panel and criteria

The judges will evaluate submissions based on the following criteria:
  • Value of insights (1-10)
  • Complexity of insights (1-10)
  • Quality of materials (1-10)
  • Integration of new data (1-10)
For a detailed breakdown of each criterion, please refer to the full challenge details below.
Our judges are looking for creative, insightful, and well-executed projects that demonstrate your data modeling and analysis skills. Good luck!

Challenge details

A plus icon, which serves as the accordion trigger in FAQ section

Entry requirements

  • Participants must be current or former data professionals (Data Analysts, Analytics Engineers, Data Engineers, Data Scientists, etc.).

  • Solo participation only (no teams).

  • Must have hands-on experience with SQL, dbt™, and Git.

  • Participants must use, but are not limited to, the following tools:

  • Must be able to explain their code and insights. You can use ChatGPT, but you better understand it! 🤣

A plus icon, which serves as the accordion trigger in FAQ section

Challenge deliverables

Use Paradime, MotherDuck, and Hex to uncover compelling insights from social media data. Aim for accurate, relevant, and engaging discoveries.
Participants are expected to submit:

  • A GitHub branch containing:

    • Code - Your dbt™ models, and any other applicable code (Example from previoius challenge)

    • README.md - Here you narrate your submission — How you built it, what you built, including data visualizations and accompanying insights (Example from previoius challenge)

A plus icon, which serves as the accordion trigger in FAQ section

Judging criteria

Judges will score each submission based on:

  • Value of insights (1-10):
    Are the insights interesting and relevant?

    • Get creative! Uncover something fun and accurate that you'd find interesting if you saw it on social media, for example.

  • Complexity of insights (1-10):
    Are you creating relationships between datasets and providing in-depth analytical conclusions?

    • Complexity ≠ value, but you should use multiple datasets to generate valuable insights.

  • Quality of materials (1-10):
    Is your code of professional quality? Are your data visualizations well-designed? Are your insights' conclusions clear to the reader?

    • If your submission isn't good enough to share with peers, it won't be good enough for the judges.

  • Integration of new data (1-10):
    How effectively have you integrated new, relevant data to enhance your project?

    • Incorporating additional datasets has the potential to score you higher in other categories: value of findings, complexity of findings, and quality of materials.

A plus icon, which serves as the accordion trigger in FAQ section

Judges

A picture of Emily Hawkins, the judge of the data modeling challenge.
(Analytics Engineering,  GlossGenius)
A picture of Jake Hannan, the judge of the data modeling challenge.
(Data Platform, Sigma)
fabio
(Co-founder, Paradime)

Frequently Asked Questions

How can I obtain technical support?

A plus icon, which serves as the accordion trigger in FAQ section

For all technical support and challenge inquiries, use Paradime's #social-media-data-challenge Slack channel. You can also find support in the MotherDuck Slack Community.

Is seeking external assistance permitted?

A plus icon, which serves as the accordion trigger in FAQ section

Yes. While your submission must be your own work, you can use your network, online resources, and even ChatGPT for inspiration and learning. However, all actual work must be done by you alone.

Can I utilize additional tools beyond the required ones?

A plus icon, which serves as the accordion trigger in FAQ section

Yes. Beyond the required tools (Paradime, MotherDuck, Hex, and GitHub), you are encouraged to use additional tools and technologies that enhance your project.

Do participants begin from scratch?

A plus icon, which serves as the accordion trigger in FAQ section

Not at all! Paradime provides the following resources:

  • Paradime for SQL & dbt™ development.

  • MotherDuck for data storage and compute.

  • Hex for data analysis and visualizations.

  • GitHub for version control.

Am I allowed to incorporate additional data sources?

A plus icon, which serves as the accordion trigger in FAQ section

Yes, incorporating additional data is required. The sample data we provide is optional. Any data you bring in must be user-generated social media data or relevant supplementary data.

Is this event a hackathon?

A plus icon, which serves as the accordion trigger in FAQ section

Not exactly. This is a 6-week, asynchronous competition. Participants work on their own time and submit by the deadline.

If I win, how will I receive a gift card?

A plus icon, which serves as the accordion trigger in FAQ section

Winners will receive an Amazon e-gift card via email.

What are some key strategies for developing insights from social media data sets?

A plus icon, which serves as the accordion trigger in FAQ section

Aim to generate insights that are accurate and interesting. They should be scroll stoppers! For inspiration, here are some intriguing insights to explore:

  • COVID-19 Sentiment Analysis

    • Analysis Question: How has the sentiment around COVID-19 on Reddit changed over time? Why?

    • Optional/Supplementary Data: Key dates, news, events, and/or anything that points to why sentiment has changed over time.

  • Donald Trump Popularity Trends

    • Analysis Question: How has Donald Trump's popularity changed over time?

    • Required Social Media Data: A sample of Twitter posts, mentions, and engagement, containing the words "Donald Trump" over the last 10 years.

    • Optional/Supplementary Data: Key dates, news, events, and/or anything that points to why popularity has changed over time.

  • Top YouTube Creators Study

    • Analysis Question: Who are the biggest YouTube creators, and why?

    • Required Social Media Data: YouTube comments, engagement metrics, etc.

    • Optional/Supplementary Data: Trending YouTube Video statistics, or similar datasets.

  • 2022 NFL Superbowl Commercial Impact

    • Analysis Question: Which Commercials were most popular during the 2022 NFL Superbowl?

    • Required Social Media Data: Twitter and/or Reddit posts, mentions, and engagement during the 4-hour time block of the NFL Superbowl. Only pull data that contains information about brands that had Superbowl commercials.

  • Optional/Supplementary Data:

    • For public companies that advertised, pull stock market data to see if there's any correlation between Superbowl commercial success and stock price.

    • Using Superbowl advertisement cost data, identify which brands had the highest social engagement per dollar spent.

Time is running out! Join the challenge now

📣 Please review the entry requirements to ensure you qualify
Thank you! We have received your submission and will be in touch shortly with next steps!
Looks like something went wrong while submitting the form. Can you try again?