Official government website of the Government of the Kingdom of Saudi Arabia
Links to official Saudi websites end withgov.sa
All links to official websites of government agencies in the Kingdom ofSaudi Arabia end with .gov.sa
Government websites use theHTTPSprotocol for encryption and security.
Secure websites in the Kingdom of Saudi Arabia use the HTTPS protocolfor encryption.

The Data Innovation Hackathon is part of Road to Riyadh events, aiming to empower innovators and experts to address challenges in the collection and processing of national statistical data. The Hackathon focuses on two main technical tracks: innovative data collection with non-traditional data sources and intelligent processing and classification by employing artificial intelligence techniques for data cleansing and categorization, thereby contributing to enhancing the speed and accuracy of official statistics.
This project is being implemented in close collaboration with national entities to ensure alignment with national priorities and strategic directions.
- Saudi Data & AI Authority (SDAIA).
- Saudi Association for Statistical Sciences (SASS) and Professional Association for Statisticians and Data Scientists (PASDS).
- First Stage (Recruitment and Screening): Launching the official website and receiving executive summaries of proposed ideas, followed by a screening process to select qualified teams for participation in the Hackathon.
- Second Stage (Implementation and Development): Providing selected teams with an intensive timeframe to develop a fully integrated prototype of the proposed solution.
- Third Stage (Judging and Closing): This stage includes virtual judging (Demo Day) and the recognition of winners during a virtual closing ceremony.
Committee: Consists of five judges responsible for evaluating participants across both tracks, distributed as follows:
- Two judges from within the General Authority for Statistics (GASTAT).
- Three judges from outside the Authority (local experts).
- First Place: A cash prize of SAR 30,000.
- Second Place: Two cash prizes, each valued at SAR 15,000.
- Third Place: Three cash prizes, each valued at SAR 5,000. (Note: Cash prizes will be disbursed later, to be announced).
- Participants: Participation is open to citizens and residents within Saudi Arabia, including students, academics, freelancers, and employees.
- Teams: Each team must consist of three, four, or five members.
To ensure the quality of technical evaluation, finalist teams are required to submit the following deliverables through the digital portal:
- Programming platform links: Submission of source code or technical workflows through links to GitHub or any other approved development platform (to be announced later).
- Setup and deployment guide: Providing clear and comprehensive instructions on how to run or deploy the technical solution, including all required software and environmental specifications, to enable the judging committee to review and test the solution thoroughly.
- Demonstration Video: Submission of a video not exceeding one minute, providing a concise explanation of how the solution addresses the stated problem, along with a practical demonstration of the prototype’s functionality.
Track one: Innovative data collection from non-traditional sources
Challenge name: Smart observatory for measuring platform-based employment
Challenge statement
Design and develop an automated observatory for the digital labor market that estimates the size and characteristics of the workforce in the Gig economy. By leveraging publicly available digital traces. The proposed solution should address the “blind spot” in traditional administrative records, which often fail to capture workers operating through digital platforms adequately.
Background
The labor market is undergoing a structural shift toward the digital economy (ride-hailing applications, delivery platforms, and freelance work). However, a substantial proportion of individuals engaged in these activities are not registered within formal social insurance systems, creating a significant statistical gap that constrains an accurate assessment of the true scale and characteristics of the flexible workforce.
Objective
Develop a proof-of-concept (POC) for an intelligent observatory capable of estimating the number of platform-based workers and classifying them by activity type. The solution should generate an interactive dashboard that compares official statistics with digitally observed labor market indicators, providing evidence-based insights to support decision-makers.
Data sources
Integrate data from freelance platforms, digital search trend indicators, and app store analytics—such as the volume and frequency of user reviews—as proxy indicators for estimating the size and activity levels of the workforce. These sources are provided for illustrative purposes only. Participants are encouraged to identify, innovate, and incorporate any additional publicly available data sources that strengthen the robustness of their proposed solution.
Proposed methodology
Participants are encouraged to leverage open-source solutions and low-code/no-code platforms. The proposed approach should incorporate automated tools for extracting publicly available data, data integration frameworks for consolidating heterogeneous datasets, and analytical models designed to estimate patterns and forecast future trends.
Track two: Intelligent processing and AI-powered automated classification
Challenge name: Intelligent logical error detection and correction system (Semantic Guardian)
Challenge statement
Develop an intelligent verification assistant that integrates seamlessly with survey platforms to detect semantic inconsistencies and complex logical errors in real time (such as mismatches between reported job titles and educational qualifications). The solution should move beyond rigid, rule-based validation checks by incorporating adaptive, intelligence-driven mechanisms that enhance data accuracy at the point of collection.
Background
Field data collection processes often encounter logical inconsistencies and conflicting responses that compromise data quality and necessitate extensive manual post-collection processing. The situation underscores the need for an intelligent solution capable of detecting and resolving such issues in real time during the data collection process.
Objective
Develop an integrated proof of concept (POC) that connects a large language model (LLM) to the data collection system to evaluate input quality and assign each response a confidence score. The solution must demonstrate real-time interactive integration with the survey instrument and the capability to process data dynamically during collection—not merely present a standalone (fine-tuned) language model.
Data sources
After the screening phase, qualified teams will be provided with a file containing 50 to 100 anonymized records or survey forms that include common logical errors and contextual inconsistencies observed in real field operations. This dataset will be used to test the system’s ability to process cross-variable data and ensure the plausibility of the results.
Proposed methodology
Use Application Programming Interfaces (APIs) for Large Language Models (LLMs) in conjunction with few-shot prompting techniques and employ orchestration frameworks to ensure actual integration with the survey platform. The solution should include a live demonstration interface that demonstrates the detection of errors immediately upon data entry.
Competition Framework
The competition adopts an “open participation” model at the national level:
- Operating model: A fully virtual (online) Hackathon to enable participation from all regions of Saudi Arabia, with the closing ceremony streamed through approved digital channels.
- Accepted Tools: Traditional programming languages (Python, R) and low-code development platforms, such as KNIME.

It is a national innovation platform aimed at developing intelligent technological solutions (utilizing artificial intelligence (AI) and the internet of things) to address challenges related to the collection and processing of statistical data. The Hackathon is one of the initiatives under the “Road to Riyadh” program launched by the General Authority for Statistics (GASTAT) as part of its preparations to host the United Nations World Data Forum 2026.
No. In its current edition, the Hackathon is a local initiative open exclusively to citizens and residents within the Kingdom of Saudi Arabia.
No. The Hackathon is fully virtual (online) across all its stages—from registration and technical development to final presentations and the closing ceremony—allowing participants to take part from any city in Saudi Arabia.
Each team must consist of three, four, or five members. Individual participation and teams with fewer than three members will not be accepted.
The Hackathon focuses on two tracks:
• Track one: Innovative Data Collection (utilizing emerging technologies).
• Track two: Intelligent Processing and Classification (leveraging artificial intelligence and automation).
Not necessarily. Diversity in team skill sets is encouraged. The use of low-code development platforms, such as KNIME Analytics Platform, is strongly supported, as they enable statisticians and analysts to develop AI-powered solutions without writing complex code.
Submissions will be evaluated electronically through the digital portal by a committee composed of five judges (two GASTAT experts and three external experts) to ensure transparency and objectivity.
According to the schedule below:
- Submission of ideas: 15 February–5 March 2026.
- Announcement of shortlisted teams: 29 March 2026.
- Technical implementation phase: 30 March–2 April 2026.
- Virtual judging: 14 April 2026.
- Announcement of winners: 15 April 2026.
A total of SAR 75,000 has been allocated for cash awards for the top six teams, distributed as follows:
- First Place is awarded to one team, receiving SAR 30,000.
- Second Place is awarded to two teams, each receiving SAR 15,000.
- Third Place is awarded to three teams, each receiving SAR 5,000.
The closing ceremony on 15 April will include only the announcement of the winners’ names. The actual disbursement of cash awards will take place at a separate event or on a date to be announced later.
Submissions must be made exclusively through the digital portal by completing the electronic “Executive Summary Form,” which includes a description of the problem, the proposed technical methodology, and the expected impact of the solution.
Yes. Shortlisted teams will be provided with sample datasets and technical support through dedicated digital channels to support them in successfully developing their prototypes.
All intellectual property rights to the technical and software solutions developed and delivered during the Hackathon (implementation phase) shall be owned by GASTAT.
Yes. Teams are required to attend the live online judging session. Failure to attend will result in disqualification from the evaluation process conducted by the judging committee.
Yes. As a mandatory component of the final submission process, shortlisted teams must submit a demonstration video not exceeding one minute in duration. The video must include a concise elevator pitch explaining how the selected challenge is addressed, along with a demonstration of the prototype’s functionality.
Register for the Data Innovation Hackathon
Unified number : 199009
Email Address : i.hackathon@stats.gov.sa