Data Populator

The Intuition Data Populator is a specialized tool designed to help users and developers efficiently populate the Intuition knowledge graph with high-quality, structured data. This application streamlines the process of creating atoms, triples, and establishing meaningful relationships within the decentralized knowledge network.

Why did we create the Data Populator?

Do you have a large amount of data you want to add to the Intuition System, but not enough time or patience to manually create each data point as an atom or triple in the Portal? Look no further! The Data Populator V1 is here!

What it does, in brief

You can use the Data Populator to load CSV files that contain long lists of atom data. The user interface lets you do some basic proofreading of the data, and when you're ready you can submit it to the Intuition System in one fell swoop. It works by automatically filtering the metadata and pinning it to IPFS, then batching all of the atom data together into very large EVM transactions. This is very similar to the behavior of the Portal App, except that it's designed to work with much larger volumes of data.

Where you can access it

There are two deployments of the Data Populator: one for testing, which runs on Base Sepolia, and one for production, which runs on Base.

Testnet Data Populator: https://data-populator-dev.onrender.com
Mainnet Data Populator: https://upload.intuition.systems

It's strongly recommended, especially for first-time users, to start with the Testnet version and verify that everything looks good with your data before spending real ETH on Base to populate it.

Overview

The Data Populator serves as a bridge between traditional data sources and the Intuition ecosystem, enabling users to:

Bulk import data - Efficiently create multiple atoms and triples from structured datasets
Validate data quality - Ensure imported data meets Intuition's standards and requirements
Establish relationships - Create meaningful connections between imported entities
Maintain data integrity - Preserve the semantic structure and provenance of imported information

Key Features

🔄 Bulk Data Import

Import large datasets in various formats (CSV, JSON, XML)
Automated atom and triple creation from structured data
Batch processing for efficient handling of large datasets
Progress tracking and error reporting

✅ Data Validation

Automatic validation of data structure and format
Quality checks for imported atoms and triples
Duplicate detection and resolution
Semantic consistency verification

🔗 Relationship Mapping

Automatic creation of triples based on imported relationships
Support for complex relationship structures
Mapping of traditional data schemas to Intuition's semantic model
Preservation of data provenance and metadata

🎯 Quality Assurance

Data quality scoring and reporting
Conflict resolution for conflicting claims
Automated tagging and categorization
Integration with Intuition's trust mechanisms

Use Cases

🎓 Academic Research

Import research datasets and establish connections between findings
Create atoms for research entities, methodologies, and conclusions
Build knowledge graphs for specific research domains
Enable collaborative research through shared knowledge structures

💼 Business Intelligence

Import organizational data and establish entity relationships
Create comprehensive knowledge graphs for business domains
Enable data-driven decision making through structured insights
Facilitate cross-organizational knowledge sharing

📚 Content Curation

Import content from various sources and establish semantic relationships
Create comprehensive knowledge graphs for specific domains
Enable content discovery and recommendation systems
Build curated collections of related information

Instruction Guide

Authentication

The first thing you will need to do, if you haven't already, is authenticate and log into the system. If you are not authenticated yet, you will be shown a Connect button. Click the connect button and authorize using your method of choice. Follow the prompts shown on screen to complete the log-in process.

User Interface Overview

After logging in, you'll see the main interface with:

Top menu bar: Contains a helpful tooltip toggle button and a history button
Version indicator: Shows which version of the Data Populator app you're using (TESTNET or MAINNET)
Four main tabs: Select Atom Type, Load CSV, Publish Atoms, and Tag Atoms

Select Atom Type

As of right now, you have 3 options here. Most users can stick with the default value of Schema.org <Thing> to create and tag basic metadata atoms. If you want to populate CAIP-10 atoms or Raw URI atoms, you can select either option from the drop-down before pressing Continue.

Schema.org <Thing>

We currently support one type from schema.org, the <Thing>. Most of the atoms in the Intuition Portal are <Thing>s. The <Thing> has 6 fields:

Field	Description
@context	This will always be https://schema.org - it's used to identify where the metadata type is defined
@type	This will always be Thing (written as <Thing> in this guide) - it's used to determine which metadata type is being used
name	This is the name of your atom. This could be the name of an article, person, place, or thing
description	A description of your atom. This is a more in-depth blob of text which helps people understand the thing being referenced
image	A URL pointing to an image for your atom. This might point to a logo, a twitter avatar, or anything which visually demonstrates the thing being referenced
url	A URL pointing to a website for your atom. This could be a social media profile, the website for a business, an article, or anything else on the web
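
To make the field list concrete, here is a minimal sketch of a <Thing> record as a Python dictionary. The helper name `make_thing` is hypothetical, and whether the Data Populator serializes the metadata to exactly this JSON shape is an assumption; only the six field names and the two fixed values come from the table above.

```python
import json


def make_thing(name: str, description: str = "", image: str = "", url: str = "") -> dict:
    """Build a record with the six <Thing> fields from the table above."""
    return {
        "@context": "https://schema.org",  # fixed: where the type is defined
        "@type": "Thing",                  # fixed: which type is being used
        "name": name,
        "description": description,
        "image": image,
        "url": url,
    }


# Values taken from the CSV example in the Load CSV section.
thing = make_thing(
    name="Google",
    description="A website where you can search for stuff.",
    image="https://google.com/google_logo.jpg",
    url="https://www.google.com",
)
print(json.dumps(thing, indent=2))
```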

CAIP-10

A CAIP-10 identifier is a string of text used to reference a blockchain account, such as a smart contract. A CAIP-10 atom consists of only one value, the CAIP-10 identifier itself. As used here, the identifier is made up of 4 colon-delimited values:

CAIP10 : [NAMESPACE] : [CHAIN_ID] : [ADDRESS]

Examples:

# Ethereum mainnet (canonicalized with EIP-55 checksum)
CAIP10:eip155:1:0xab16a96D359eC26a11e2C2b3d8f8B8942d5Bfcdb

# Bitcoin mainnet
CAIP10:bip122:000000000019d6689c085ae165831e93:128Lkh3S7CkDTBZ8W7BbpsN3YYizJMp8p6

# Cosmos Hub
CAIP10:cosmos:cosmoshub-3:cosmos1t2uflqwqe0fsj0shcfkrvpukewcw40yjj6hdc0

You can read more about the CAIP-10 standard here: https://chainagnostic.org/CAIPs/caip-10
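
The four-part layout shown above can be checked with a few lines of Python. This is a sketch only: `Caip10` and `parse_caip10` are hypothetical names, and the function checks the shape of the string, not whether the namespace, chain ID, or address are valid for any particular chain.

```python
from typing import NamedTuple


class Caip10(NamedTuple):
    namespace: str
    chain_id: str
    address: str


def parse_caip10(text: str) -> Caip10:
    """Split a 'CAIP10:[NAMESPACE]:[CHAIN_ID]:[ADDRESS]' string into its parts."""
    parts = text.strip().split(":")
    if len(parts) != 4 or parts[0] != "CAIP10":
        raise ValueError(
            f"expected CAIP10:[NAMESPACE]:[CHAIN_ID]:[ADDRESS], got {text!r}"
        )
    _, namespace, chain_id, address = parts
    if not (namespace and chain_id and address):
        raise ValueError(f"empty component in {text!r}")
    return Caip10(namespace, chain_id, address)


# The Ethereum mainnet example from above.
print(parse_caip10("CAIP10:eip155:1:0xab16a96D359eC26a11e2C2b3d8f8B8942d5Bfcdb"))
```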

Raw URI

The Intuition System also uses Raw URIs. These cannot be proofread, and are for advanced users only. One example use case for these is the https://schema.org/keywords Raw URI. The Intuition Portal uses this as a Predicate in Semantic Triples in order to denote that something has a Tag Keyword.

Load CSV

The Load CSV box allows you to load a .csv file from your PC which contains numerous Atoms.

Schema.org <Thing> CSV Example:

@context,@type,name,url,description,image
https://schema.org,Thing,Google,https://www.google.com,"A website where you can search for stuff.",https://google.com/google_logo.jpg
https://schema.org,Thing,Facebook,https://www.facebook.com,"A social media website.",https://facebook.com/facebook_logo.jpg

CAIP-10 CSV Example:

CAIP10
eip155:1:0x1234567890abcdef1234567890aBcdef12345678
eip155:1:0x1234567890abcfeF1234567890aBCDEf12345675
eip155:1:0x1234567890aBcdeF1234567890AbcDef12345672

Important Notes:

The column order, capitalization, and naming are crucial - ensure they match the specification before exporting
If there's anything weird going on with your .csv file, the Data Populator will show you a Proofreading window which flags any unusual characters or errors
The application will attempt to fix any errors on its own, and can automatically checksum Ethereum addresses
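
Because the header row has to match exactly, it can be worth checking a file locally before loading it. The sketch below is not part of the Data Populator; `check_csv` and `EXPECTED_HEADERS` are hypothetical names, and the expected headers are simply copied from the two CSV examples above.

```python
import csv
import io

# Header rows copied from the CSV examples above (assumed to be the required form).
EXPECTED_HEADERS = {
    "thing": ["@context", "@type", "name", "url", "description", "image"],
    "caip10": ["CAIP10"],
}


def check_csv(csv_text: str, atom_type: str) -> list[str]:
    """Return a list of problems found; an empty list means the shape looks right."""
    expected = EXPECTED_HEADERS[atom_type]
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader, [])
    problems = []
    # Order, capitalization, and naming must all match, so compare the lists directly.
    if header != expected:
        problems.append(f"header is {header}, expected {expected}")
    for row_no, row in enumerate(reader, start=2):
        if row and len(row) != len(expected):
            problems.append(
                f"row {row_no}: {len(row)} columns, expected {len(expected)}"
            )
    return problems


sample = (
    "@context,@type,name,url,description,image\n"
    "https://schema.org,Thing,Google,https://www.google.com,"
    '"A website where you can search for stuff.",https://google.com/google_logo.jpg\n'
)
print(check_csv(sample, "thing"))
```

An empty list means the header and column counts match; the Proofreading window remains the authority on whether the contents themselves are acceptable.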

Publish Atoms

The Publish Atoms tab shows you some basic information about the Atoms you have loaded. Once you've reviewed them, you can select them and publish them using the Publish Selected Atoms button.

Publishing Process:

Select atoms using the checkbox on the left of their rows
Press Publish Selected Atoms
Confirm the transaction in the dialog
Approve the EVM transaction in your wallet
Wait for confirmation and see green "Atom Already Exists" icons

Atoms View

The Atoms View appears below all four tabs in the application, and displays the Atoms you currently have loaded. It has several features:

Edit and Preview: Cells can be selected and freely edited
Sort Data: Click on any column header to sort the data
Duplicate Detection: Identify partial duplicates before publishing
Delete Rows: Select unwanted rows and press Delete Row
Save CSV: Save a new copy to your PC after making edits
Add New Row: Manually enter atom data directly into the Data Populator
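
As an illustration of what a partial duplicate might look like, the sketch below groups rows whose names match once case and whitespace are ignored but whose full contents differ. How the Data Populator actually defines a partial duplicate is not specified here, so this rule, and the names `normalize` and `find_partial_duplicates`, are assumptions.

```python
from collections import defaultdict


def normalize(value: str) -> str:
    """Collapse whitespace and case so near-identical names compare equal."""
    return " ".join(value.split()).casefold()


def find_partial_duplicates(rows: list[dict], key: str = "name") -> list[list[dict]]:
    """Group rows whose `key` matches after normalization but whose contents differ."""
    groups = defaultdict(list)
    for row in rows:
        groups[normalize(row.get(key, ""))].append(row)
    result = []
    for group in groups.values():
        # Exact copies collapse to one entry here, so only differing rows are flagged.
        distinct = {tuple(sorted(row.items())) for row in group}
        if len(distinct) > 1:
            result.append(group)
    return result


rows = [
    {"name": "Google", "url": "https://www.google.com"},
    {"name": "google ", "url": "https://www.google.com"},  # trailing space, lowercase
    {"name": "Facebook", "url": "https://www.facebook.com"},
]
for group in find_partial_duplicates(rows):
    print(group)
```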

Tag Atoms

If you want to add your newly published Atoms to a List, you can use the Tag Atoms feature:

Create the Tag: Fill out the tag metadata fields and press Create Tag
Publish Tag: The tag will be published as another Atom in the system
Tag Selected Atoms: Once the tag exists, press Tag Selected Atoms to associate your atoms with it

Important: Be precise while entering metadata. Each Atom's uniqueness is determined by the exact contents of its data.
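
The Raw URI section notes that the Portal uses https://schema.org/keywords as the predicate that marks something as tagged. Assuming the Tag Atoms feature follows the same pattern, tagging can be pictured as one triple per selected atom. In this sketch `Triple` and `tag_atoms` are hypothetical names, and atoms are represented by their names purely for readability, which is not how they are identified in the system.

```python
from typing import NamedTuple

# Predicate named in the Raw URI section; its use by Tag Atoms is an assumption.
KEYWORDS_PREDICATE = "https://schema.org/keywords"


class Triple(NamedTuple):
    subject: str    # the atom being tagged
    predicate: str  # the "has tag" relationship
    obj: str        # the tag atom


def tag_atoms(atom_names: list[str], tag_name: str) -> list[Triple]:
    """Build one (atom, keywords, tag) triple per selected atom."""
    return [Triple(name, KEYWORDS_PREDICATE, tag_name) for name in atom_names]


for triple in tag_atoms(["Google", "Facebook"], "Well-known websites"):
    print(triple)
```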

History Window

If you press the View Your History button near the top-right corner of the screen, a new window will appear showing:

History of all the Atoms, Triples (Tags), and Requests that you've submitted
Ability to Save .csv files that you've previously submitted
Helpful when working on multiple machines or collaborating with others

Getting Started

Prerequisites

Access to the Intuition network
Structured data in supported formats
Understanding of Intuition's atom and triple concepts
Familiarity with the data you want to import

Installation

The Data Populator is available as a web application and can be accessed through the Intuition ecosystem.

Basic Workflow

Prepare your data - Ensure your data is in a supported format and properly structured
Validate data - Use the built-in validation tools to check data quality
Map relationships - Define how your data relationships should be represented as triples
Import data - Execute the import process with appropriate settings
Review results - Check the imported atoms and triples for accuracy
Iterate - Refine your import process based on results

Data Formats

📄 Supported Formats

CSV - Comma-separated values with headers
JSON - Structured JSON objects and arrays
XML - Extensible Markup Language documents
Excel - Microsoft Excel spreadsheets

📋 Data Structure Requirements

Clear entity identification
Relationship definitions
Metadata and provenance information
Quality indicators and confidence scores

Best Practices

Data Preparation

Clean and normalize your data before import
Ensure consistent naming conventions
Validate data quality and completeness
Document data sources and transformation processes

Relationship Mapping

Clearly define how traditional relationships map to triples
Consider the semantic meaning of relationships
Plan for relationship validation and verification
Account for relationship confidence and uncertainty

Quality Control

Implement comprehensive validation checks
Monitor import success rates and error patterns
Establish feedback loops for continuous improvement
Maintain data provenance and audit trails

Integration

With Intuition Portal

Imported data appears in the Portal for user interaction
Users can stake on imported atoms and triples
Integration with Portal's tagging and list features
Support for Portal's social features and user interactions

With Developer Tools

Programmatic access to imported data through APIs
Integration with custom applications and workflows
Support for automated data processing and analysis
Compatibility with Intuition's developer ecosystem

Advanced Features

Custom Mappings

Define custom mapping rules for complex data structures
Support for domain-specific ontologies and schemas
Flexible relationship mapping and transformation
Integration with external data sources and APIs

Automated Processing

Scheduled data imports and updates
Real-time data synchronization
Automated quality monitoring and reporting
Integration with external data pipelines

Analytics and Reporting

Import statistics and success rates
Data quality metrics and trends
Usage analytics and impact assessment
Performance monitoring and optimization

Development

The Data Populator is part of the Intuition ecosystem and follows the same development principles:

Open source - Available for community contribution and improvement
Modular design - Flexible architecture for custom extensions
API-first - Programmatic access for integration with other tools
Community-driven - Development guided by user needs and feedback

Getting Help

For support with the Data Populator:

Check the Intuition documentation for detailed guides
Visit the GitHub repository for technical details
Join the community discussions for user support
Review the API documentation for integration help

Getting Started

🚀 Ready to populate your data?

Testnet Data Populator - Start here for testing

Mainnet Data Populator - Production environment

Portal - Main interface for interacting with the knowledge graph

Explorer - Network exploration and transaction monitoring

Developer Tools - Programmatic access and integration

API Documentation - Technical integration guides
