Skip to main content

Data Populator

The Intuition Data Populator is a specialized tool designed to help users and developers efficiently populate the Intuition knowledge graph with high-quality, structured data. This application streamlines the process of creating atoms, triples, and establishing meaningful relationships within the decentralized knowledge network.

Why did we create the Data Populator?

Do you have a large amount of data you want to add to the Intuition System, but not enough time or patience to manually create each data point as an atom or triple in the Portal? Look no further! The Data Populator V1 is here!

What it does, in brief

You can use the Data Populator to load CSV files which contain long lists of atom data. The user interface will allow you to perform some basic proofreading of the data, and when you're ready you can submit it to the Intuition System in one fell swoop. This works by batching all of the atom data together into very large EVM transactions, after automatically pinning and filtering the metadata onto IPFS. This is very similar to the behavior in the Portal App, except it's designed to work with much larger volumes of data.

Where you can access it

There are two deployments for the Data Populator - one for testing, which runs on Base Sepolia, and one for production which runs on Base.

It's strongly recommended, especially for first time users, to start off with the Testnet version to verify everything looks good with your data before spending real Base Ethereum on populating it.

Overview

The Data Populator serves as a bridge between traditional data sources and the Intuition ecosystem, enabling users to:

  • Bulk import data - Efficiently create multiple atoms and triples from structured datasets
  • Validate data quality - Ensure imported data meets Intuition's standards and requirements
  • Establish relationships - Create meaningful connections between imported entities
  • Maintain data integrity - Preserve the semantic structure and provenance of imported information

Key Features

🔄 Bulk Data Import

  • Import large datasets in various formats (CSV, JSON, XML)
  • Automated atom and triple creation from structured data
  • Batch processing for efficient handling of large datasets
  • Progress tracking and error reporting

Data Validation

  • Automatic validation of data structure and format
  • Quality checks for imported atoms and triples
  • Duplicate detection and resolution
  • Semantic consistency verification

🔗 Relationship Mapping

  • Automatic creation of triples based on imported relationships
  • Support for complex relationship structures
  • Mapping of traditional data schemas to Intuition's semantic model
  • Preservation of data provenance and metadata

🎯 Quality Assurance

  • Data quality scoring and reporting
  • Conflict resolution for conflicting claims
  • Automated tagging and categorization
  • Integration with Intuition's trust mechanisms

Use Cases

🎓 Academic Research

  • Import research datasets and establish connections between findings
  • Create atoms for research entities, methodologies, and conclusions
  • Build knowledge graphs for specific research domains
  • Enable collaborative research through shared knowledge structures

💼 Business Intelligence

  • Import organizational data and establish entity relationships
  • Create comprehensive knowledge graphs for business domains
  • Enable data-driven decision making through structured insights
  • Facilitate cross-organizational knowledge sharing

📚 Content Curation

  • Import content from various sources and establish semantic relationships
  • Create comprehensive knowledge graphs for specific domains
  • Enable content discovery and recommendation systems
  • Build curated collections of related information

Instruction Guide

Authentication

The first thing you will need to do, if you haven't already, is authenticate and log into the system. If you are not authenticated yet, you will be shown a Connect button. Click the connect button and authorize using your method of choice. Follow the prompts shown on screen to complete the log-in process.

User Interface Overview

After logging in, you'll see the main interface with:

  • Top menu bar: Contains a helpful tooltip toggle button and a history button
  • Version indicator: Shows which version of the Data Populator app you're using (TESTNET or MAINNET)
  • Four main tabs: Select Atom Type, Load CSV, Publish Atoms, and Tag Atoms

Select Atom Type

As of right now, you have 3 options here. Most users can stick with the default value of Schema.org <Thing> to create and tag basic metadata atoms. If you want to populate CAIP-10 atoms or Raw URI atoms, you can select either option from the drop-down before pressing Continue.

Schema.org <Thing>

We currently support 1 type from schema.org, the <Thing>. Most of the atoms in the Intuition Portal are **<Thing>**s. The <Thing> has 6 fields:

FieldDescription
@contextThis will always be https://schema.org - it's used to identify where the metadata type is defined
@typeThis will always be <Thing> - it's used to determine which metadata type is being used
nameThis is the name of your atom. This could be the name of an article, person, place, or thing
descriptionA description of your atom. This is a more in-depth blob of text which helps people understand the thing being referenced
imageA URL pointing to an image for your atom. This might point to a logo, a twitter avatar, or anything which visually demonstrates the thing being referenced
urlA URL pointing to a website for your atom. This could be a social media profile, the website for a business, an article, or anything else on the web

CAIP-10

A CAIP-10 is a string of text which is used to reference smart contracts. CAIP-10 atoms comprise of only one variable, the CAIP-10 itself. The CAIP-10 identifier is comprised of 4 : delimited values:

CAIP10 : [NAMESPACE] : [CHAIN_ID] : [ADDRESS]

Examples:

# Ethereum mainnet (canonicalized with EIP-55 checksum)
CAIP10:eip155:1:0xab16a96D359eC26a11e2C2b3d8f8B8942d5Bfcdb

# Bitcoin mainnet
CAIP10:bip122:000000000019d6689c085ae165831e93:128Lkh3S7CkDTBZ8W7BbpsN3YYizJMp8p6

# Cosmos Hub
CAIP10:cosmos:cosmoshub-3:cosmos1t2uflqwqe0fsj0shcfkrvpukewcw40yjj6hdc0

You can read more about the CAIP-10 standard here: https://chainagnostic.org/CAIPs/caip-10

Raw URI

The Intuition System also uses Raw URIs. These cannot be proofread, and are for advanced users only. One example use case for these is the https://schema.org/keywords Raw URI. The Intuition Portal uses this as a Predicate in Semantic Triples in order to denote that something has a Tag Keyword.

Load CSV

The Load CSV box allows you to load a .csv file from your PC which contains numerous Atoms.

Schema.org <Thing> CSV Example:

@context,@type,name,url,description,image
https://schema.org,Thing,Google,https://www.google.com,"A website where you can search for stuff.",https://google.com/google_logo.jpg
https://schema.org,Thing,Facebook,https://www.facebook.com,"A social media website.",https://facebook.com/facebook_logo.jpg

CAIP-10 CSV Example:

CAIP10
eip155:1:0x1234567890abcdef1234567890aBcdef12345678
eip155:1:0x1234567890abcfeF1234567890aBCDEf12345675
eip155:1:0x1234567890aBcdeF1234567890AbcDef12345672

Important Notes:

  • The column order, capitalization, and naming is crucial - ensure it matches the specification before exporting
  • If there's anything weird going on with your .csv file, the Data Populator will show you a Proofreading window which flags any unusual characters or errors
  • The application will attempt to fix any errors on its own, and can automatically checksum Ethereum addresses

Publish Atoms

The Publish Atoms tab shows you some basic information about the Atoms you have loaded. Once you've reviewed them, you can select them and publish them using the Publish Selected Atoms button.

Publishing Process:

  1. Select atoms using the checkbox on the left of their rows
  2. Press Publish Selected Atoms
  3. Confirm the transaction in the dialog
  4. Approve the EVM transaction in your wallet
  5. Wait for confirmation and see green "Atom Already Exists" icons

Atoms View

The Atoms View appears below all four tabs in the application, and displays the Atoms you currently have loaded. It has several features:

  • Edit and Preview: Cells can be selected and freely edited
  • Sort Data: Click on any column header to sort the data
  • Duplicate Detection: Identify partial duplicates before publishing
  • Delete Rows: Select unwanted rows and press Delete Row
  • Save CSV: Save a new copy to your PC after making edits
  • Add New Row: Manually enter atom data directly into the Data Populator

Tag Atoms

If you want to add your newly published Atoms to a List, you can use the Tag Atoms feature:

  1. Create the Tag: Fill out the tag metadata fields and press Create Tag
  2. Publish Tag: The tag will be published as another Atom in the system
  3. Tag Selected Atoms: Once the tag exists, press Tag Selected Atoms to associate your atoms with it

Important: Be precise while entering metadata. Each Atom's uniqueness is determined by the exact contents of its data.

History Window

If you press the View Your History button near the top-right corner of the screen, a new window will appear showing:

  • History of all the Atoms, Triples (Tags), and Requests that you've submitted
  • Ability to Save .csv files that you've previously submitted
  • Helpful when working on multiple machines or collaborating with others

Getting Started

Prerequisites

  • Access to the Intuition network
  • Structured data in supported formats
  • Understanding of Intuition's atom and triple concepts
  • Familiarity with the data you want to import

Installation

The Data Populator is available as a web application and can be accessed through the Intuition ecosystem.

Basic Workflow

  1. Prepare your data - Ensure your data is in a supported format and properly structured
  2. Validate data - Use the built-in validation tools to check data quality
  3. Map relationships - Define how your data relationships should be represented as triples
  4. Import data - Execute the import process with appropriate settings
  5. Review results - Check the imported atoms and triples for accuracy
  6. Iterate - Refine your import process based on results

Data Formats

📄 Supported Formats

  • CSV - Comma-separated values with headers
  • JSON - Structured JSON objects and arrays
  • XML - Extensible Markup Language documents
  • Excel - Microsoft Excel spreadsheets

📋 Data Structure Requirements

  • Clear entity identification
  • Relationship definitions
  • Metadata and provenance information
  • Quality indicators and confidence scores

Best Practices

Data Preparation

  • Clean and normalize your data before import
  • Ensure consistent naming conventions
  • Validate data quality and completeness
  • Document data sources and transformation processes

Relationship Mapping

  • Clearly define how traditional relationships map to triples
  • Consider the semantic meaning of relationships
  • Plan for relationship validation and verification
  • Account for relationship confidence and uncertainty

Quality Control

  • Implement comprehensive validation checks
  • Monitor import success rates and error patterns
  • Establish feedback loops for continuous improvement
  • Maintain data provenance and audit trails

Integration

With Intuition Portal

  • Imported data appears in the Portal for user interaction
  • Users can stake on imported atoms and triples
  • Integration with Portal's tagging and list features
  • Support for Portal's social features and user interactions

With Developer Tools

  • Programmatic access to imported data through APIs
  • Integration with custom applications and workflows
  • Support for automated data processing and analysis
  • Compatibility with Intuition's developer ecosystem

Advanced Features

Custom Mappings

  • Define custom mapping rules for complex data structures
  • Support for domain-specific ontologies and schemas
  • Flexible relationship mapping and transformation
  • Integration with external data sources and APIs

Automated Processing

  • Scheduled data imports and updates
  • Real-time data synchronization
  • Automated quality monitoring and reporting
  • Integration with external data pipelines

Analytics and Reporting

  • Import statistics and success rates
  • Data quality metrics and trends
  • Usage analytics and impact assessment
  • Performance monitoring and optimization

Development

The Data Populator is part of the Intuition ecosystem and follows the same development principles:

  • Open source - Available for community contribution and improvement
  • Modular design - Flexible architecture for custom extensions
  • API-first - Programmatic access for integration with other tools
  • Community-driven - Development guided by user needs and feedback

Getting Help

For support with the Data Populator:

Getting Started

🚀 Ready to populate your data?

Testnet Data Populator - Start here for testing

Mainnet Data Populator - Production environment

Portal - Main interface for interacting with the knowledge graph

Explorer - Network exploration and transaction monitoring

Developer Tools - Programmatic access and integration

API Documentation - Technical integration guides