Data Mapping Guide for MVP

Overview

Data mapping is the crucial foundation that enables interoperability between different components of the Skyocean platform. It defines how information moves between systems and ensures that data maintains its meaning and relationships throughout its lifecycle.

Why Data Mapping Matters:

  • Enables seamless communication between on-chain and off-chain systems
  • Preserves data integrity across different technological domains
  • Creates a standardized knowledge representation framework
  • Facilitates trusted knowledge asset trading without central intermediaries

Understanding Data Flow in Knowledge Asset Trading

Knowledge assets in Skyocean move through multiple systems with different data models:

  1. Creation and Registration - Data originates in user applications
  2. Transformation - Data is standardized into platform schemas
  3. Storage - Data is distributed across blockchain and DKG
  4. Verification - Data integrity is confirmed through multi-party validation
  5. Transaction - Asset ownership transfer is executed and recorded

Each transition requires precise data mapping to maintain consistency and meaning.

Architecture Components and Their Data Roles

1. Edge Node Layer

Purpose: Creates standardized knowledge asset representations in the DKG

Why it matters:

  • The Edge Node translates domain-specific data into universal, interoperable formats
  • Uses JSON-LD to create machine-readable, semantically rich data representations
  • Preserves contextual meaning of data through ontology references
  • Enables discoverability through knowledge graphs

Data transformation responsibilities:

  • Converting raw knowledge assets into JSON-LD format
  • Assigning unique identifiers compatible with DKG standards
  • Embedding semantic relationships through context definitions
  • Publishing assets to the DKG network

2. Middleware Layer

Purpose: Bridges the semantic web of the DKG with the deterministic world of blockchain

Why it matters:

  • Smart contracts require specific, well-defined data structures
  • DKG uses flexible, context-rich semantic structures
  • Without middleware translation, these systems cannot interoperate
  • Provides crucial abstraction that shields applications from underlying complexity

Data transformation responsibilities:

  • Extracting critical identifiers and metadata from DKG assets
  • Converting semantic properties to contract-compatible structures
  • Maintaining bidirectional references between on-chain and off-chain data
  • Handling state synchronization across systems

3. Smart Contract Layer

Purpose: Provides tamper-proof record of ownership, transfers, and verification status

Why it matters:

  • Blockchain data must be minimal for cost and efficiency reasons
  • Only essential data points and references are stored on-chain
  • Smart contracts enforce business rules and transaction logic
  • Blockchain provides the source of truth for ownership and verification status

Data storage responsibilities:

  • Maintaining minimal asset references with verification status
  • Recording ownership and transaction history
  • Storing cryptographic proofs of larger off-chain data sets
  • Emitting events that trigger middleware synchronization

4. Web Application Layer

Purpose: Translates complex underlying data into intuitive user interfaces

Why it matters:

  • End users interact with complex multi-system data through simplified interfaces
  • UI requires aggregation of data from multiple sources
  • User actions must be translated into technical operations across systems

Complete Transaction Data Workflow

The following diagram illustrates how data flows through a complete knowledge asset transaction:

Seller
WebApp
Middleware
EdgeNode
DKG
SmartContracts
Buyer
  1. Seller uploads knowledge asset to WebApp
  2. WebApp sends raw asset data to Middleware
  3. Middleware formats and sends asset metadata to EdgeNode
  4. EdgeNode creates JSON-LD asset and publishes to DKG
  5. DKG returns Universal Asset Locator (UAL) to EdgeNode
  6. EdgeNode passes UAL and metadata to Middleware
  7. Middleware registers asset on SmartContracts
  8. SmartContracts return Asset ID to Middleware
  9. Middleware confirms registration to WebApp
  10. WebApp confirms asset listing to Seller
  11. Buyer requests to purchase asset via WebApp
  12. WebApp sends purchase request to Middleware
  13. Middleware creates transaction record on SmartContracts
  14. SmartContracts return Transaction ID to Middleware
  15. Middleware requests asset details from EdgeNode
  16. EdgeNode queries asset from DKG by UAL
  17. DKG returns complete asset data to EdgeNode
  18. EdgeNode provides asset data to Middleware for Buyer
  19. Middleware sends transaction details to WebApp
  20. WebApp requests payment confirmation from Buyer
  21. Buyer sends payment via WebApp
  22. WebApp confirms payment to Middleware
  23. Middleware completes transaction on SmartContracts
  24. SmartContracts update ownership and notify Middleware
  25. Middleware requests EdgeNode to update DKG
  26. EdgeNode updates asset metadata in DKG
  27. Middleware confirms completion to WebApp
  28. WebApp grants access to purchased asset to Buyer

Schema Mapping Implementation Details

Why JSON-LD for DKG Integration?

JSON-LD (JSON for Linked Data) is critical for our platform because:

  1. Semantic Richness - Properties have well-defined meanings across systems
  2. Flexible Extension - New data types can be added without breaking existing systems
  3. Decentralized References - Enables linking to external knowledge systems
  4. Standards Compatibility - Aligns with W3C and schema.org standards
  5. Machine Readability - Supports automated processing and AI integration

1. Edge Node Mapping Process

// Location: edge-node/mappings/assetMapper.js
const mapKnowledgeAsset = (rawData) => {
  // Convert application-specific data to universal JSON-LD format
  return {
    "@context": {
      "schema": "http://schema.org/",  // Reuse existing standards
      "skyocean": "https://skyocean.io/ontology/"  // Custom domain ontology
    },
    "@type": "skyocean:KnowledgeAsset",  // Define asset type from our ontology
    "schema:identifier": rawData.assetId, // Unique ID
    "schema:name": rawData.title,
    "schema:description": rawData.description,
    "skyocean:assetType": rawData.type,  // Domain-specific property
    "skyocean:contentHash": rawData.hash, // Integrity verification
    "skyocean:ownerAddress": rawData.owner, // Blockchain reference
    "skyocean:creationDate": new Date().toISOString(), // Temporal metadata
    "skyocean:keywords": rawData.tags.map(tag => ({ "@value": tag })), // Searchable properties
    "skyocean:licenseType": rawData.license // Governance metadata
  }
};

What happens during mapping:

  1. Application-specific data is transformed into standardized format
  2. Properties are qualified with ontology prefixes (schema:, skyocean:)
  3. Complex structures (arrays, nested objects) are preserved
  4. Data validation occurs during mapping
  5. The result is semantically precise, machine-readable data

2. Middleware Transformation Process

// Location: middleware/transformers/dataTransformer.js

// Transform DKG data to blockchain-compatible format
const transformForBlockchain = (dkgData) => {
  // Extract only essential data for on-chain storage
  return {
    assetId: dkgData["schema:identifier"],
    metadata: {
      title: dkgData["schema:name"],
      description: dkgData["schema:description"],
      assetType: dkgData["skyocean:assetType"]
    },
    contentHash: dkgData["skyocean:contentHash"],
    owner: dkgData["skyocean:ownerAddress"]
  };
};

// Transform blockchain event data to DKG-compatible format
const transformFromBlockchain = (event) => {
  // Create or update DKG asset with blockchain data
  return {
    "@context": {
      "schema": "http://schema.org/",
      "skyocean": "https://skyocean.io/ontology/"
    },
    "@type": "skyocean:BlockchainTransaction",
    "skyocean:transactionHash": event.transactionHash,
    "skyocean:blockNumber": event.blockNumber,
    "skyocean:eventType": event.event,
    "skyocean:assetReference": event.args.assetId,
    "skyocean:newOwner": event.args.buyer,
    "skyocean:timestamp": new Date().toISOString()
  };
};

What happens during transformation:

  1. Complex DKG data is simplified for blockchain storage
  2. Only essential properties are extracted, reducing gas costs
  3. Blockchain events are captured and converted to DKG-compatible formats
  4. References are maintained to link on-chain and off-chain data
  5. Bidirectional mapping enables synchronization between systems

Ontology Design for Maximum Interoperability

Skyocean’s ontologies define the formal vocabulary for knowledge asset trading. They serve as the semantic contract between systems, ensuring consistent understanding of data across components.

Core Ontology Design Principles

  1. Reuse existing standards - Leverage schema.org, Dublin Core, and other established vocabularies
  2. Minimize custom terms - Only create custom terms for domain-specific concepts
  3. Hierarchical organization - Establish clear parent-child relationships between concepts
  4. Clear property definitions - Each property has explicit range, domain, and cardinality
  5. Version management - Support ontology evolution without breaking existing data

Core Ontologies

  1. Asset Ontology

    skyocean:KnowledgeAsset: description: “A tradable unit of knowledge with defined ownership and verification” subClassOf: schema:CreativeWork properties: - skyocean:assetType: description: “The type of knowledge asset (dataset, algorithm, document)” range: skyocean:AssetType - skyocean:contentHash: description: “Cryptographic hash of the asset content for verification” range: xsd:string - skyocean:ownerAddress: description: “Blockchain address of current owner” range: xsd:string - skyocean:verificationStatus: description: “Current verification state of the asset” range: skyocean:VerificationStatus - skyocean:tradingStatus: description: “Current trading state (listed, sold, restricted)” range: skyocean:TradingStatus

  2. Transaction Ontology

    skyocean:TradeTransaction: description: “Record of knowledge asset transfer between parties” properties: - skyocean:transactionId: description: “Unique identifier of the transaction” range: xsd:string - skyocean:buyer: description: “Entity acquiring the knowledge asset” range: schema:Person or schema:Organization - skyocean:seller: description: “Entity transferring the knowledge asset” range: schema:Person or schema:Organization - skyocean:assetReference: description: “Reference to the traded knowledge asset” range: skyocean:KnowledgeAsset - skyocean:status: description: “Current state of the transaction” range: skyocean:TransactionStatus - skyocean:documents: description: “Supporting documents for the transaction” range: skyocean:Document cardinality: zero or more

Implementation in Different System Layers

1. Edge Node Implementation

The Edge Node uses the full ontological structure:

// Location: edge-node/config/ontology.js
const ontologyConfig = {
  baseContext: "https://skyocean.io/ontology/",
  mappings: {
    KnowledgeAsset: {
      type: "Asset",
      properties: [
        "identifier",
        "name",
        "description",
        "assetType",
        "contentHash",
        "ownerAddress",
        "verificationStatus",
        "tradingStatus",
        "creationDate",
        "keywords",
        "licenseType"
      ]
    },
    TradeTransaction: {
      type: "Transaction",
      properties: [
        "transactionId",
        "buyer",
        "seller",
        "assetReference",
        "status",
        "documents",
        "timestamp",
        "transactionHash",
        "blockNumber"
      ]
    }
  }
};

2. Smart Contract Implementation

Smart contracts use a minimal representation of the ontology:

// Location: contracts/KnowledgeAssetRegistry.sol
struct AssetMetadata {
    string assetType;
    string contentHash;
    address owner;
    VerificationStatus status;
}

enum VerificationStatus {
    Unverified,
    Pending,
    Verified,
    Rejected
}

// Mapping from asset ID to metadata
mapping(string => AssetMetadata) public assets;

Practical Implementation Process

1. Ontology Definition First

Before writing code, define the ontology:

  1. Identify key concepts (Asset, Transaction, Document)
  2. Define properties for each concept
  3. Establish relationships between concepts
  4. Document in machine-readable format (RDF, OWL)
  5. Publish ontology at persistent URL

2. Schema Implementation

After ontology definition:

  1. Create JSON-LD contexts that reference the ontology
  2. Define schema mappings for each data source
  3. Implement validation rules for schemas
  4. Create example instances for testing

3. Middleware Development

With schemas defined:

  1. Implement transformation functions between schemas
  2. Create validation middleware
  3. Build API endpoints for data access
  4. Develop event handlers for blockchain events

4. Integration Testing

End-to-end testing for data flow:

  1. Create test assets with known properties
  2. Trace data transformation through each system
  3. Verify property preservation
  4. Test roundtrip conversions
  5. Validate interoperability with external systems

Why This Approach Ensures Interoperability

  1. Semantic Foundation - Ontologies provide shared understanding across systems
  2. Standard Formats - JSON-LD enables interconnection with semantic web
  3. Bidirectional Mapping - Data can flow between blockchain and DKG
  4. Clear Boundaries - Each system has well-defined data responsibilities
  5. Explicit Transformations - All data changes are documented and traceable

Next Steps for MVP Implementation

  1. Define and publish the core Skyocean ontology
  2. Implement base schema mappings for assets and transactions
  3. Develop Edge Node configuration for DKG integration
  4. Create middleware transformers for blockchain integration
  5. Integrate with the web application
  6. Run end-to-end tests for complete transactions
  7. Monitor and optimize data flow through all systems

References


Back to top

Copyright © 2023 SKYOCEAN. All rights reserved.