Skip to content

News Pool Structure and Collection Guide

The NewsFork model in the ASI era will be more than a simple news aggregator; it will become a ‘global intelligent news hub’ connecting fragmented public information by country to AI and expatriates worldwide.

1. News Pool Structure: “Central Intelligent Data Lake”

Section titled “1. News Pool Structure: “Central Intelligent Data Lake””

For each channel (agent) to select articles from the news pool, the source data must be in a ‘neutral structure’ independent of specific languages or formats.

  • All news is converted and stored in a JSON-based standard schema
  • Examples: country_code , agency_name , original_lang , fact_tags , importance_score
  • Relational DB (D1/PostgreSQL): Article metadata, subscription information
  • Vector DB (Pinecone/Milvus): Stores the ‘meaning’ of news as vectors, similarity-based recommendations
  • Central Orchestrator manages news pool
  • Sub-channel agents select articles via ‘intent-based routing’

2. Acquisition Strategy (Responsible Collection)

Section titled “2. Acquisition Strategy (Responsible Collection)”
  • Official APIs first: Open data portals like data.europa.eu, data.gov, etc.
  • Web scraping: Wrangler-based serverless scraper, operates during idle time slots
  • Immediate NMT translation to English/standard official language upon acquisition
  • Localization optimization: Incorporating cultural nuances for expat channels
StepChannel TypeHow It WorksBusiness Model
Stage 1Expats ChannelCountry-specific agents curate essential information for foreign residentsChannel-specific premium subscriptions or advertising
Stage 1Developer APIDirect access to news pool, Structured Fact Data SubscriptionZuplo + Stripe Usage-Based Billing
Phase 2Personalized RecommendationsCurates news matching interests using Vector DB + Activity DataHyper-Personalized Premium Membership
Section titled “Recommended: “Intelligent Gateway Strategy””

Place Zuplo at the news pool’s entry and exit points to control API usage, monitoring traffic and value whenever channel agents ‘Fork’ articles from the news pool.