
From Transactions to AI/BI Ready Data
TxFlow — AI Embedded Data Pipeline by Alpina Analytics
Banks sit on a goldmine of interbank data in the form of SWIFT MT and ISO 20022 messages. Each message holds valuable insights, and when aggregated they reveal patterns across business processes, liquidity, risk, client activity, and counterparty behavior. Yet in their raw form this data is complex and fragmented, leaving much of its value untapped.
Starting on 22 November 2025, institutions will have one year, until 14 November 2026, to transition away from unstructured postal addresses and adopt hybrid formats under ISO 20022. Alpina Analytics tackled ISO 20022 migration years ago in Switzerland, with solutions that remain in production today across Tier 1 environments. TxFlow builds on this experience to help institutions turn compliance requirements into a data advantage, transforming unstructured information into consistent, enriched, and AI/BI ready datasets.
At a Glance
- ✅ Convert unstructured to hybrid, structured, or geolocated formats
- ✅ Enrich critical fields inside (e.g. country code) and outside addresses (e.g. BICs, LEIs)
- ✅ Deploy as a scalable, on prem or cloud native Kubernetes cluster
- ✅ Integrates within full data engineering pipelines, writing to destinations such as BigQuery
- ✅ Proven in Tier 1 banking environments processing over 1 million transactions
- ✅ >99% accuracy on address parsing
- ✅ >20% fewer false positives and 0% false negatives (!) in sanctions screening
- ✅ Supported by experts in AI, data, and systems engineering
TxFlow brings automation and intelligence into the data preparation layer, bridging the gap between messaging standards and advanced analytics.
Core Capabilities
- Complete parsing: Extracts all blocks, sections, fields, and subfields from any message type into one consistent and machine readable data model.
- Postal address structuring: Converts unstructured or hybrid addresses into structured and enriched ISO 20022 formats, with geo referencing using authoritative geodata sources.
- Quality assurance: Flags incomplete or implausible address components by checking for geo consistency and basic plausibility against external geodata.
- Simple API: Input the raw message and receive structured output, ideal for rapid prototyping and seamless integration.
- Structured output: Expands nested fields into JSON trees or SQL and Excel ready tables for BI and ML workflows.
- On demand enrichment:Adds BICs/LEIs details, time zones, geolocation, and entity recognition using geodata and AI based enrichment logic.
See more in our Knowledge Base.
Example: Address Structuring
One of the toughest challenges in the ISO 20022 migration, and now in the hybrid address phase, is address quality and consistency across messages and systems. TxFlow converts unstructured address data into structured and geo referenced formats, improving completeness and data quality at scale.
The challenge continues even after migration:
- Need for structured postal addresses across all message types
- Growing demand for machine readable formats and geo coordinates
- Only AI driven parsing and enrichment can address these complexities at scale
Production Readiness
TxFlow runs as a production grade, scalable Kubernetes cluster, proven in Tier 1 banking environments. It processes high volume transaction data, enriches addresses using authoritative geodata, and promotes completeness and consistency across millions of records. Performance has been validated on large scale datasets with over millions of transactions, achieving more than 99 percent accuracy on address parsing. In sanctions screening scenarios, TxFlow reduced false positives by 20 percent and achieved zero false negatives, ensuring that no sanctioned payment is missed. It can be deployed as part of a full data engineering pipeline writing to BigQuery or other destinations, as shown in our example here. TxFlow can be procured for on premise installations or through one click deployment on the Google Cloud Marketplace.
Use Cases
Operations: Leaner and faster processes with standardized data.
- Payments – Improve data quality and readiness for ISO 20022 adoption (e.g. CBPR+, SEPA, CHAPS, MEPS+).
- Securities – Streamline settlement matching and reporting (CSDR).
Compliance: Faster, more reliable checks with enriched and structured data.
- Enhance sanctions screening by reducing false positives.
- Detect suspicious flows through AML and CTF monitoring with geo location and AI enrichment.
Front Office and Treasury: Actionable insights from transaction data.
- Improve customer profiling with KYT and activity monitoring.
- Support treasury and corporate banking with liquidity forecasting and flow analysis.
Flexible Integration
- Transactional: Output consumable by your core banking platform or Transformer engine.
- Custom Analytics: Deliver data to your preferred database or use Alpina’s recommended storage options.
- Modularity: Keep downstream usage fully under your control.
- AI Synergy: Combine TxFlow with TxAgent for natural language analytics and reporting.
MT Is Not Dead
The global migration to ISO 20022 is reshaping payments, yet according to SWIFT many transactions remain out of scope. TxFlow supports both SWIFT MT and ISO 20022, ensuring business continuity today and future proofing for tomorrow.
The MT standard remains essential for:
- Securities post trade
- Clearing and settlement
- Corporate actions
- FX
- Treasury and trade finance
More details on TxFlow features here and on cloud integration here.