InfoLink Data Integration

A complete set of fully integrated features for low-code data integration.

Connectivity

80+ native connectors to cloud and on-premise apps:

  • Universal user and API to load data from CRMs, databases, files in any format (Excel, CSV, XML), and more.
  • Optimized bulk load.
  • Incremental updates with Change Data Capture support.
  • Update and enrich data in data source.

Discovery

Builds a complete and accurate picture of enterprise data across data sources and formats:

  • Meta-data search and navigation through universal user interface
  • Data preview from any source and in any format.
  • Advance data profiler: column value distribution, uniqueness and completeness, join profiler, discovering relationships.
  • Integrated user interface for in-place interactions.

Transform and blend data

Complete set of operation and tools to manipulate your data:

  • Easy to use drag-n-drop code-less data blending and transformations.
  • Compete set of operations ranging from simple to sophisticated: filter, join, transform, standardize, pivot, fuzzy match, and dozens more.
  • Integrated user interface for in-place data crunching without switching windows.
  • Diagram-style workflows with point-n-click interactions
  • Handle large amounts of data with efficient in-source computations.
  • Automate via scheduling repeatable workflows

Data Quality

Rich set of tools to ensure all your data is clean and ready to provide the trusted insights:

  • Data validation to check uniquence, reference integrity, record validity, format checker, aggregate constraints.
  • Data standardization based on rules and reference data
  • Address validation and standardization with international support.
  • Market’s most advanced probabilistic fuzzy matching technology for deduplication (single-source) and linking (two-source).
  • Holistic data steward user interface designed for the business user.

High Performance and Scalability

Scales to any data volume and provides industry-leading performance:​

  • Inter-operation parallelism allows concurrent execution of logically independent operations significantly reducing total execution time.
  • Intra-operation parallelism in which a single operation decomposed into smaller tasks that are executed concurrently on partitions of input data. Operation decomposition and data partitioning are done automatically and does not require complex configuration.
  • In-database execution is a powerful principle in the core of InfoLink design that maximizes performance and utilization of your existing infrastructure. It pushes down operation execution to a wide range of relational, columnar, NoSQL and distributed systems including Hadoop and Spark. In-database execution is transparent to the user: the same InfoLink operations can be executed on any of the systems listed above. It greatly improves performance by not moving the data out of a database. It also allows choosing a system that can handle your transformations data volumes in the most efficient way.
  • InfoLink implements its own native distributed mechanism that allows you to easily deploy InfoLink on many servers, partition your data across the servers, dispatch distributed tasks, gather the results, and monitor the whole process from a single interface.
  • Intelligent optimizer automatically chooses the most efficient execution method analyzing properties of your data and capabilities of the underlining source/staging systems.