In every change of technologies or systems, large amounts of data usually have to be transformed or exchanged as well, sometimes even beyond system boundaries. Also, data is frequently stored within inappropriate tools, such as Excel instead of proper databases, and needs to be bulk-ported and integrated into new systems.
Extraction of data from a variety of sources, consolidation of different data formats and the elimination of duplicates, as well as merging certain metadata, is called data integration.
Sometimes, proprietary software does not allow to export its data into machine-processable formats. Data integration ensures that nothing is copied manually, but various tools are used, such as databases, PIM software, or scripts and common Unix tools.
For all data sources, appropriate integration strategies must be applied. Finally, all relevant data can be integrated into a single, new database. And since we profoudly know various database formats (and various SQL dialects), content management systems (Typo3, Wordpress, Wikis) and also file formats (such as XML and their automated transformation through XPath, XQuery and XSLT), we have broad experience in data integration.
We also already have mined databases for products, user data, keywords, categories, lemmas or geolocation data. If not already available, we are quickly able to create such databases by data extraction or data mining.