With the build-up of websites from the 1990s on, many large repositories of unstructured text were created in HTML.
Now those same repositories are getting increasingly difficult to index and catalog for search and retrieval.
The answer is XML, but extracting and transforming vast quantities of text from HTML (a layout/markup language) to XML (a data description language) is a daunting task. Beyond the prettyprint and tidy utilities is a more challenging task to create the correct XML structure and tags for the data. Fortunately, Mobilize.Net has found a way to automate the process.
The Mobilize.Net solution is highly automated through file crawlers, HTML parsers, and our tried and tested migration engine. The Mobilize.Net migration is customizable which enables creating mappings that reflect your specific XML schema.
In addition we preserve all of the embedded files including images, audio, video and more.
SEE LATEST POST
Chances are good that we may already answer your technical questions.
If not, we promise to answer immediately.
NTCNA Chassis Dynamics chose Mobilize VBUC because the automated migration technology greatly sped up our move off VB6.
We ran a proof of concept comparing the Visual Basic Upgrade Companion (VBUC) with other VB6 migration tools and we definitely preferred the way VBUC handled the conversion.