QXmlEdit, a simple XML editor, is born to explore Qt libraries and then gradually added features becoming a sort of swiss army knife.
In its latest release, 0.5.3, there is a tool to split XML files of any size in smaller fragments. It is also possible to examine a fragment at time in the user interface, without loading it completely in memory.
You can download QXmlEdit at the following link: http://code.google.com/p/qxmledit/downloads
This little tutorial, extracted from the manual, shows how to use the file splitter in details:
When starting, the program will open a welcome dialog leading to the most common operations. Choose “Split a file”. If you choose to hide the dialog next time the program restart and change your mind, you can reactivate it in the configuration panel. You can access this feature also from the menu Tools.
Decide if you want to extract the information or navigate them
If you want only navigate the data without creating any files, uncheck ‘Extract Documents‘. Even if you don’t extract the XML fragments into smaller file, QXmlEdit scans the file and records the fragments for a review.
Choose the file and how to fragment it
Browse to the file to examine, the insert the XPath that identifies a fragment. Insert a string of the form /ROOT/DATA and so on.
Limit the extracted fragments number
Selecting a range permits to extract only a small subset of the original file. You are not obliged to waste space on your disk if you want only a single fragment.
Decide the output folder and the naming
Enter the location where write the extracted fragments and decide if you want to create a folder each N files. Decide also how to name the files and the folders. The combo boxes have some predefined values to ease the task, like a timestamp, or a progressive counter.
Go and examine data
After the start of the operation, in the the main window a navigation box appears. The XML fragments found in the input file are accessible directly with a random access using the information collected in the previous phases. If you choose to split the file in fragments, you can examine them in the extraction directory.