Program that can open LARGE .XML files

DoubleY · April 3, 2014

Probably way more effort than it's worth.

If you have access to a web server with MySQL and Phpmyadmin, you may be able to import the XML file as a MySQL DB (I vaguely remember this being possible, haven't used Phpmyadmin in ages), then use some code to display the database as paged results, then go through the results. You may also be able to split the database via Phpmyadmin into multiple separate databases (using some basic queries), then export each one individually as smaller XML files.

But, effort.

Well, my dad might have access to a few of those types of servers How does a 500Mb/s internet connection sound just for hosting a .XML file for myself sound? if only I could actually do that...

ultimatemythbuster · April 3, 2014

So... I downloaded Wikipedia in the form of a .XML file. It's around 40GB... I can't view the file because Notepad says it's too big. and MSFT word doesn't allow files larger than 512MB

Where did you download it from? I'd like to have a xml copy of Wikipedia, even though it will be immediately out of date.

DoubleY · April 3, 2014

Where did you download it from? I'd like to have a xml copy of Wikipedia, even though it will be immediately out of date.

This link will auto download it. Be careful though, you'll want 50gb of free space before you download it. When it uncompresses it makes a "temp" file that doesn't get deleted. http://download.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2

rashdanml · April 3, 2014

http://stackoverflow.com/questions/12612229/parsing-a-large-40gb-xml-text-file-in-python

The only other option is to use a programming language to parse the file line-by-line (without having to load the entire file).

ultimatemythbuster · April 3, 2014

So... I downloaded Wikipedia in the form of a .XML file. It's around 40GB... I can't view the file because Notepad says it's too big. and MSFT word doesn't allow files larger than 512MB

Taking a LONG time to extract that file. Didn't take long to download at 100Mbps

DoubleY · April 3, 2014

Taking a LONG time to extract that file. Didn't take long to download at 100Mbps

I know, it took me a while too

ultimatemythbuster · April 3, 2014

Probably way more effort than it's worth.

If you have access to a web server with MySQL and Phpmyadmin, you may be able to import the XML file as a MySQL DB (I vaguely remember this being possible, haven't used Phpmyadmin in ages), then use some code to display the database as paged results, then go through the results. You may also be able to split the database via Phpmyadmin into multiple separate databases (using some basic queries), then export each one individually as smaller XML files.

But, effort.

e: Querying a 40GB database might be problematic; most large databases (including large forums) are nowhere close to the 40GB mark.

I just happen to have exactly this. I'm going to have to try it.

ultimatemythbuster · April 3, 2014

So... I downloaded Wikipedia in the form of a .XML file. It's around 40GB... I can't view the file because Notepad says it's too big. and MSFT word doesn't allow files larger than 512MB

Found a program that might work. XMLmax I'm going to try it and see how it does.

http://www.xponentsoftware.com/TrialDownload.aspx

Ciccioo · April 3, 2014

http://www.swiftgear.com/ltfviewer/features.html

Sign In

Program that can open LARGE .XML files

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Link to comment

Share on other sites

Link to post

Share on other sites

Create an account or sign in to comment

Create an account

Sign in

Topics

Latest From Linus Tech Tips:

Google’s Best Feature In Years - WAN Show June 5, 2026

Latest From ShortCircuit:

The coolest looking monitor. Period. - ASUS ROG display at Computex (Sponsored)

Latest From TechLinked:

This Summer’s Lookin’ Steamy

Latest From GameLinked:

This Was A GOOD One...

Latest From Tech Quickie:

The Secret Council Behind Every Emoji

Latest From The WAN Show:

Google’s Best Feature In Years - WAN Show June 5, 2026