Help on postback webpages, with C++

I am making a program in C++ that needs to download HTML files and extract info from them. The program tries to work out my exact NYSEG bill using data from their website. Being a novice at web interaction, I managed to figure out how to download webpages with this code:

 

#include <iostream>
#include <fstream>
#include <string>
#include <tchar.h>
#include <urlmon.h>
#pragma comment(lib, "urlmon.lib")

using namespace std;

int main() {

	ifstream fin;
	string tempString;

	// Download the page to a local file; URLDownloadToFile returns an HRESULT.
	HRESULT hr = URLDownloadToFile(NULL, _T("http://www.nyseg.com/SuppliersAndPartners/pricingandtariffs/electricitytariffs/transitionchargestatements.html"), _T("nyseg.txt"), 0, NULL);

	if (FAILED(hr)) {
		cerr << "Download failed, HRESULT = 0x" << hex << hr << endl;
		return 1;
	}

	fin.open("nyseg.txt");
	if (!fin.is_open()) {
		cerr << "Could not open nyseg.txt" << endl;
		return 1;
	}

	// Print the first 20 lines of the downloaded HTML.
	for (int i = 0; i < 20 && getline(fin, tempString); i++) {
		cout << tempString << endl << endl;
	}
	fin.close();

	return 0;
}

 

So as long as I had the URL I could download pages and avoid actually interacting with them, until I noticed something I think is called a postback. This page, https://ebiz1.nyseg.com/cusweb/opcosupplyprice.aspx, is a tool for their prices, and when you submit the form the page updates with new information yet the URL doesn't change. I spent a few hours looking at the code and researching and I still don't understand it. It uses JavaScript in some way, and I don't know that language. I see this code in the submit button:
 

onclick="javascript:WebForm_DoPostBackWithOptions(new WebForm_PostBackOptions(&quot;btnPage1&quot;, &quot;&quot;, true, &quot;&quot;, &quot;&quot;, false, false))"

I simply want to know how to download the updated page so that I can extract the information I need.

-Thanks


You can use the Network tab in your browser's dev tools to see where and what data are sent.

 

If the site you're trying to crawl doesn't require you to log in, that's one problem less. But if it does, you need a cookie jar to persist the session cookies. I don't know the capabilities of urlmon, but I know the curl library is capable of this.

 

I cannot check the site; it is unreachable for me.


The URL doesn't change because the page uses an HTTP POST request to retrieve the information, as opposed to an HTTP GET request, which would look like this:

http://website.com/page.aspx?parameter=value&parameter2=value

You either have to perform the POST request manually, or interact with the page and do the postback event, then download the updated page.

