Extracting Information From HTML?

Oct 2, 2009

The method I'm currently using to extract html and the parse is via a WebBrowser control.I'm grabbing a collection of tags, sorting through the ones I want, and then pulling the innertext.Doing this on my development machine is kind of slow, but manageable. At max, I can go through 60 different web pages across 3 different sites. It takes about 5 mins on my machine.

However, this app is targeted towards machine that have a quarter of the technology that my computer has. So, it takes anywhere from 10-15 minutes. This is less than ideal.I've bee reading up on the documentation on HTTPWebRequest. However, form what I've seen I can't seem to find a way to grab the HTML and then parse it with a method from HTTPWebRequest.

Due to the way I'll be using it, I won't be able to use RegEx to parse the data and was hoping there was a method within the HTTPWebRequest property to do it.

View 2 Replies


ADVERTISEMENT

VS 2008 Extracting Information From HTML?

Oct 2, 2009

The method I'm currently using to extract html and the parse is via a WebBrowser control. I'm grabbing a collection of tags, sorting through the ones I want, and then pulling the innertext.

Doing this on my development machine is kind of slow, but manageable. At max, I can go through 60 different web pages across 3 different sites. It takes about 5 mins on my machine.

However, this app is targeted towards machine that have a quarter of the technology that my computer has. So, it takes anywhere from 10-15 minutes. This is less than ideal.

Does anyone know of any other method that I could do that would take less resources and perform a lot quicker?

View 7 Replies

C# - Extracting Inner Text From HTML BODY Node With HTML Agility Pack?

Jul 27, 2011

Need a bit of help with HTML Agility Pack!Basically I want to grab plain-text withing the body node of the HTML. So far I have tried this in vb.net and it fails to return the innertext meaning no change is seen, well atleast from what I can see.

Dim htmldoc As HtmlDocument = New HtmlDocument
htmldoc.LoadHtml(html)
Dim paragraph As HtmlNodeCollection = htmldoc.DocumentNode.SelectNodes("//body")

[code]....

I have tried this:

Return htmldoc.DocumentNode.InnerText

But still no luck!

View 1 Replies

Extracting Information From Text?

Jul 4, 2011

I have the following text, it displays like this in a textbox, but is all stored in one string:

[Code].....

But I cant think of a way to do it, the only thing I can think is to try and grab the text on the line after each "+CMGR:"

View 2 Replies

Extracting Partial Information Of XML?

Feb 19, 2011

I have a lot of information distributed among some tables in a database (and must likely it will all be represented into oop). I will need to create text files with different formats from the same data depending on the request. My idea is to create a design that will require no coding if such formats are to change of be added.I was thinking in using some sort of XML magic trick to accomplish this but I have no idea how to. I am also open to new ideas.

View 1 Replies

Extracting Information From Text Files

Feb 5, 2011

I'm trying to identify a specific string within a text file, and then extra a couple of bits of information that follow from that.

Example snippet from text file:
>Date 5.3: Tech Manual {30201}
Freya Plumes (30201) - 1
mu

These are made from brightly coloured. I want to be able to find where it says "Tech Manual", and then assign "Freya Plumes", "30201" and "1" to variables for manipulating and exporting to another file. There will be many instances from the same text file with similar information, so I'd need the extracted strings popped into an array for later referencing.

View 7 Replies

Extracting Specific Information From A Webpage

Feb 25, 2009

I want to extract a specific information from a webpage.For example:url...So can I make the software see what's between "<h1>Your IP address is<BR>", to "</h1>"?And doing all this by using the webbrowser form in Visual basic studio 2008.

View 1 Replies

VS 2008 Extracting Information From A Webpage?

May 10, 2009

I am trying to take specific information from a web page and then process that information so that it can be sent to a label or text box.Previously, I had wanted to do this by using Regular Expressions. I've looked around and it seems that using regular expressions to parse information isn't always the best way because website aren't always coded to standards.Regardless, learning regex isn't working out too well for me. So, I was wondering if there was another way to do this?I was thinking that I might be able to use the web browser control. It would be ideal to be able to see the page in the form, select the information, and then be able to display it.

View 8 Replies

VS 2008 Extracting Information From Websites

Aug 16, 2009

I'm currently using a webbrowser control to extract information from multiple websites. At any given time, my application could be searching through 1-80 different webpages across 1-4 different websites. On my computer, it runs pretty quick. I could go through all 80 sites in just a few minutes. But when on another machine with fewer resouces, it runs awfully slow. Taking up to 15 minutes. Which kind of beats the purpose, but not entirely, of my application.

I'll begin by explaining my methods, and go from there. When utilizing the Webbrowser control, I use While statements and Application.DoEvents, instead of the Webbrowser controls's DocumentCompleted event. I'm wondering if this is one thing that could be slowing down the process. Though I'm not sure why it would be. Also, would utilizing separate threads help with this? I can't imagine it would boost performance. But I'm not entirely sure. I think it bogs down because the application uses so much memory when running. On idle, the application uses about 9.5k memory. When running, I've seen it go as high as 110k.

View 4 Replies

VS 2010 - Extracting Information From Website

Dec 20, 2010

I want to extract some information from a web site (Bloomberg). This side has a search textbox with a onKeyUp event that, as far as I can understand, uses AJAX to create a table. Is it possible to insert some text in the textbox and then get the table created with AJAX? Given the source code of the page with the tabled already AJAXed (If you understand what I want to say here), I can easily retrieve the table. I just need to get the source code with the table already on it.

View 7 Replies

Extracting EXIF Information/tags From Photos?

Aug 18, 2010

I'd like to look into extracting EXIF tags/data from jpg images. I've looked around but can't seem to see any simple example. Anyone able to point me into the direction of pulling out the model/date values etc out of EXIF data?

View 2 Replies

VS 2008 Extracting Information From 'database' (?) Behind Website?

Feb 3, 2010

My university has a website that students can use to check stuff like their lesson schedules, latest grades, exam times, etc. Just general information about you and your study (after you have logged in of course).For interested, this is the website (not much use without a login obviously, but still): [URL]I have been thinking lately that it would be nice to have a desktop application that could give me the same information. However, I have absolutely no idea how that would work, and whether or not it is even possible. I mean, technically it should be possible for me to get the information (how else does the website get it) provided it is not secured (even when using the correct login details). I am sure the website communicates with a database somewhere. Does that mean I could theoretically also communicate with the database (probably: read from it only, not write) for example by executing queries?If that is possible, how do I figure out which queries to use for the different pieces of information? Obviously I don't know the details of the database so I can't just send a random query and expect it to work...

View 8 Replies

VS 2008 Extracting Specific Information From The Registry?

Nov 1, 2009

I'm trying to get the version of IE and Google Chrome from the registry. I've successfully been able to get IE, but it seems chrome is eluding me.The following code is what I'm trying:

vb.net
Public Sub ChromeVersion()
Dim SoftwareKey As String = "HKEY_CURRENT_USERSoftwareGoogleUpdateClients{8A69D345-D564-

[code].....

View 2 Replies

Extracting Javascript From Html Using .net?

Dec 24, 2009

I am having trouble extracting links from a web page and it involves javascript running on a website that contains the url links i need. I have encountered this issue before and was able to resolve it by getting the javascript file and then parsing it for the links. However this time I am unable to find the javascript that contains the links.To better clarify here is the source code segment from the web page:

<li><a href="javascript:void(0);" onclick="javascript:changeViewMode('Detailed', false); javascript:getTopTenStatsData(this, 'PlayerStats','HR'); javascript:changeMainCategoryLinkClass('liPlayersLink'); ">Highest-Rated Members</a></li>
<li><a href="javascript:void(0);" onclick="javascript:changeViewMode('Detailed', false); javascript:getTopTenStatsData(this,

[code]....

View 7 Replies

VS 2010 Extracting A HTML Value

Jul 26, 2011

I have found a good amount of threads, but I don't know what my problem is. I cannot seem to get it. It should be simple but I cannot figure it out. Here is the HTML

[Code]....

View 3 Replies

Extracting Table From Html Into Htmltable In B (htmlagilitypack)?

Sep 22, 2011

I am trying to grab a html table from a remote page and display the contents of this table in a htmltable on my site. I am using htmlagility pack. So far here is my code:

Imports HtmlAgilityPack
Partial Class ContentGrabExperiment
Inherits System.Web.UI.Page

[code].....

View 1 Replies

Extracting Specific Text From Loaded Html Source?

Oct 19, 2009

I'm trying to extract the text fields inbetween the code but the text is always changing so I'm not sure how to keep this dynamic. In put them in to the proper text boxes.

So text box 1 might be Date: then it pulls the date.

and there are multiple listings. so I need it to loop until the end of </table>

[Code].....

View 10 Replies

VS 2008 Extracting / Parsing Text From HTML Source

Jun 1, 2011

[code]The two parts I've coloured red change, I need to grab the first part which is the link but I'm not sure how to do this. I've used regex before and it doesn't look possible to use it on this on this, there's about 25 of these in the source.

View 11 Replies

Visual Basic 2008 Extracting Div Tags, Extracting Title Tags, Extracting Keyword Tags, Parsing Div Tags?

Nov 7, 2009

I was just wondering how to extract or parse any particual tags (whichever I specify) from webpages. I know how to extract text and links from webpages, but I tried to use the same method from the following code for div tags, title tags etcetera and it doesn't seem to work:

[Code]...

View 2 Replies

Get Information From HTML?

Jun 15, 2009

I need to retrieve information from a table in a HTML code.

The page i need to retrieve info from is not mine so i can't change the HTML code to make it solve my problem.[code]...

View 6 Replies

Get Information From A String Html?

Jan 29, 2011

How do I get information from a string html?

I am working on a megaupload downloader and I can not take information from a string, that is this: <span class="down_txt2"> UltraMU.rar </ span> It would be the name of the file that I want to download. I tried with this

Private Sub WebBrowser1_DocumentCompleted (ByVal sender As System.Object, ByVal e As System.Windows.Forms.WebBrowserDocumentCompletedEv entArgs) Handles WebBrowser1.DocumentCompleted

[Code]...

When I go to try it gives me the error and I can not fix it You could write the code to be able to take the name of the file, in this case "UltraMU.rar?

View 1 Replies

Html - Pass Information To VB Program

Mar 5, 2012

I am making a download manager in VB 2010. I need to be able to pass the download link from an HTML page to the VB Application. This is what i think i need to do in the HTML:

[Code]...

View 1 Replies

Retrieve 2-3 Pieces Of Information From A Parsed Html File?

Jan 1, 2010

What i'm trying to do is retrieve 2-3 pieces of information from a parsed html file, then add the data into a listview in the corresponding columns i was initially going to do a function for each piece of information to retrieve but thought maybe i could do it all with 1 function instead.

[Code]...

View 3 Replies

VS 2008 Regex - Extract Information Between Two Tags In Some Html From The Source Of A Website

May 24, 2009

what i am trying to do is extract information beween two tags in some html from the source of a website. The contents of the text between the two tags will always be different. the code i currently have is;

[Code]...

View 12 Replies

Extracting Url Source Code ,url Not Extracting All?

Apr 25, 2012

I have been working lately on a program who extracts URl Source codes!The program does work with most of URL but not for MEdiafire URL!When i check the source code from the web browser i can see there is some code missing;y tried diferrent types of Encoding.

Example:This is the final source code extracted from WEBBROWSER(Firefox,InternetExplorer,GoogleCrome)

--------------------------------------------------------------------------------------------
<div class="mf_lightbox_btns lb-footer" style="text-align: right;">
<a href="javascript:void(0);" class="secondary btn" onclick="$('body').removeClass('has-virus'); return false;">Dismiss Message</a>
<a href="http:www.bitdefender.com/mediafire/fix-it.html" target="_blank" class="alt btn">Get BitDefender</a>

[code]....

View 1 Replies

Parse Live HTML From A Website And Extract Specific Information And Store It Into A Database With Visual Basic?

Dec 30, 2011

The info i need extracted is formatted:

<TD><A HREF="http://xxxxx.com/xxxxxx/index.html"><IMG SRC="../xxxxx/thumbnails/xxxxx.jpg"> </A></TD>
<TD>=== <B><A HREF="http://xxxxxxxxx.com/xxxxxxxx/index.html">LINE 0</A></B> ===<BR>
<FONT SIZE="2" COLOR="#400080">

[code]....

how do i extract the info between TD=== and /a and Line 1,2, and 3 and store it into a database from a live website?

View 2 Replies

Html Source Code Doesn't Show Html But In Firebug Inspect Element Html Is There?

Jan 10, 2012

This may sound really stupid but I have to ask cause I'm not finding this answer anywhere.I have an application where the user will need to sign up for a new user account on the website [URL]..However when I am using Firefox's plug-in Firebug to view html I am getting something totally different than when I just right click on the site and view the page source.

What I am trying to do is to get the captcha from the website and display it in a picturebox on the application so the user can view the captcha, solve the captcha and then the app post is back to the service for a response.

Here is the source that I am getting using Firefox's Firebug to inspect the element:

<td>
<input type="hidden" value="Oo3Jo1I8bgzK68agMqo3s79ZZib2OkbK" name="iden">
<img class="capimage" src="/captcha/Oo3Jo1I8bgzK68agMqo3s79ZZib2OkbK.png" alt="i wonder if these things even work">
</td>

[Code]...

Why would the two be showing me two different versions of the HTML?

And how would you be able to grab that source to view in a picturebox using webclient?

View 2 Replies

Sending An HTML Email, Where The HTML Comes From An HTML File .Net/ClickOnce Environment?

Jun 20, 2009

Usage: Users create pretty HTML news letters in another app. They post the newsletter to the web, but they also want to set the contents of the HTML news letter file as the body of an email and send it using Application In Question. The users understand to use absolute link and image references when sending an E Newsletter. Environment:

AIQ is a VB.Net app deployed via ClickOnce. It is an intranet app; one can be sure MS Office 2003 and the interop 11 dlls are on the target machines.

Restrictions: MAPI is out. It mangles the HTML. Since it is a ClickOnce deployment, we can't register dlls (I think, correct me if I am wrong). Therefore CDO and COM is out (again, I may be wrong.... I would be happy to be proven so).

View 1 Replies

OLEDB Connection - Pull Information From Three Different Database Tables And Compiling The Information Onto One Screen

Jun 9, 2011

The program is currently trying to pull information from three different database tables and compiling the information onto one screen, to do this i am using three different connections and a function to handle the data at each stage. The issue is as my code hits PageLoadStage of "2" and runs the QueryDatabase() Function i am encountering an error of: "OleDB exception unhandled: No value given for one or more of the required parameters." This occurs on line 15 of the first code snippet. Can anyone explain this and/or identify any code issues if that is the cause.

[Code]...

View 8 Replies

Method Which Check If A Certain DSN Exists Then If It Does Extract Information And Use Information For Connection Purposes

Feb 2, 2011

I was looking for some stuff online, and i was looking through my reference sheets for vb.net, but it seems in the code i have found, there is declare in methods. Below is an example i am talking about. The thing is, that i dont recognize it and believe that it is an earlier version of Visual Basic ( 6 or earlier) which used it. Should i just keep looking for different code?

Public Declare Function RegOpenKey Lib "advapi32.dll" Alias "RegOpenKeyA" (ByVal hKey As Long, ByVal _
lpSubKey As String, phkResult As Long) As Long

Goal: Find information on how to implement a method which will check if a certain DSN exists, and then if it does, extract the information and use the information for connection purposes.

View 2 Replies







Copyrights 2005-15 www.BigResource.com, All rights reserved