Correctly Matching Ending Tag With Its Starting Tag In HTML With A RegEx
May 29, 2009
I'm using VB.Net in an ASP.Net 2.0 app to run some regular expressions that remove some unnecessary markup. One of the things that I'd like to do is remove span elements that don't have any attributes in them:[code]I'd like to remove the outer span elements. Unfortunately, my regex above gives me this as a result, since the closing span matches the first one it comes across:[code]
View 4 Replies
ADVERTISEMENT
Sep 14, 2009
This is a follow up to another question of mine. The solution I found worked great for every one of the test cases I threw at it, until a case showed up that eluded me the first time around.My goal is to reformat improperly formatted tag attributes using regex (I know, probably not a fool-proof method as I'm finding out, but bear with me).
My functions:
Public Function ConvertMarkupAttributeQuoteType(ByVal html As String) As String
Dim findTags As String = "</?w+((s+w+(s*=s*(?:"".*?""|'.*?'|[^'"">s]+))?)+s*|s*)/?>"
[code].....
View 2 Replies
Jun 6, 2011
it was regarding in vb.net text operations, i want to replace the text by finding some particular text.but the problem arises when the starting and the at the end delimiter of the string is something same every-time but middle thing keeps on changing so how can i work out with that?? str = Replace(str, "abcdparameter = xyz123;", "abcdarameter = my defined parameter;")
i tried to learn some regular expressions but i am not able to get that.the xyz 123 is same every-time....and also there is one delimiter at the end of the parameter string so anyhow you can select the whole line and replace it.
View 2 Replies
Nov 2, 2009
I want to make a very small program where the user inputs a starting time and an ending time that is in "clock time (?)" and the program converts it to total minutes in one box and a decimal total in a second box. I have 4 textboxes, 2 are masked text boxes (military time) for input and two are not, for output and the math is coming out weird.
If I put a start time of 0925 and an end time of 1015, the results I'm expecting would be 50 total minutes and .83333 total as a decimal. I don't think I'm converting the textbox input correctly as I'm not getting anything near that. I am using a button to calculate and another to clear the boxes, just FYI. Also, I'm not sure what to do about any times that span the noon hour except to use an absolute value.
Public Function Time()
Dim StartTime As Double
Dim Endtime As Double
StartTime = txtStart.Text
Endtime = txtEnd.Text
txtDecimal.Text = (Endtime - StartTime) / 60
txtMinutes.Text = Endtime - StartTime
End Function
View 11 Replies
Nov 30, 2011
I found out that I can start any external executable using:
[Code]...
View 4 Replies
Jun 1, 2010
I have a two buttons: Previous Week & Next Week When i run the app the LabelDate needs to shows up "May 31, 2010 - June 6, 2010" When i click a previous week button i want to set LabelDate's text to "May 24, 2010 - May 30, 2010" If i click Next Week button i want it shows up "June 7, 2010 - June 13, 2010" Means it should always display previous/next week starting from Monday and ending with Sunday.
View 4 Replies
Mar 23, 2010
My look-behind: (?="","") is not working. This is due to the fact that I use .+ before it because I am parsing messages in a chat room where the users can literally type anything. The return is as follows for an example:
[Code]...
This depicts the beginning which contains the chat topic plus two messages. how exactly I would go about parsing the message out?
View 2 Replies
Dec 31, 2010
I'm trying to extract a piece of text from within the string, defining both a starting point and an ending point. For instance, say I want to search for the text found between "That is a " and " car" in the string "That is a red car", so it would return "red".I know you can use .SelectionStart and SelectionLength, but the length of the word can change, so the selection length can vary. What I want is to establish a specific starting point and a specific ending point from which to extract the text from the string.
View 5 Replies
Apr 16, 2009
how would i get this USERNAME, USERID from this string:
Code:
<a href="javascript:attackWindow('USERNAME','USERID','50','RANDSTRING', 'RANDSTRING2')">
Adding the USERNAME to listbox, and then USERID into an array separated by a space.
View 2 Replies
Mar 30, 2011
How ever it is returning wrong results. Any one spot the mistake. Needs to return the username / ID from every online user from [code]...
View 3 Replies
Apr 17, 2009
i'm really new at the VB language only been doing for around 2 weeks.I hoping someone can help me with this, how would i get this USERNAME, USERID from a string
Adding the USERNAME to listbox, and then USERID into an array separated by a space. - I no how to do this part just the regex above remember i've only been doing for two weeks so i'm not very knowledged at this.
View 11 Replies
Jun 21, 2011
Regex in VB2008.I have a file system directory that my code is watching. When a file is placed in that directory my code kicks off a processing application. Based on the filename structure, the code will perform different processes. So I need to match the filename structure specficially to get the correct processes to run on that file.My file name structure is: IK2YYYYMMDD_VV.e;where YYYYMMDD is well, the year, month and day. And VV is a verison number from 00-99 If the new file saved in the directory match this structure then I want a TRUE returned. From a filewatcher I get the filename as an "e.Name" (FileSystemEventArgs) variable. My current code to see if it matches the baseline structure is;
Dim IKPred2Match as New Regex ("^IK2.*\.e$") Dim ValidIKPred2Match as MatchCollection = IKPred2Match.Matches(e.Name)I've tried many different regex match patterns, and just can't seem to find the magic one.
View 2 Replies
Jul 1, 2011
I need an extremely simple regex pattern that matches anything within square brackets [].
Example:
This is a [test] text with [some] [examples].Required results: [test], [some], [examples].I don't want to get rid of the brackets, they should be included in the result (that should make it easier, right?)
If possible I'd like to exlude results that have whitespace in them, so things like [this string] should not be matched. If that gets too hard (probably not, but hey I can't even manage this :S) then I don't really mind, I can always check if the result contains whitespace myself.
I tried a lot of examples I could find online and ones that I could think of myself but none work. Note: I've been testing with the Visual Studio Find dialog (checking to use regular expressions obviously), maybe that's where the problem lies, because I don't really understand why some of these don't work...
Anyway, first thing I tried was simple: match the two brackets and one or more characters in between (that should be a +, right?)
View 4 Replies
Dec 3, 2010
I want to use Regex.Replace to replace a Match with a context sensitive value. I use the MatchEvaluator for this. But to make things simpler lets say I'm matching %v I want %v to be escapable, so if I use \%v it will not match. Anything else should match.
The pattern I came up with is this: (?:[^\]|^)%v It basically matches %v if it occurs at the beginning of the string, or if it follows any character except . It doesn't capture the first part of the expression.I know this isn't the "right" way to do it. But it worked just fine until I noticed that when I use this pattern in a replace, it includes the character before %v in the replacement(duh, right?)So, if I have ThisIsAValue:%v and I do Regex.Replace, replacing with the string Value, my result will be ThisIsAValueValue instead of ThisIsAValue:ValueI've tried googling this but the fact that "escape character" is so heavy in RegEx, all the results are geared towards the USAGE of escape characters instead of picking them out with a pattern.
View 1 Replies
Sep 5, 2009
I've been working with Regex and I've run into something that doesn't make sense to me; I either misunderstood how the Multiline option works, I'm not using it right, or it really just doesn't work like it's supposed to. This is a little test I made:
[Code]...
View 18 Replies
Feb 16, 2011
I am trying save a value from an input tag in some HTML source code. The tag looks like so:
<input name="user_status" value="3" />
I have the page source in a variable (pageSourceCode), and need to work out some regex to get the value (3 in this example). I have this so far: [Code] Which works fine most of the time, however this code is used to process source code from multiple sites (that use the same platform), and sometimes there are other attributes included in the input tag, or they are in a different order, eg:
<input class="someclass" type="hidden" value="3" name="user_status" />
I just dont understand regex enough to cope with these situations.
View 2 Replies
Jun 5, 2011
I am using vb.net to parse my own basic scripting language, sample below. I am a bit stuck trying to deal with the 2 separate types of nested brackets.
Assuming name = Sam
Assuming timeFormat = hh:mm:ss
Assuming time() is a function that takes a format string but
has a default value and returns a string.
[code]....
I could in theory change the syntax of the script completely but I would rather not. It is designed like this to enable strings without quotes because it will be included in an XML file and quotes in that context were getting messy and very prone to errors and readability issues. If this fails I could redesign using something other than quotes to mark out strings but I would rather use this method.
Preferably, unless there is some other way I am not aware of, I would like to do this using regex. I am aware that the standard regex is not really capable of this but I believe this is possible using MatchEvaluators in vb.net and some form of recursion based replacing. However I have not been able to get my head around it for the last day or so, possibly because it is hugely difficult, possibly because I am ill, or possibly because I am plain thick. I do have the following regex for parts of it.
Detecting the parentheses: (w*?)((.*?))(?=[^(+)]*((|$))
Detecting the square brackets: [[(.*?)]](?=[^[+]]*([[|$))
View 2 Replies
Sep 6, 2010
im trying to find a way of resizing a html document to fit inside my browser correctly.
I would like the width of the page to be the same width of the browser. I can scroll down for the rest. I have no idea on how to do this. Ive found some bits and pieces on the net but nothing is working for me.
Private Sub WebBrowser1_DocumentCompleted(ByVal sender As System.Object, ByVal e As System.Windows.Forms.WebBrowserDocumentCompletedEventArgs) Handles WebBrowser1.DocumentCompleted
[Code].....
View 2 Replies
Mar 9, 2011
I'm currently writing a library where I wish to allow the user to be able to specify spreadsheet cell(s) under four possible alternatives:
A single cell: "A1";
Multiple contiguous cells: "A1:B10"
Multiple separate cells: "A1,B6,I60,AA2"
A mix of 2 and 3: "B2:B12,C13:C18,D4,E11000"
Then, to validate whether the input respects these formats, I intended to use a regular expression to match against. I have consulted this article on Wikipedia:
Regular Expression (Wikipedia)
And I also found this related SO question:regex matching alpha character followed by 4 alphanumerics.Based on the information provided within the above-linked articles, I would try with this Regex:
Default Readonly Property Cells(ByVal cellsAddresses As String) As ReadOnlyDictionary(Of String, ICell)
Get
Dim validAddresses As Regex = New Regex("A-Za-z0-9:,A-Za-z0-9")
[code]....
1. Is my regular expression correct? If not, please help me understand what expression I could use.
2. What exception is more likely to be the more meaningful between a FormatException and an InvalidExpressionException? I hesitate here, since it is related to the format under which the property expect the cells to be input, aside, I'm using an (regular) expression to match against.
View 5 Replies
May 20, 2010
I have some update panels on my page that do some asyncpostbacks to keep some dropdownlists correctly populated. My problem is that on my page i have an HTML input that is handling some file uploads. With the AJAX on the page with asyncpostbacks, and while i step through my code behind, the files arent being uploaded. Using a postbacktrigger (non-async) is not possible because of my layout.
Here is my code:
<div id="divFileInputs" runat="server">
<input id="file1" name="fileInput" type="file" runat="server" size="50" style="width: 50em"
onfocus="AddFileInput()" class="textbox" /></div>
[code]....
When I put a breakpoint in at the declaration of service and then check the value of "files", the count is 0. I am expecting it to be 2 when i have one file uploaded.
View 1 Replies
Feb 17, 2012
i'm trying to get some information of a webpage via regex on visual basic 2010
it's something like this:
<SPAN CLASS="clear"></SPAN>
<h2> blabla </h2>
<h2> blabla </h2>
<b> blabla </b>
[Code]...
View 1 Replies
Jul 14, 2009
i have this
Dim wc As New System.Net.WebClient()
Dim p As New System.Net.WebProxy()
Dim test As String
wc.Encoding = System.Text.Encoding.GetEncoding("utf-8")
p.Credentials = System.Net.CredentialCache.DefaultCredentials
wc.Proxy = p
[Code]...
View 7 Replies
Jun 11, 2009
How would I use Regex to extract the body from a html doc,taking into account that the html and body tags might be in uppercase, lowercase or might not exist?
View 3 Replies
Mar 9, 2011
I need to remove all quotes from an image tag found within lots of other text. For example, I want to make
<img src="folder/image.gif" target="_blank" />
into
<img src=folder/image.gif target=_blank />
I'm using vb, and need to use a regEx specifically for the img tag and not use replace. The img tag can be in a block of other text, so I need to use regEx to search for the <img and then within that until I meet a /> I need to remove all quotes.
View 5 Replies
Dec 5, 2010
I want to get tags content in a string with regular expression. I wrote it for just one line. When the content changed into some lines from one line, Regex will never do pattern on the tag. I choose RegexOptions.Multiline + RegexOptions.Singleline for finding options.My pattern in low level: (>)[ a-z A-z 0-9 ]*(</)
View 2 Replies
Apr 4, 2011
what would the regEX code be for the following:
<br/><span class=""synopsis-view-synopsis"">America's justice system comes under indictment in director <a href='/people/1035' class='actor' style='font-weight:bold'>Norman Jewison</a>'s trenchant film starring <a href='/people/1028'
[code].....
View 3 Replies
Dec 19, 2010
I am trying to remove the tables within an HTML file, specifically, for the following document, I'd like to remove anything within the tags <TABLE....> and </TABLE>. The document contains multiple tables with texts in between.
The expression that I came up with, <TABLE.*>s*[s|S]*</TABLE>s*, however would remove the text in between the tables. In fact it would remove everything between the first <TABLE> and the last </TABLE> tags. I would like to keep the texts in between and only remove the tables.
[Code]....
View 2 Replies
Jun 25, 2009
I have a long string of HTML code that contains several quotes around objects in the string. I have a Regex.Replace() function to change the "img src" tag, due to the way its stored in a database.
Here is my string: <p><strong><u><font color="#cc0099">RICH TEXT BOLD UNDERLINE. PICTURE TO APPEAR BELOW</font></u></strong></p><p><strong><u><font color="#cc0099"/></u></strong></p><p><img src="/inlineimages/WorkOrder/6/1245981403232.jpg"/> </p><p /><p>W00T!</p>
[Code].....
Notice how the "<font color=" has single quotes around its value, and "<img src=" has single quotes, with a double quote jammed in between?
Also, is there an automated way to go from my initial string, to the "Dim input As String" I created? I had to manually type that out, editing the quotes just to try and make it work.
View 6 Replies
Apr 3, 2011
i need help parsing html using regex..i am hardly find the exact expression to use.
[Code]...
View 2 Replies
Apr 8, 2010
Im trying to make an appliaction that tells the user his/her location depending on the URL
Im using this site to get the information :[URL]..Im having trouble with the tags that are always changing.
What i have so far:
[Code]...
View 1 Replies