Extracting a line from html

i have some html code and i am after the lines that start with “br”. can i use tokenizer with separators longer than one character ?

extract html line.zip (1.8 kB)

if it was nice html the text would be in between “p” or “div” so you could use xpath to get everything inbetween these tags.

in your case you can use regexpr (string) with the query: {CODE(ln=>1)}(lt;)br(gt;)(.*?)$^
(substitute the lt; with the opening bracket… didn’t find a way to display that in the wiki)

though it will only return everything that is between the beginning tag and the end of the line

try and the equivalent closing tag

seems it doesn’t work in the vvvvorums.

… no text …