Forum

Reading wikipedia pages

hi vvvvlers,

i tried to load a random page from wikipedia (wich is a service from wikipedia) but just want to have the content (no navigation etc).

my problems are:

  • the special characters do not work (even in the HTML renderer)
  • i could not read out the source code between the comments “” and “” with the RegExpr node (may be it is because of the line breaks)

thanks for your help,
nic

load_random_wikipage.v4p (4.7 kB)

i could not read out the source code between the comments “” and “” with the RegExpr node (may be it is because of the line breaks)

uups, the comments are ignored in the vvvvorum.

comments.gif (7.3 kB)

the special chars don’t work inside the renderer cos it’s utf8
convert it to latin0 and you see everything as you wish.
(see patch)

when working with (x/ht/xht)ml style content its easier to use xpath/xslt for extracting. (see patch)

did’t dig into the issue, why the regexpr isn’t working though

cheers
w

load_random_wikipage_w.v4p (13.4 kB)

hi woei,

thanks for your help, thas exactly what i tried but in a different way. the diffenece: your patch works :)

tanks,
nic