A busy night for me. Midterm exams next thursday :(.
My question; I would really like to be able to get individual “tweets” (damn, i hate that word) of their website analyzing the code. I have found that "tweets are located between “” and “”.My idea was to somehow filter out the tweet. After taking “the scenic route” i finally found that regexpr. would do the job. I did manage to get out the first tweet but i’ve stumbled upon some problems with the remainder. The problem starts with this line when i want to further narrow done results. This is the expression used;
“(=“entry-content”>)(.*)(<a href=)”
Is this statement wrong?
I have a feeling it goes wrong with the “<a href=)-part” but something also tells me it doesn’t have to be since regexpr does’nt take “<a” as input right?
Thanks for any possible help!
FRid
(i’m no Beta-thinker and always flunked at math, so please be gentle :)
Not yet as smooth as i’d want it to be but hey, it works! (if the tweets contain just plain text) I’m happy with that for now and i hope my teachers will be as well. Take a look and let me know what you think because there is probably a lot of improvement possible.
Not yet as smooth as i’d want it to be but hey, it works! (if the tweets contain just plain text) I’m happy with that for now and i hope my teachers will be as well. Take a look and let me know what you think because there is probably a lot of improvement possible.
And it did. Thanks! That really cleaned up the patch. I did look at that solution before but i overlooked the “\s” so i didn’t get any output. I’ve added some filters for when people start linking and stuff.
The Tidy/Xpath-combo will take some more time though but thanks to point it out.
And it did. Thanks! That really cleaned up the patch. I did look at that solution before but i overlooked the “\s” so i didn’t get any output. I’ve added some filters for when people start linking and stuff.
The Tidy/Xpath-combo will take some more time though but thanks to point it out.