#perl - Thu 19 Apr 2007 between 00:00 and 00:26

NY Lost Funds



Caelumeval: $_ = "box_8_2007-04-01_02-36-32.522.wav"; my ($y, $M, $d) = /(\d{4})-(\d\d)-(\d\d)/; [$y, $M, $d]
buubotCaelum: ['2007','04','01']
cosit seems to just be there to make the regex engine do some extra pointless work, without changing what it will match
keramhmm
MortuisHow do you make a perl script wait or pause for a second? I'm trying to avoid hitting a server more than five times per second.
f00li5hMortuis: sleep
Mortuisthanks
f00li5hMortuis: but which libs are you using?
there may just be an option for it
MortuisLWP
f00li5hare you screen scraping?
Mortuisyeah, my wife wants a backup of her livejournal
f00li5hMortuis: your wife has a live journal?
Mortuistheir bot policy asks not to hit them more than 5 times per second.
f00li5hdo you hit her?
do they have an API (livejournal, not your wife - although i'm sure her API is quite nice)
Mortuislol no, she'd kick my ass
f00li5hdoes live journal use some kind of markup lanugage?
like BBcode style?
Mortuisthey have an api, but this seemed easier
f00li5hno, it won't be
Mortuisit's all html, they have their own tags but they get translated
f00li5hscreen scraping should be a last resort
because if you scrape the generated HTML,
- you loose information
- you can't post the code back
- you have to parse the html
(don't use a regex!)
'ya see? it's not worth the pain
Mortuishmm
f00li5hnow, had the sight contained buisness critical data, and lacked an API, sure
confoundwell said
f00li5hconfound: ^_^
Caelumif they have a SOAP or XML-RPC api, it would be trivial
f00li5hMortuis: sorry about the rant, i've only just got out of bed
MortuisI think they do have XML-RPC
Maybe I'll check that out
cfeddethere are some pretty good CPAN modules for mucking with that.
but scraping is a good first aproximation.
CaelumSOAP::Lite, which contains XMLRPC::Lite

Page: 2 9 16 23 30 37 44 51 58 65 72 79 86 93 100 

IrcArchive

NY Lost Funds