February 24, 2003 - Posted by André Restivo- 1 Comment
title="Ready.gov - From the U.S. Department of Homeland Security"
href="http://www.ready.gov/index.html">Ready.gov: The US
government made this nice site so we can all know what do to in
case of a terrorist attack. Or was it so that we can all laugh a
little bit?
The slogan (Helping America prepare for fiery death), the
very elucidative images and the little boy trying to eat his own
hands on the top banner, everything seems to be a parody to the US
paranoia. But no, it’s real. The parody it’s
href="http://www.idlewords.com/biological.html">here.
Update: A better paody is
href="http://www.unready.net/">here.
February 24, 2003 - Posted by André Restivo- 0 Comments
Blogs
em .pt: A nice compilation of Portuguese blogs.
February 24, 2003 - Posted by André Restivo- 0 Comments
href="http://us.imdb.com/Title?0294252">2009 Lost Memories
(2002): What if Japan had won the war, what if Germany had been
nuked instead of Japan and Korea was still part of Japan’s empire.
This is a very violent movie with lots of action and with an
important historical side to it all.
This was the third Korean film I have seen in this Fantas and I
must say I’m impressed with the overall quality of Korean films.
Last year I already saw
href="http://us.imdb.com/Title?0255589">Seon (The Isle),
another Korean movie, that besides being a shock movie had also one
of the best photographies I have ever seen.
February 24, 2003 - Posted by André Restivo- 0 Comments
href="http://www.daypop.com/burst/">Daypop Top Word Bursts:
Damn, Daypop was quicker than me
From Daypop’s
href="http://www.danchan.com/weblog/daypop/54957">weblog:
Shortly after putting up the Top 40, I realized there
were plenty of memes that make the rounds that weren’t accompanied
by links. Either the meme was a topic of conversation that had no
link, or it was a meme that had no single, authoritative link.
Catching heightened word usage is a natural extension to the Top
40. I kept the idea on the back burner until very recently, when I
realized Joe Millionaire was the buzz, but there were no
authoritative links to anchor the meme. There was the one link to
the Joe Millionaire site but few bloggers linked to it when writing
about the show (in spite of this, that link still made the Top 40).
I went to sleep that night thinking about implementing what
everyone now calls Word Bursts. The next morning (can you believe
it?) I get emails about the article in New Scientist about Word
Bursts. I also read it in Slashdot and it eventually makes the Top
40. Well, that got me working this weekend on this: Daypop Top Word
Bursts It’s catching topics that don’t have authoritative links. It
also catches those that do. There are sample posts from weblogs so
that you can get a decent idea of what the word burst is in
relation to.
Nicely done!
February 24, 2003 - Posted by André Restivo- 1 Comment
As i said
href="http://www.andrerestivo.com/weblog/archives/001836.html">here,
I decided to give a go to the word burst idea I saw on New
Scientist. I’m going to use
href="http://www.R-project.org">R to build a prototype (just to
test the concept).
R is a language for statistical computing and is both powerfull and
confusing. At the moment Iam able to anaylze the freqs of a single
weblog using the following code:
hp <- htmlTreeParse('http://www.scripting.com/')
html.elem <- unlist(hp$children$html$children)
text <- html.elem[which(regexpr("text.value",names(html.elem)) > 0)]
names(text) <- NULL
text2 <- paste(text,collapse=" ")
wrds <- strsplit(text2," ")
wrds <- sapply(wrds,tolower)
wrds <- gsub("[,.!?;:]","",wrds)
f.wrds <- factor(wrds)
freqs <- table(f.wrds)
sort(freqs)
Applied to Scripting News
this gives the following table of most used words:
- google 22
- have 23
- my 24
- was 25
- with 29
- be 32
- on 43
- it 44
- for 46
- is 50
- in 56
- that 65
- and 71
- of 72
- i 82
- a 105
- to 130
- the 177
Now it’s just a question of joining
href="http://www.technorati.com/cosmos/top100.html">several
important blogs, do the same type of frequency analisys and
compare the results from one day to another. This way we can see
which words have big bursts and which stopped being in the
spotlight.