Tr, Vinux's plot was an intraday graphic. I'm saying take a composite over multiple days so that an hour slice represents some function of the aggregate at that hour on all days. Then any people that have clustering should align closely and all be in one graph. Your multi-day plot appears to have different time scales on the x-axis. The only way to see how people tend to cluster over days would be if the plots were stacked on top of each other with the same scale. Then you can essentially take a slice down the plots to see how people cluster at that time. Trying to evaluate something like that for a month or a year would basically be incomprehensible. Instead, I was proposing an aggregation of the multi-day information into one plot. Each person's outcome is a function of the distribution of the phenomena over multiple days.

Plot1: Clustering based on the time. Clustering based 24 variables ( 0-1, 1-2, ...).
Caution: spy diagrams ahead.
Plot2: I have seen the season package in the latest R journal. We could achieve the same by stars. Plot3. Further drill down of plot2.

The clustering brings timezone matches. Even I change the methods the clusters look similar. In the second graph. It is showing the seasonality. It seems chatbox is very active in the last three working days.

I am thinking of exploring grid graphics. It would be fun adding "SVG + Javascript" to make it interactive.

If anyone can suggest some interesting data, we could all analyze on that.

@BG still not sure if I know what you mean. Is this what you're thinking? Plus should the scaling be done by person or just overall? I did it by person.

Vinux, that star graph is nice! It clearly shows that people talk the most Wednesday through Friday and aren't that active on the weekends, which I know, but I would always think people would be more active! Mainly because that's when I'm more active since I work all day and can't use TS lol

Trinker, that heat map looks good. The scales are all the same now so when I look at a certain point on one graph then I know it's the same time point on the other graph, but a nice transformation of the data now would be to look at a time point (say Dason at 8 PM) across all the days. Maybe just take the median value. Do that for all time points. Then we'd have a composite single graph representing what hour people are the busiest. With Vinux's star graph, this composite can also be done by day, so we can see which hour people are most active on a given day given all the days data we have.

If you're talking about Vinux's it's because you both speak during the same time intervals. However we don't know if it's on the same day. Also Jimmy Brooks has almost no words spoken (see below):

This gives some basic word statistics (ignore sentence because I didn't break it up by sentence; that's more our turns of talk or chat inputs).

Code:

library(qdap); library(talkstats)
dat <- ts_chatbox()
with(dat, word_stats(dialogue, person))

Here's a link to a txt data frame of the stats as it's pretty long: LINK

"If you torture the data long enough it will eventually confess."
-Ronald Harry Coase -

If you're talking about Vinux's it's because you both speak during the same time intervals. However we don't know if it's on the same day. Also Jimmy Brooks has almost no words spoken (see below):

This gives some basic word statistics (ignore sentence because I didn't break it up by sentence; that's more our turns of talk or chat inputs).

Code:

library(qdap); library(talkstats)
dat <- ts_chatbox()
with(dat, word_stats(dialogue, person))

Here's a link to a txt data frame of the stats as it's pretty long: LINK

Trinker, I used the old data for my last graphs, where jimmy was one other top ten. For clustering my idea was to identify the timezone (Just because we have the reference so it can easily comparable). I mean the variable I have created in that way.

I like the graph trinker! I wonder what other aggregation we can do, maybe something that accounts for timezone (since every hour is locally different), and include a scale on the right to show what the continuous variable represents (graphs need to be completely informative, at least in the final analysis).

I decided to showcase the use of images in R graphics.

Lets take a look at the forum's from a spammer perspective. Generally speaking its a dangerous place, with ravenous moderators around each corner waiting to devour their posts and ban them for all eternity. Thus lets see how the TS mods compare in evil (from a spammers perspective) to famous cinema villains.

Using data from the net, on villain body counts and comparing these to the mod ban statistics we can get a mle estimate of how the mods compare to famous villains. This finally give us an opportunity to scientifically prove who is a raptor and who is a (Skynet) bot. [I would want to compare mods to super heroes but there is no way to rank super heroes objectively that I can think of]. In the analysis I assume that ban stats are a Poisson distributed random variable.

So here are the results;

And here is the script (make sure to download the images zip file and set your working directory);

As we can see there are certainly Raptors on the forum, Dragan is as deadly as Chucky, Dason is likely the Predator but cant be significantly distinguished from the T101 (a bot). Quark, like his DS9 Ferengi bar-tender counterpart, is likely an Alien (but still most likely a Raptor)... bugman will incubate his spawn in your abdomen... and funny enough I'm Darth Vader.

So come on over to the Darkside and join me in the power of (wrapper free) R graphics

Note; the mod stats may not reflect actual clean-up contributions of the moderators, as we have an automatic spammer "remove all posts and ban" button.. which does not get recorded in the stats. Some mods may therefore be more deadly than the image suggests

The true ideals of great philosophies always seem to get lost somewhere along the road..

Indeed a Raptor. But it really all boils down towards whether you are a Frequentest or a Bayesian. You see the mode of the likelihood distribution is a Raptor, but Bayesians would be more interested in the mean which is an Alien , I w

Also, as I am Darth Vader, I would like to urge bugman to finish that death star in his avatar. I want it fully operational!

The true ideals of great philosophies always seem to get lost somewhere along the road..