<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>CorpBlawg &#187; Robert Scoble</title>
	<atom:link href="http://corpblawg.ynada.com/category/robert-scoble/feed" rel="self" type="application/rss+xml" />
	<link>http://corpblawg.ynada.com</link>
	<description>Cornelius Puschmann on computer-mediated discourse, linguistics, open access and other things that interest him. Now discontinued - see blog.ynada.com</description>
	<lastBuildDate>Wed, 21 Apr 2010 13:10:32 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Scoble more productive than Shakespeare</title>
		<link>http://corpblawg.ynada.com/2007/07/04/scoble-more-productive-than-shakespeare</link>
		<comments>http://corpblawg.ynada.com/2007/07/04/scoble-more-productive-than-shakespeare#comments</comments>
		<pubDate>Wed, 04 Jul 2007 19:04:13 +0000</pubDate>
		<dc:creator>Cornelius</dc:creator>
				<category><![CDATA[Linguistics]]></category>
		<category><![CDATA[Other Stuff]]></category>
		<category><![CDATA[Robert Scoble]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[Web 2.0]]></category>

		<guid isPermaLink="false">http://corpblawg.ynada.com/2007/07/04/scoble-more-productive-than-shakespeare</guid>
		<description><![CDATA[Robert Scoble likes Google better than Microsoft (but not much) &#8211; and I have proof for that. He also holds his wife Maryam dearer than his company PodTech, but sadly she is outranked by Twitter and Apple. Ah, cruel World 2.0 capitalism.
How do I know? Simple, I have a list of 1,587 posts with 273,994 [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://scobleizer.com/">Robert Scoble</a> likes Google better than Microsoft (but not much) &#8211; and I have proof for that. He also holds his wife <a href="http://maryamie.spaces.live.com/">Maryam</a> dearer than his company <a href="http://www.podtech.net/home/">PodTech</a>, but sadly she is outranked by Twitter and Apple. Ah, cruel World 2.0 capitalism.</p>
<p>How do I know? Simple, I have a list of 1,587 posts with 273,994 running words of text that Mr. Scoble has produced between 2 Aug 2006 and 4 Jul 2007. That translates into 18,362 sentences. An average Scoble blog entry has a length of 172.6 words, with 14.9 words per sentence and an average word length of 3.8; all of which is fairly &#8211; deep breath &#8211; average for a blog.</p>
<p>All, except for the word count. It&#8217;s pretty impressive, especially when you consider that he&#8217;s been at it for almost 6 years (I believe he started in <a href="http://scoble.weblogs.com/2001/10/07.html">October 2001</a> &#8211; correct me if I&#8217;m wrong). That&#8217;s 69 months of blogging, which translates into an estimated staggering 1,65 million words. That would make him twice as productive as <a href="http://en.wikipedia.org/wiki/William_Shakespeare">William Shakespeare</a>, who (only) <a href="http://shakespeare.about.com/b/a/020320.htm">managed 884,647 words</a> in his entire lifetime, though in all fairness it has to be noted that Mr. Scoble didn&#8217;t have to write all that with a <a href="http://en.wikipedia.org/wiki/Quill">quill pen</a>.</p>
<p>And here are his favorite nouns, by frequency (the number after the word  indicates how often in occurs).</p>
<p>1     Google     1015<br />
2     blog     779<br />
3     Microsoft     776<br />
4     people     688<br />
5     video     503<br />
6     stuff     393<br />
7     things     365<br />
8     something     357<br />
9     way     354<br />
10     Web     343<br />
11     lot     322<br />
12     today     320<br />
13     time     301<br />
14     thing     290<br />
15     link     280<br />
16     Apple     267<br />
17     week     259<br />
18     Search     258<br />
19     world     256<br />
20     post     245<br />
21     videos     229<br />
22     bloggers     220<br />
23     interview     217<br />
24     Twitter     215<br />
25     blogs     213<br />
26     company     206<br />
27     one     199<br />
28     Maryam     199<br />
29     update     197<br />
30     day     195<br />
31     fun     193<br />
32     someone     192<br />
33     news     190<br />
34     team     185<br />
35     companies     178<br />
36     lots     177<br />
37     iPhone     175<br />
38     service     172<br />
39     Steve     171<br />
40     show     171<br />
41     site     170<br />
42     TechMeme     169<br />
43     business     165<br />
44     phone     160<br />
45     Windows     159<br />
46     conference     158<br />
47     year     158<br />
48     PodTech     153<br />
49     minutes     153<br />
50     developers     151</p>
]]></content:encoded>
			<wfw:commentRss>http://corpblawg.ynada.com/2007/07/04/scoble-more-productive-than-shakespeare/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Fake can be just as good</title>
		<link>http://corpblawg.ynada.com/2007/04/10/fake-can-be-just-as-good</link>
		<comments>http://corpblawg.ynada.com/2007/04/10/fake-can-be-just-as-good#comments</comments>
		<pubDate>Tue, 10 Apr 2007 14:44:56 +0000</pubDate>
		<dc:creator>Cornelius</dc:creator>
				<category><![CDATA[Corporate Blogging]]></category>
		<category><![CDATA[Debbie Weil]]></category>
		<category><![CDATA[Fake Blogs]]></category>
		<category><![CDATA[Gourmet Station]]></category>
		<category><![CDATA[PR]]></category>
		<category><![CDATA[ROI of Blogging]]></category>
		<category><![CDATA[Robert Scoble]]></category>

		<guid isPermaLink="false">http://corpblawg.ynada.com/2007/04/10/fake-can-be-just-as-good</guid>
		<description><![CDATA[That&#8217;s the title of a great 1997 album by Blonde Redhead and as it happens, it is also today&#8217;s topic &#8211; just in a way not related to alternative rock, but to (corporate) blogging.
Here&#8217;s the thing: it never ceases to intrigue me how often I come across blogging-related advice. There&#8217;s no shortage of suggestions, guidelines [...]]]></description>
			<content:encoded><![CDATA[<p>That&#8217;s the title of a great 1997 <a href="http://www.amazon.com/Fake-Can-Be-Just-Good/dp/B0000019LV">album</a> by <a href="http://www.last.fm/music/Blonde+Redhead">Blonde Redhead</a> and as it happens, it is also today&#8217;s topic &#8211; just in a way not related to alternative rock, but to (corporate) blogging.</p>
<p>Here&#8217;s the thing: it never ceases to intrigue me how often I come across blogging-related advice. There&#8217;s no shortage of suggestions, guidelines and even rules out there &#8211; rules that are often considered absolute and inviolable by those who postulate them.  Often suggestions from perceived authorities such as <a href="http://scobleizer.com/">Robert Scoble</a> and <a href="http://www.debbieweil.com/">Debbie Weil</a> on how to blog are interpreted as dogma; for example, the maxims that blogs are personal, that you must be transparent and so forth have all become pervasive*. How often have you read that a blog is a conversation, or that misleading readers about the identity or motives of the blogger is immoral?</p>
<p>I don&#8217;t want to challenge any of these ideas, but I do want to make a distinction between the different shades of meaning of the words <em>blog</em>, <em>blogging</em> and <em>blogger</em>, because it is hard to talk about something when you lack a consistent definition. I also want to question the validity of the judgment that certain blogs are &#8220;fake&#8221;, or at least ask whether that&#8217;s really a bad thing.</p>
<p><em>Blogging</em> is understood alternately understood as</p>
<p>a) the use of a publishing technology</p>
<p>b) the style in which blogs are often written</p>
<p>c) the type of social interaction between the blogger and his readers</p>
<p>and often &#8211; but not always &#8211; it is the combination of all three of these things. Note that they build upon each other: a bloggy style makes limited sense when you&#8217;re writing a letter (using another publishing technology), because even though the two types of text share several common traits they also differ significantly in other regards.</p>
<p>Say you&#8217;re a Java developer who likes to write about coding, snowboarding in the Rockies and Frank Miller comic books. You&#8217;ve set up an installation of <a href="http://wordpress.org/">Wordpress</a> on your own webserver and publish your first entry. It could start like this:</p>
<p><em>Hey everyone! So, guess what, I&#8217;ve decided to start a blog too. I&#8217;ll post here from time to time to talk about whatever catches my interest [...]</em></p>
<p>Even with just a handful of words, it can be clearly established that this kind of writing appeared in a blog and not, say, a newspaper, a personal diary, or a speech, even though it contains elements that are also common in these genres (of course it has the word &#8220;blog&#8221; in it, but even without that keyword I think a classification is possible). Now imagine that you&#8217;re a loyal reader of this blog and one day you find out that your snowboarding hacker friend is actually an invention &#8211; a fictional character developed by the department of systematic deception (DoSD) of a global PR firm (let&#8217;s call it Noble PR).</p>
<p>How would you react to this piece of information?</p>
<p>I think one gets a good idea of how people feel about these things when looking at <a href="http://gourmetstationblog.typepad.com/">blogs like this one</a> and <a href="http://blogthenticity.com/2005/03/31/meet-t-a-at-gourmet-station-blog/">reactions such as these</a> (read the first few comments). Blogs like Gourmet Station&#8217;s have been widely criticized for &#8220;violating the rules&#8221; and &#8220;being fake&#8221;. Where do these sentiments come from? They are the result of a holistic interpretation of blogs as a specific combination of a publishing technology, a style of writing and a kind of social interaction (a + b + c; see above). In other words: if you run a blogging software, write from a first-person viewpoint and directly address your readers, it is assumed that you are a real person, because only real human beings can engage in such an interaction (meaning a + b implicates c).</p>
<p>There are good reasons why you might want to use a blog as a publishing tool <em>without</em> writing in a bloggy style or allowing comments from your readers. Tools such as Wordpress and <a href="http://www.movabletype.org/">Movable Type</a> are used for everything from publishing poetry to managing entire websites and their versatility makes &#8220;non-traditional&#8221; usages plausible. But the Catch 22 appears to be style: if a writer makes frequent use of the first-person pronoun, <a href="http://en.wikipedia.org/wiki/Vocative">vocatives,</a> <a href="http://en.wikipedia.org/wiki/Interjection">interjections</a> and other stylistic elements that are traditionally frequent in spoken language in what looks like a blog in terms of presentation, it must be assumed that he is communicating with me, because that is how a <em>typical</em> blog works.</p>
<p>Social interactions of even the simplest type represent an investment for the participants. I react to you in a certain way because I have assumptions both <em>about you</em> and <em>about your assumptions about me</em>. If my assumptions turn out to be unfounded, the result is a loss of face. Nobody wants to deal with someone who isn&#8217;t honest about their identity.</p>
<p>The special thing about blogs is that the technological frame they live in makes it especially plausible to assume these things. Nobody finds the conversational style described above terribly confusing or irritating in a novel, despite the fact that we usually know the difference between the voice of the author and the voice of his fictional characters**. But the difference is that I can&#8217;t interact with the author when reading a novel and thus there is very little likelihood that I&#8217;ll mistake what is going on for a real instance of communication that somehow involves me.</p>
<p>So where does that leave us? And why is the title of this post &#8220;fake can be just as good&#8221;?</p>
<p>Despite the outrage two years ago, the fictional T. Alexander still blogs for Gourmet Station and the blog has a PageRank of 5 out of 10 (this site has a mere 3). It shows up in fourth place <a href="http://www.google.com/search?q=gourmet+blog">if you google for &#8220;gourmet blog&#8221;</a> and, according to Technorati, <a href="http://www.technorati.com/search/www.gourmetstationblog.typepad.com%2F">almost 400 links poin there</a>. Finally the <a href="http://www.scoutblogging.com/success_study/index.html">Northeastern University/Backbone Media Study</a> lists it as an example for successful corporate blogging.</p>
<p>Here&#8217;s a (rather long) excerpt that provides an excellent picture of Gourmet Station&#8217;s approach to the blog (<a href="http://www.scoutblogging.com/success_study/blogger_interviews/gourmet_station_donna_lynesmil.html">taken from the study</a>):</p>
<blockquote><p><em>Donna described how everything on the blog has to be consistent with the brand.  She moderates the comments and makes sure those comments are <strong>consistent with the brand</strong>.  No profanity or unrelated comments are allowed on the blog. Donna explained that â€œeverything has got to be very <strong>buttoned up</strong>, we have a very buttoned up brand, and we have a very upscale brand, very upscale, well educated customers. So anything that goes out there has to be <strong>consistent</strong> with that.â€ The blog also allows the company to discuss their content in a laid back tone.  That content has produced higher rankings on search engines and helped to increase traffic to the blog by 10%.</em></p>
<p><em>Donna believes it to be important that the people who write on the blog are <strong>knowledgeable</strong> about food and wine.  The blog&#8217;s readers are looking for ideas around food, drink, and entertainment.</em></p>
<p><em>The blog has helped Donna&#8217;s company add content to their website on the topics and products the company is focused on providing.  Also, the blog has given Donna the ability to place content that they otherwise would not have been able to put on their website.  Donna said it was important that a company covers all of the topics they wish to cover in their blog posts, and to categorize those topics by keyword.</em></p>
<p><em>The Gourmet Station blog has achieved a number two ranking on the keyword &#8220;gourmet dinners&#8221; in <a href="http://search.yahoo.com/search?p=gourmet+dinners&amp;fr=yfp-t-501&amp;toggle=1&amp;cop=mss&amp;ei=UTF-8" title="Check search engine ranking in Yahoo">Yahoo</a>! The blog has played a big part in helping the company to achieve that ranking.  According to Donna, the blog has also helped establish the company&#8217;s brand and provide more sales conversions by making a &#8220;passionate connection&#8221; with readers.</em></p>
<p><em>The topic that generates the most conversation and interaction from readers on the blog is romance.  Donna said that made sense, as the search volumes for romance and dinner have a great connection.</em></p>
<p><em>Donna selects the content of the posts by season.  Donna said the blog has 14 categories, and the company always has a recent post in each of the categories.</em></p>
<p><em>Donna recommends a company <strong>have a strategy</strong> before starting to blogging.  Her company has two strategies: <strong>to fill their categories with content and to increase theyâ€™re</strong></em> (sic) <em><strong>ranking on search engines</strong>.</em></p></blockquote>
<p>The bottom line appears to be: Gourmet Station designed a blog to increase search engine visibility and to publish material that did not fit into the context of a traditional corporate site. Perhaps they felt that this material was too context-dependent (recipes for seasonal gourmet foods, etc), or that a less formal style of writing was needed, but only in a certain limited area and not for the entire site. Whatever their motivation &#8211; there is hardly a rational reason to argue against their success. Whether &#8220;fake&#8221; or &#8220;real&#8221; (note the quotes), it appears that different strategies can realize different goals for different people.</p>
<p>I&#8217;m pretty sure that examples such as the Gourmet Station blog will remain marginal, though. It&#8217;s not really because of the outrage &#8220;fake&#8221; company blogs generate  (is there such a thing as bad PR?), but because it seems somewhat contrived and unnecessary to come up with a fictional character to write your blog when you might just as well have a real person do it. It&#8217;s not too hard to stick with The Message even when you&#8217;re blogging under your own name &#8211; numerous product blogs out there prove that. How you measure success is an entirely other question. In that context, note Gourmet Station&#8217;s specific goals of increasing visibility and publishing &#8220;unconventional&#8221; content.</p>
<p>So there it is. You can blog, or you can <em>publish via a blog</em>. Or you can do the latter and hope that people will believe it&#8217;s really the former. Not much shame in that, I think.</p>
<p>* The single most important document in this context is probably Scoble&#8217;s <a href="http://radio.weblogs.com/0001011/2003/02/26.html">Corporate Weblog Manifesto</a>, which has seems to have influenced most subsequently formulated blogging guidelines.</p>
<p>** Of course this is systematically exploited in literature, for example in <a href="http://en.wikipedia.org/wiki/Epistolary_novel">epistolary novels</a>. Playing with the status of a piece of writing as ambiguously real or fictional was also a hallmark of Postmodernism.</p>
<p>(Edit) Here are a few more interesting links I initially forgot to include: <a href="http://www.micropersuasion.com/2005/04/here_comes_anot.html">one</a>, <a href="http://www.whatsnextblog.com/archives/2005/09/a_really_lame_fake_blog_from_h.asp">two</a>, <a href="http://blogs.guardian.co.uk/technology/archives/2006/10/16/whats_a_flog_a_fake_blog.html">three</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://corpblawg.ynada.com/2007/04/10/fake-can-be-just-as-good/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Visualizing blog language data</title>
		<link>http://corpblawg.ynada.com/2007/02/09/visualizing-blog-language-data</link>
		<comments>http://corpblawg.ynada.com/2007/02/09/visualizing-blog-language-data#comments</comments>
		<pubDate>Thu, 08 Feb 2007 23:31:03 +0000</pubDate>
		<dc:creator>Cornelius</dc:creator>
				<category><![CDATA[Corporate Blogging]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Jonathan Schwartz]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Robert Scoble]]></category>
		<category><![CDATA[Style]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://corpblawg.ynada.com/2007/02/09/visualizing-blog-language-data</guid>
		<description><![CDATA[I&#8217;ve been playing around with this great little tool for several days now and thought I&#8217;d share some of the results with you.
But first, here&#8217;s a brief recap of what I&#8217;ve been doing before I start throwing statistics at you.
I am in the process of building a textual database (or corpus, as linguists call it) [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been playing around with <a href="http://services.alphaworks.ibm.com/manyeyes/app">this great little tool</a> for several days now and thought I&#8217;d share some of the results with you.</p>
<p>But first, here&#8217;s a brief recap of what I&#8217;ve been doing before I start throwing statistics at you.</p>
<p>I am in the process of building a textual database (or corpus, as linguists call it) of corporate and enterprise web logs. The purpose of this corpus is to investigate corporate blogs as a text type. In the current phase of my research, I am especially interested in the following questions</p>
<p>- how do corporate blogs compare stylistically with non-corporate blogs, news texts and other types?</p>
<p>- is there a typical &#8216;corporate blogging style&#8217; in terms of how people write?</p>
<p>- are there recognizable differences in style that correspond with differences in purpose or authorship (in other words, do CEOs, marketers, software developers, etc have distinct styles?)</p>
<p>- how much variation is there stylistically between different blogs, different bloggers in the same hub (e.g. <a href="http://blogs.msdn.com/">MSDN</a>) and between different posts by the same blogger?</p>
<p>- are there patterns of change in style over time?</p>
<p>You might wonder what such a description is good for (well, apart from furthering the pursuit of knowledge and all that). I think that, on the practical level, it will enable us to better understand what people are trying to achieve with blogs and how they do it. Ultimately blogging is about good writing. The trouble is, neither is &#8216;good&#8217; easily defined, nor is it always the same to everyone on any occasion. Blogging styles are highly dynamic and situation-dependent and I think the most successful bloggers very consciously adapt different styles to address different people and issues.</p>
<p>Right, so what do I have so far?</p>
<p>One of the first measures I&#8217;ve implemented into my database is a relatively simple formula for calculating how formal/informational or (on the other end of the scale) involved/context-dependent a text is. This is done by adding the frequencies of certain types of words together and subtracting others, under the assumption that (for example) nouns are more numerous in texts which are primarily informational, while a high frequency of pronouns indicates involvement. The formula looks like this:</p>
<p><span id="commentLoop"><span class="description">0.5 * ((NOUNS + ADJECTIVES + PREPOSITIONS + DETERMINERS) &#8211; (PRONOUNS + VERBS + ADVERBS + INTERJECTIONS) + 100) 	             </span></span></p>
<p>(see <a href="http://www.springerlink.com/content/p08225g588771321/">Heylighen and Dewaele 2002</a>)</p>
<p>As you can guess, the results are potentially ambiguous &#8211; in other words, texts can have a very high or low score for a variety of reasons &#8211; and should be used with care. That being said, the measure produces some pretty interesting results.</p>
<p>This is a chart of f-scores from <a href="http://scobleizer.com/">Robert Scoble&#8217;s blog</a></p>
<p><a href="http://services.alphaworks.ibm.com/manyeyes/view/S2fqLEsOtha637FVU9lTE2-" style="margin: 0pt; padding: 0pt"><br />
<img src="http://services.alphaworks.ibm.com/manyeyes/static-resources/snapshot/89ade5ae105f6ac401107f12e0851245.jpeg" id="$ManyEyesThumbnail" style="border-style: solid solid none; border-color: rgb(175, 117, 93) rgb(175, 117, 93) -moz-use-text-color; border-width: 1px 1px 0pt; margin: 0pt; padding: 0pt" /><br />
<img src="http://services.alphaworks.ibm.com/manyeyes/images2/blog_this_caption.jpg" style="border: 0pt none ; margin: 0pt; padding: 0pt; display: block; position: relative; top: -5px" id="Any_0_0" /><br />
</a></p>
<p>Each data point in the graph is the f-score for a single post, or the average for several posts made on a single day. As the graph shows, Scoble&#8217;s posts are fairly consistently in the 50s in August and September. They surge to over 100 in mid-October and make overall gains in November and December, though these gains aren&#8217;t really as significant as they might look at first. The more notable change is the high degree of variation in these months compared to the time span before that.</p>
<p>You might wonder which posts exactly get a high or low f-score. Here are the entries with the highest score, by date.</p>
<p><em>Comparing new TailRank/DiggTech/TechMeme to Google Reader</em>, <a href="http://scobleizer.com/2006/10/16/comparing-new-tailrankdiggtechtechmeme-to-google-reader/">16 October 2006</a> (f-score 102)</p>
<p><em>Grapes on a Plane</em>, <a href="http://scobleizer.com/2006/10/29/grapes-on-a-plane/">29 October 2006</a> (f-score 97)</p>
<p><em>The highs and lows of CES</em>, <a href="http://scobleizer.com/2007/01/15/the-highs-and-lows-of-ces/">15 January 2007</a> (f-score 93)</p>
<p><em>Photo &#8220;training&#8221;</em>, <a href="http://scobleizer.com/2007/01/21/">21 January 2007</a> (f-score 106)</p>
<p>If you have a look at those posts, you&#8217;ll probably notice that they aren&#8217;t really in any way more <em>formal</em> than Scoble&#8217;s other writing. The difference is that they tend to be more <em>informational</em>, i.e. have more and more condensed information crammed into to them than most entries. Lists and enumerations will immediately lead to a high score (because they usually translate into a high noun count) and for Scoble those entries which are written in a sort of telegraph style to convey information about a photowalk or CES thus have a high score. This doesn&#8217;t really demerit the f-score as a metric &#8211; it simply means that it&#8217;s context-sensitive. What&#8217;s important is that, with an overall mean score of 60, Scobelizer ranks on the extreme low end of the formal/informational vs involved/contextual scale. To Scoble, <a href="http://www.amazon.com/Naked-Conversations-Changing-Businesses-Customers/dp/047174719X/sr=8-1/qid=1170970309/ref=pd_bbs_sr_1/002-5709788-7375262?ie=UTF8&amp;s=books">blogs really <em>are</em> conversations</a>, not just metaphorically but in a quite literal stylistic way.</p>
<p>That&#8217;s the score for one source over time. Let&#8217;s compare a bunch of sources.</p>
<p><a href="http://services.alphaworks.ibm.com/manyeyes/view/Sh77bEsOtha6q6kQLhXcE2-" style="margin: 0pt; padding: 0pt"><br />
<img src="http://services.alphaworks.ibm.com/manyeyes/static-resources/snapshot/89ade5ae109c926d0110a23b57730236.jpeg" id="$ManyEyesThumbnail" style="border-style: solid solid none; border-color: rgb(175, 117, 93) rgb(175, 117, 93) -moz-use-text-color; border-width: 1px 1px 0pt; margin: 0pt; padding: 0pt" /><br />
<img src="http://services.alphaworks.ibm.com/manyeyes/images2/blog_this_caption.jpg" style="border: 0pt none ; margin: 0pt; padding: 0pt; display: block; position: relative; top: -5px" id="Any_0_0" /><br />
</a></p>
<p>If you have trouble seeing anything on the chart, look for a little dropdown menu on the lower right hand side labeled <strong>dot size</strong>. Change it from &#8216;posts&#8217; to &#8216;no selection&#8217; and all the dots will be changed to have the same size, which should make the whole thing a lot easier to read.</p>
<p>The chart is a representation of scores for 137 different blogs, computed from data collected during the last five months. Each dot represents a single blog and its average f-score on the x axis. The position of a dot on the y axis indicates the <a href="http://en.wikipedia.org/wiki/Standard_deviation">standard deviation</a> of values inside of that blog, i.e. the degree of internal variation</p>
<p>The vast majority of the sources I&#8217;ve used are corporate blogs &#8211; after all that&#8217;s what my research is about. But in addition I&#8217;ve also thrown in a few non-corporate sources, simply to be able to compare one type of blog with another one. Thus the list contains 17 personal blogs randomly found via <a href="http://www.blogger.com/">blogger.com</a>, 1 a-list professional blogger (Scoble), 1 political blog hub (<a href="http://www.huffingtonpost.com/">huffingtonpost.com</a>) and 3 non-blog sources, namely editorials from the <a href="http://topics.nytimes.com/top/opinion/editorialsandoped/editorials/">New York Times</a>, the <a href="http://www.washingtonpost.com/wp-dyn/content/linkset/2005/05/30/LI2005053000331.html?nav=left">Washington Post</a> and the <a href="http://www.latimes.com/news/opinion/editorials/">LA Times</a> collected in the course of this week (see below for a full list of sources).</p>
<p>The first thing likely to catch you eyes are the outliers. On the far right hand side, there is one source simply tagged &#8220;Blog&#8221; (informative, I know) with a record f-score of 195 and and a standard deviation of 92. That&#8217;s <a href="http://rayozzie.spaces.live.com/blog/">Ray Ozzie</a>, Chief Software Architect of <a href="http://www.microsoft.com/">Microsoft</a>. Now, if you have a look at his blog you might find that the best description for his writing is not so much formal, but rather &#8220;technical&#8221; or maybe &#8220;information-oriented&#8221;. The reasons for the high scores are the many compound nouns (things like <em>development ecosystem</em>, <em>application components</em>, <em>clipboard data formats</em>, etc) coupled with the overall significant length of entries. Like the other outlier, <a href="http://irvingwb.typepad.com/">Irving Wladawsky-Berger</a> of <a href="http://www.ibm.com/">IBM</a>, Ozzie also produces very long posts. Ozzie&#8217;s longest has 1,700 words,  while Wladawsky-Berger is a close second with 1,500. Length tends to coincide with somewhat higher f-scores, however, there are counter-examples. <a href="http://blogs.msdn.com/heatherleigh/">Heather Hamilton</a> has <a href="http://blogs.msdn.com/heatherleigh/archive/2006/07/26/679215.aspx">one post</a> with a whopping word count of over 2,000 and an f-score of only 105. Generally brief posts tend to coincide with lower score<span id="BlogViewId"><span style="font-family: Verdana"></span></span><font color="#000000">s, but, as the example shows, there are exceptions.</font></p>
<p><font color="#000000">Overall it is important to consider a few things, especially in regards to the those sources with a high standard deviation <em>and</em> a high f-score:</font></p>
<p><font color="#000000">- the deviation is often high simply because there aren&#8217;t many posts (for example, Ozzie only has 6 entries)</font></p>
<p><font color="#000000">- several of the high-deviation blogs are hubs, i.e. they aggregate a number of individual blogs (e.g. <a href="http://blogs.msdn.com/">MSDN</a> and HuffPo)</font></p>
<p><font color="#000000">But the cool part is that the remaining sources usually contain very conscious stylistic variation (<a href="http://blogs.sun.com/jonathan/">Jonathan Schwarz</a> is a prime example). I other words, they write differently to address different people and achieve different things and this &#8211; at least to some extent &#8211; stylistically visible. Compare that with the scores for the three newspaper editorials grouped together in the lower right area of the plot. They are surprisingly consistent if you consider that we&#8217;re looking at texts published in three different papers, written by an even larger number of journalists. Which just shows that the editorial is a pretty solidified type of text in terms of style, while the (corporate) blog isn&#8217;t &#8211; at least not yet.<br />
</font></p>
<p><font color="#000000">Anyway, I&#8217;ll wrap it up for now and save the more in-depth look for another post.</font></p>
<p><strong>Sources</strong></p>
<p>iUpload InSights<br />
<a href="http://hopper.iupload.com/default.asp">http://hopper.iupload.com/default.asp</a></p>
<p>Time Leadership<br />
<a href="http://www.jimestill.com/">http://www.jimestill.com/</a></p>
<p>I Love Me, vol. I<br />
<a href="http://www.michaelocc.com/">http://www.michaelocc.com/</a></p>
<p>Simply Albert<br />
<a href="http://simplyalbert.blogspot.com/">http://simplyalbert.blogspot.com/</a></p>
<p>ChristianLindholm.com<br />
<a href="http://www.christianlindholm.com/christianlindholm/">http://www.christianlindholm.com/christianlindholm/</a></p>
<p>PR Thoughts<br />
<a href="http://www.prthoughts.com/">http://www.prthoughts.com/</a></p>
<p>Occam&#8217;s Razor<br />
<a href="http://mgoldberg.typepad.com/occams_razor/">http://mgoldberg.typepad.com/occams_razor/</a></p>
<p>Loic Le Meur Blog<br />
<a href="http://www.loiclemeur.com/">http://www.loiclemeur.com/</a></p>
<p>CTO Blog<br />
<a href="http://www.capgemini.com/ctoblog/">http://www.capgemini.com/ctoblog/</a></p>
<p>Lakattack<br />
<a href="http://spreadlog.net/">http://spreadlog.net/</a></p>
<p>Marcel Reichart Blog<br />
<a href="http://marcellomedia.blogs.com/mrb/">http://marcellomedia.blogs.com/mrb/</a></p>
<p>stefan<br />
<a href="http://stefan.21publish.com/">http://stefan.21publish.com/</a></p>
<p>Amazon Web Services Blog<br />
<a href="http://aws.typepad.com/">http://aws.typepad.com/</a></p>
<p>Cisco High Tech Policy Blog<br />
<a href="http://blogs.cisco.com/gov/">http://blogs.cisco.com/gov/</a></p>
<p>Digital Straight Talk<br />
<a href="http://www.digitalstraighttalk.com/">http://www.digitalstraighttalk.com/</a></p>
<p>Direct2Dell, Dell&#8217;s Weblog<br />
<a href="http://www.direct2dell.com/default.aspx">http://www.direct2dell.com/default.aspx</a></p>
<p>eBay Developers Program<br />
<a href="http://ebaydeveloper.typepad.com/">http://ebaydeveloper.typepad.com/</a></p>
<p>EDS&#8217; Next Big Thing Blog<br />
<a href="http://www.eds.com/sites/cs/blogs/eds_next_big_thing_blog/default.aspx">http://www.eds.com/sites/cs/blogs/eds_next_big_thing_blog/default.aspx</a></p>
<p>From Edison&#8217;s Desk &#8211; GE Global Research Blog<br />
<a href="http://www.grcblog.com/">http://www.grcblog.com/</a></p>
<p>Real Baking with Rose Levy Beranbaum<br />
<a href="http://www.realbakingwithrose.com/">http://www.realbakingwithrose.com/</a></p>
<p>GM Fastlane Blog<br />
<a href="http://fastlane.gmblogs.com/">http://fastlane.gmblogs.com/</a></p>
<p>Google Blog<br />
<a href="http://googleblog.blogspot.com/">http://googleblog.blogspot.com/</a></p>
<p>Dan Socci&#8217;s Blog<br />
<a href="http://h20325.www2.hp.com/blogs/socci">http://h20325.www2.hp.com/blogs/socci</a></p>
<p>Kara R<br />
<a href="http://www.honeywellblogs.com/kara_r/">http://www.honeywellblogs.com/kara_r/</a></p>
<p>ING Asia/Pacific&#8217;s Blog<br />
<a href="http://mycupofcha.ingblogs.com/">http://mycupofcha.ingblogs.com/</a></p>
<p>TinyScreenfuls.com<br />
<a href="http://www.tinyscreenfuls.com/">http://www.tinyscreenfuls.com/</a></p>
<p>Open for Discussion<br />
<a href="http://csr.blogs.mcdonalds.com/default.asp">http://csr.blogs.mcdonalds.com/default.asp</a></p>
<p>One Louder<br />
<a href="http://blogs.msdn.com/heatherleigh/">http://blogs.msdn.com/heatherleigh/</a></p>
<p>NIKEBASKETBALL<br />
<a href="http://blog.nikebasketball.com/">http://blog.nikebasketball.com/</a></p>
<p>OraBlogs<br />
<a href="http://www.orablogs.com/orablogs/">http://www.orablogs.com/orablogs/</a></p>
<p>Things That Make You Go Wireless<br />
<a href="http://businessblog.sprint.com/1/1/">http://businessblog.sprint.com/1/1/</a></p>
<p>The Lobby from SPG<br />
<a href="http://www.thelobby.com/">http://www.thelobby.com/</a></p>
<p>Jonathan Schwartz&#8217;s Weblog<br />
<a href="http://blogs.sun.com/jonathan">http://blogs.sun.com/jonathan</a></p>
<p>Texas Instruments Video360 Blog<br />
<a href="http://blogs.ti.com/">http://blogs.ti.com/</a></p>
<p>The Jason Calacanis Weblog<br />
<a href="http://www.calacanis.com/">http://www.calacanis.com/</a></p>
<p>Boeing Blog: Randy&#8217;s Journal<br />
<a href="http://www.boeing.com/randy/">http://www.boeing.com/randy/</a></p>
<p>Guided By History<br />
<a href="http://blog.wellsfargo.com/guidedbyhistory/">http://blog.wellsfargo.com/guidedbyhistory/</a></p>
<p>PlayOn<br />
<a href="http://blogs.parc.com/playon/">http://blogs.parc.com/playon/</a></p>
<p>Yahoo! Search Blog<br />
<a href="http://www.ysearchblog.com/">http://www.ysearchblog.com/</a></p>
<p>The CEO&#8217;s Blog &#8211; John Mackey<br />
<a href="http://www.wholefoodsmarket.com/blogs/jm/">http://www.wholefoodsmarket.com/blogs/jm/</a></p>
<p>Blog<br />
<a href="http://www.nixonmcinnes.co.uk/about-us/blog/">http://www.nixonmcinnes.co.uk/about-us/blog/</a></p>
<p>Kate&#8217;s Blog<br />
<a href="http://katesblog.u3.com/">http://katesblog.u3.com/</a></p>
<p>The Bocada Blog<br />
<a href="http://bocada.typepad.com/bocadablog/">http://bocada.typepad.com/bocadablog/</a></p>
<p>Michael M&#8217;s X10 Blog<br />
<a href="http://www.x10community.com/michaelm/">http://www.x10community.com/michaelm/</a></p>
<p>Notes from MNR<br />
<a href="http://blogs.adobe.com/notesfrommnr/">http://blogs.adobe.com/notesfrommnr/</a></p>
<p>Entrepreneurial Marketing<br />
<a href="http://blogs.accenture.nl/EntrepreneurialMarketing/">http://blogs.accenture.nl/EntrepreneurialMarketing/</a></p>
<p>TiVo Blog<br />
<a href="http://blog.tivo.com/tivo_blog/">http://blog.tivo.com/tivo_blog/</a></p>
<p>Guiness Blog<br />
<a href="http://www.guinnessblog.co.uk/blogs/home.aspx?App=guinnessblog&amp;allowAccess=4r7a6h">http://www.guinnessblog.co.uk/blogs/home.aspx?App=guinnessblog&amp;allowAccess=4r7a6h</a></p>
<p>Hu Yoshida&#8217;s Blog<br />
<a href="http://blogs.hds.com/hu/">http://blogs.hds.com/hu/</a></p>
<p>Forta Blog<br />
<a href="http://www.forta.com/blog/">http://www.forta.com/blog/</a></p>
<p>Novell Open PR<br />
<a href="http://www.novell.com/prblogs/">http://www.novell.com/prblogs/</a></p>
<p>Jeff Jaffe&#8217;s Blog<br />
<a href="http://www.novell.com/ctoblog/">http://www.novell.com/ctoblog/</a></p>
<p>Blog<br />
<a href="http://rayozzie.spaces.live.com/blog/">http://rayozzie.spaces.live.com/blog/</a></p>
<p>Mena&#8217;s Corner<br />
<a href="http://www.sixapart.com/about/corner/">http://www.sixapart.com/about/corner/</a></p>
<p>Alan Meckler<br />
<a href="http://weblogs.jupitermedia.com/meckler/">http://weblogs.jupitermedia.com/meckler/</a></p>
<p>Infrablog<br />
<a href="http://blogs.verisign.com/infrablog/">http://blogs.verisign.com/infrablog/</a></p>
<p>Thompson Holidays Blog<br />
<a href="http://thomsonholidays.blogs.com/my_weblog/">http://thomsonholidays.blogs.com/my_weblog/</a></p>
<p>Baby Babble<br />
<a href="http://stonyfield.typepad.com/babybabble/">http://stonyfield.typepad.com/babybabble/</a></p>
<p>The Bovine Bugle<br />
<a href="http://stonyfield.typepad.com/bovine/">http://stonyfield.typepad.com/bovine/</a></p>
<p>Stone Creek Coffee Blog<br />
<a href="http://sccv3.stonecreekcoffee.com/blog.cfm">http://sccv3.stonecreekcoffee.com/blog.cfm</a></p>
<p>bugBlog<br />
<a href="http://rescuebugblog.typepad.com/rescue_bugblog/">http://rescuebugblog.typepad.com/rescue_bugblog/</a></p>
<p>Speaking of Security<br />
<a href="http://www.rsasecurity.com/blog/">http://www.rsasecurity.com/blog/</a></p>
<p>Hybrid Talk<br />
<a href="http://hybridtalk.nyse.com/">http://hybridtalk.nyse.com/</a></p>
<p>Jonathan Bruce&#8217;s WebLog<br />
<a href="http://jonathanbruceconnects.com/jonathan_bruce/">http://jonathanbruceconnects.com/jonathan_bruce/</a></p>
<p>The Tinbasher Sheet Metal Blog<br />
<a href="http://www.butlersheetmetal.com/tinbasherblog/">http://www.butlersheetmetal.com/tinbasherblog/</a></p>
<p>The NCC Weblog<br />
<a href="http://www.northfieldconstruction.net/">http://www.northfieldconstruction.net/</a></p>
<p>Signs Never Sleep<br />
<a href="http://signsneversleep.typepad.com/">http://signsneversleep.typepad.com/</a></p>
<p>ACCAbuzz<br />
<a href="http://www.accabuzz.com/">http://www.accabuzz.com/</a></p>
<p>English Cut<br />
<a href="http://www.englishcut.com/">http://www.englishcut.com/</a></p>
<p>Life at Wal-Mart<br />
<a href="http://walmartfacts.com/lifeatwalmart/">http://walmartfacts.com/lifeatwalmart/</a></p>
<p>Scobelizer<br />
<a href="http://scobleizer.wordpress.com/">http://scobleizer.wordpress.com/</a></p>
<p>The DustBlog<br />
<a href="http://thedustblog.blogspot.com/">http://thedustblog.blogspot.com/</a></p>
<p>The Baby Blawg<br />
<a href="http://babyblawg.blogspot.com/">http://babyblawg.blogspot.com/</a></p>
<p>life&#8217;s short&#8230;make it sweet&#8230;<br />
<a href="http://dunlin.blogspot.com/">http://dunlin.blogspot.com/</a></p>
<p>xbsg<br />
<a href="http://mi50.blogspot.com/">http://mi50.blogspot.com/</a></p>
<p>I am the evil master genius<br />
<a href="http://arnique.blogspot.com/">http://arnique.blogspot.com/</a></p>
<p>i want you<br />
<a href="http://nuratikahnabilah.blogspot.com/">http://nuratikahnabilah.blogspot.com/</a></p>
<p>44 Words for 365 People<br />
<a href="http://44for365.blogspot.com/">http://44for365.blogspot.com/</a></p>
<p>neurotic kitten<br />
<a href="http://nkitten.blogspot.com/index.html">http://nkitten.blogspot.com/index.html</a></p>
<p>Discover Norwegian Music<br />
<a href="http://discovernorwegianmusic.blogspot.com/">http://discovernorwegianmusic.blogspot.com/</a></p>
<p>my smiles arent a facade<br />
<a href="http://badass-freak.blogspot.com/">http://badass-freak.blogspot.com/</a></p>
<p>ï¿½?Å¯ï¿½?Ã°Â£Ð· ï¿½?ï¿½? Å¦ï¿½?Ç¿Å¯Äï¿½?Å§ï¿½?<br />
<a href="http://chibinyu.blog.com/">http://chibinyu.blog.com/</a></p>
<p>Flying Tragic<br />
<a href="http://tragicflyer.blog.com/">http://tragicflyer.blog.com/</a></p>
<p>The Irony of Life<br />
<a href="http://mujerlatina319.blog.com/">http://mujerlatina319.blog.com/</a></p>
<p>cudgeland<br />
<a href="http://cudge.blogspot.com/">http://cudge.blogspot.com/</a></p>
<p>Over the Horizon<br />
<a href="http://blogs.zdnet.com/OverTheHorizon/">http://blogs.zdnet.com/OverTheHorizon/</a></p>
<p>DaveBlog<br />
<a href="http://blogs.netapp.com/dave/">http://blogs.netapp.com/dave/</a></p>
<p>Earthling<br />
<a href="http://blogs.earthlink.net/">http://blogs.earthlink.net/</a></p>
<p>developerWorks blogs<br />
<a href="http://www-03.ibm.com/developerworks/blogs/">http://www-03.ibm.com/developerworks/blogs/</a></p>
<p>Irving Wladawsky-Berger<br />
<a href="http://irvingwb.typepad.com/">http://irvingwb.typepad.com/</a></p>
<p>Forum Nokia Blogs<br />
<a href="http://blogs.forum.nokia.com/author_group.html?id=2">http://blogs.forum.nokia.com/author_group.html?id=2</a></p>
<p>Nokia N90 Blog<br />
<a href="http://n90.bloggercomm.com/">http://n90.bloggercomm.com/</a></p>
<p>Sparkle Like The Stars<br />
<a href="http://www.sparklelikethestars.com/">http://www.sparklelikethestars.com/</a></p>
<p>FYI Blog<br />
<a href="http://fyi.gmblogs.com/">http://fyi.gmblogs.com/</a></p>
<p>Southwest Airlines Blog<br />
<a href="http://www.blogsouthwest.com/">http://www.blogsouthwest.com/</a></p>
<p>Benra Blog: ZoomAlbum, Photos &amp; Photo Sharing<br />
<a href="http://benra.typepad.com/">http://benra.typepad.com/</a></p>
<p>WeatherBug Corporate Blog<br />
<a href="http://blog.weatherbug.com/">http://blog.weatherbug.com/</a></p>
<p>CTO Blog &#8211; TalkBMC<br />
<a href="http://talk.bmc.com/blogs/blog-bishop/cto/">http://talk.bmc.com/blogs/blog-bishop/cto/</a></p>
<p>Commentary from Cape Clear&#8217;s CEO [...]<br />
<a href="http://www.capeclear.com/annrai/">http://www.capeclear.com/annrai/</a></p>
<p>QuickBooks Online Edition The Team Blog<br />
<a href="http://quickbooks_online_blog.typepad.com/">http://quickbooks_online_blog.typepad.com/</a></p>
<p>The QuickBooks Team Blog<br />
<a href="http://www.quickbooks.blogs.com/">http://www.quickbooks.blogs.com/</a></p>
<p>The Mindjet Blog<br />
<a href="http://blog.mindjet.com/">http://blog.mindjet.com/</a></p>
<p>Warehousing and Distribution<br />
<a href="http://thirdpartylogistics.blogspot.com/">http://thirdpartylogistics.blogspot.com/</a></p>
<p>The Official Salesforce Blog<br />
<a href="http://blogs.salesforce.com/">http://blogs.salesforce.com/</a></p>
<p>Park City Mountain Resort<br />
<a href="http://parkcity.typepad.com/park_city_mountain_resort/">http://parkcity.typepad.com/park_city_mountain_resort/</a></p>
<p>SunbeltBLOG<br />
<a href="http://sunbeltblog.blogspot.com/">http://sunbeltblog.blogspot.com/</a></p>
<p>TaylorMade Blogs<br />
<a href="http://www.taylormadeblogs.com/">http://www.taylormadeblogs.com/</a></p>
<p>Scenic Nursery Gardening Blog<br />
<a href="http://www.scenicnursery.com/">http://www.scenicnursery.com/</a></p>
<p>Lightning Labels Blog<br />
<a href="http://lightninglabels.typepad.com/blog/">http://lightninglabels.typepad.com/blog/</a></p>
<p>Wiggly Wigglers<br />
<a href="http://wigglywigglers.blogspot.com/">http://wigglywigglers.blogspot.com/</a></p>
<p>EIE FLUD<br />
<a href="http://www.eieflud.co.uk/blog/">http://www.eieflud.co.uk/blog/</a></p>
<p>Eriska, Scottish Islan<br />
<a href="http://www.isleoferiska.com/">http://www.isleoferiska.com/</a></p>
<p>Outdoor Landscape Lighting<br />
<a href="http://www.residential-landscape-lighting-design.com/blogger.html">http://www.residential-landscape-lighting-design.com/blogger.html</a></p>
<p>Thoughts of Beauty<br />
<a href="http://www.overallbeauty.com/beauty-blog/">http://www.overallbeauty.com/beauty-blog/</a></p>
<p>Stormhoek Winery<br />
<a href="http://www.stormhoek.com/">http://www.stormhoek.com/</a></p>
<p>Chevron Collectible Toy Cars<br />
<a href="http://chevroncarsblog.com/">http://chevroncarsblog.com/</a></p>
<p>MSDN Blogs<br />
<a href="http://blogs.msdn.com/">http://blogs.msdn.com/</a></p>
<p>Ruby is Coming<br />
<a href="http://rubyiscoming.blogspot.com/">http://rubyiscoming.blogspot.com/</a></p>
<p>am I lonely<br />
<a href="http://rongsheng.blogspot.com/">http://rongsheng.blogspot.com/</a></p>
<p>Pineywoods Opinings<br />
<a href="http://longleaf.blogspot.com/">http://longleaf.blogspot.com/</a></p>
<p>Tangent, Oregon<br />
<a href="http://tangentcity.blogspot.com/">http://tangentcity.blogspot.com/</a></p>
<p>Verizon &#8211; PoliBlog<br />
<a href="http://poliblog.verizon.com/PoliBlog/Blogs/poliblog.aspx">http://poliblog.verizon.com/PoliBlog/Blogs/poliblog.aspx</a></p>
<p>Ted&#8217;s Take<br />
<a href="http://ted.aol.com/">http://ted.aol.com/</a></p>
<p>The Student LoanDown<br />
<a href="http://blog.wellsfargo.com/StudentLoanDown/">http://blog.wellsfargo.com/StudentLoanDown/</a></p>
<p>Emerson Process Experts<br />
<a href="http://www.emersonprocessxperts.com/">http://www.emersonprocessxperts.com/</a></p>
<p>A Thousand Words<br />
<a href="http://1000words.kodak.com/">http://1000words.kodak.com/</a></p>
<p>Glenfiddich Blog<br />
<a href="http://blog.glenfiddich.com/">http://blog.glenfiddich.com/</a></p>
<p>IT@Intel Blog<br />
<a href="http://blogs.intel.com/it/">http://blogs.intel.com/it/</a></p>
<p>All My Eye<br />
<a href="http://allmyeye.blogspot.com/">http://allmyeye.blogspot.com/</a></p>
<p>HuffPo Full Blog Feed<br />
<a href="http://www.huffingtonpost.com/theblog/">http://www.huffingtonpost.com/theblog/</a></p>
<p>News@Cisco Notes<br />
<a href="http://blogs.cisco.com/news/">http://blogs.cisco.com/news/</a></p>
<p>Mobile Visions<br />
<a href="http://blogs.cisco.com/wireless/">http://blogs.cisco.com/wireless/</a></p>
<p>Open standards, open source, open minds, open opportunities<br />
<a href="http://www-03.ibm.com/developerworks/blogs/page/BobSutor">http://www-03.ibm.com/developerworks/blogs/page/BobSutor</a></p>
<p>Marriott on the Move<br />
<a href="http://www.blogs.marriott.com/">http://www.blogs.marriott.com/</a></p>
<p>NYT Editorials<br />
<a href="http://topics.nytimes.com/top/opinion/editorialsandoped/editorials/">http://topics.nytimes.com/top/opinion/editorialsandoped/editorials/</a></p>
<p>Washington Post Editorials<br />
<a href="http://www.washingtonpost.com/wp-dyn/content/opinions/columnsandblogs/?nav%3Dleft&amp;sub=new">http://www.washingtonpost.com/wp-dyn/content/opinions/columnsandblogs/?nav%3DleftâŠ‚=new</a></p>
<p>LA Times Editorials<br />
<a href="http://www.latimes.com/news/opinion/editorials/">http://www.latimes.com/news/opinion/editorials/</a></p>
]]></content:encoded>
			<wfw:commentRss>http://corpblawg.ynada.com/2007/02/09/visualizing-blog-language-data/feed</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Ducking out when it counts</title>
		<link>http://corpblawg.ynada.com/2006/11/23/ducking-out-when-it-counts</link>
		<comments>http://corpblawg.ynada.com/2006/11/23/ducking-out-when-it-counts#comments</comments>
		<pubDate>Wed, 22 Nov 2006 22:52:13 +0000</pubDate>
		<dc:creator>Cornelius</dc:creator>
				<category><![CDATA[Corporate Blogging]]></category>
		<category><![CDATA[Debbie Weil]]></category>
		<category><![CDATA[Jonathan Schwartz]]></category>
		<category><![CDATA[PR]]></category>
		<category><![CDATA[Robert Scoble]]></category>

		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/23/ducking-out-when-it-counts</guid>
		<description><![CDATA[I just came across this short article in the Guardian, posted last week. It follows the usual modus operandi of mentioning Robert Scoble and Jonathan Schwartz (and Thomas Mahon of English Cut fame) and goes on to quote Debbie Weil numerous times (not that there&#8217;s anything wrong with that).
But the real gem is right at [...]]]></description>
			<content:encoded><![CDATA[<p>I just came across <a href="http://business.guardian.co.uk/comment/story/0,,1950484,00.html">this short article in the Guardian</a>, posted last week. It follows the usual modus operandi of mentioning <a href="http://scobleizer.com/">Robert Scoble</a> and <a href="http://blogs.sun.com/jonathan/">Jonathan Schwartz</a> (and <a href="http://www.englishcut.com/">Thomas Mahon of English Cut fame</a>) and goes on to quote <a href="http://www.debbieweil.com/">Debbie Weil</a> numerous times (not that there&#8217;s anything wrong with that).</p>
<p>But the real gem is right at the beginning of the piece:</p>
<p><em>When The Carphone Warehouse boss Charles Dunstone started his corporate blog earlier this year, he was hailed as a cutting-edge chief executive; a man prepared to open up the inner workings of his company to the wider world and willing to communicate directly with his customers.</em></p>
<p><em>But that was April, when Britain&#8217;s biggest mobile phones retailer was riding high on a wave of favourable publicity about its &#8220;free&#8221; TalkTalk broadband offer.</em></p>
<p><em>Scroll forward a few months and the web is full of tales of &#8220;My TalkTalk Hell&#8221; as the group struggles to cope with the demand it so badly under-estimated, leaving thousands of customers angry and frustrated.</em></p>
<p><em>So what did Dunstone do at the height of the crisis? He simply stopped blogging. From September 1 until earlier this week &#8211; two and a half months &#8211; he failed to make a single entry. His post this Monday largely consists of an apology for his lengthy absence and a reassurance that the broadband supply problems are being worked out.</em></p>
<p>Ouch. If there&#8217;s one general, universal rule of business blogging it&#8217;s <em>in the midst of a crisis, silence is <strong>not</strong> golden</em>. Posting positive messages while the sailing is smooth is fine, but if there&#8217;s any time when a blog is almost indispensable, it&#8217;s when things go awry. Why? Because a blog is by far the best channel to make clear beyond doubt that</p>
<p>a) you recognize that there&#8217;s a problem</p>
<p>b) you&#8217;re sorry</p>
<p>If you aren&#8217;t convinced that those two aspects are extremely relevant, ask <a href="http://www.direct2dell.com/">these guys</a> about it. It&#8217;s a bit like <a href="http://sethgodin.typepad.com/">Seth Godin</a> once pointed out in <a href="http://video.google.com/videoplay?docid=-6909078385965257294">a very interesting presentation at Google</a>. Godin shocked his listeners by telling them something both harsh and true: <em>nobody cares about your product</em>. I believe he later qualified the statement &#8211; obviously a lot of people do care about Google&#8217;s products &#8211; but in assuming a complete lack of interest and &#8220;passion&#8221; on the side of customers regarding the phone service, dog food or toilet paper that you sell, you&#8217;re usually on the safe side. And the same largely holds true for companies. If wireless provider X is reliable and moderately priced, will I actively seek out X CEO&#8217;s blog to add my praise? Not too likely. But once things go wrong &#8211; once I&#8217;m frustrated and annoyed and quite sure that nobody is doing anything at all about my problem &#8211; <strong>then</strong> I&#8217;m going to post a comment on the company blog and make sure that I&#8217;m heard.</p>
<p>Silence leaves a barn door open for interpretation. Explaining and apologizing are basic social abilities &#8211; a lack of them indicates that you don&#8217;t understand how interpersonal interaction works, or (even worse), that you understand quite well but don&#8217;t care.</p>
<p>Mr. Dunstone didn&#8217;t realize that he was saying a whole lot by not saying anything. Don&#8217;t make that mistake.</p>
]]></content:encoded>
			<wfw:commentRss>http://corpblawg.ynada.com/2006/11/23/ducking-out-when-it-counts/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dissecting Robert Scoble (2)</title>
		<link>http://corpblawg.ynada.com/2006/10/07/dissecting-robert-scoble-2</link>
		<comments>http://corpblawg.ynada.com/2006/10/07/dissecting-robert-scoble-2#comments</comments>
		<pubDate>Sat, 07 Oct 2006 14:13:31 +0000</pubDate>
		<dc:creator>Cornelius</dc:creator>
				<category><![CDATA[Corporate Blogging]]></category>
		<category><![CDATA[Robert Scoble]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/10/07/dissecting-robert-scoble-2</guid>
		<description><![CDATA[As promised earlier, today I&#8217;m going to look at how Robert Scoble&#8217;s blog differs from other corporate blogs, and from blogs in general (apologies for the delay, this should have been up two days ago).
The earlier entry focused on a number of language-related statistics: word length, sentence length, words per post etc. In this second [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://corpblawg.ynada.com/2006/10/02/dissecting-robert-scoble">As promised earlier</a>, today I&#8217;m going to look at how <a href="http://scobleizer.wordpress.com/">Robert Scoble&#8217;s blog</a> differs from other corporate blogs, and from blogs in general (apologies for the delay, this should have been up two days ago).</p>
<p>The earlier entry focused on a number of language-related statistics: word length, sentence length, words per post etc. In this second step, I want to look at the distribution of individual words in the three different collections analyzed and draw some (lofty) conclusions based on the results.</p>
<p>Here are the top ten most frequent words for Scobleizer, the corporate blogs collection and the random blog comparison group:</p>
<p><strong>Scobleizer</strong><br />
Rank Word Frequency<br />
1 THE 625<br />
2 TO 442<br />
3 A 431<br />
4 <strong>I </strong>414<br />
5 AND 332<br />
6 OF 313<br />
7 IS 255<br />
8 THAT 243<br />
9 ON 221<br />
10 IN 175</p>
<p><strong>Corporate Blogs</strong><br />
Rank Word Frequency<br />
1 THE 35432<br />
2 TO 19714<br />
3 AND 17692<br />
4 A 16457<br />
5 OF 16154<br />
6 IN 11110<br />
7 IS 8475<br />
8 THAT 7819<br />
9 <strong>I</strong> 7342<br />
10 FOR 7220</p>
<p><strong>Random Blogs Comparison Group</strong><br />
Rank Word Frequency<br />
1 THE 4374<br />
2 TO 2985<br />
3 AND 2975<br />
4 <strong>I</strong> 2951<br />
5 OF 2097<br />
6 A 2025<br />
7 IN 1335<br />
8 YOU 1146<br />
9 THAT 1120<br />
10 <strong>MY</strong> 1065</p>
<p>At first glance, you&#8217;re likely to think that the three lists look very alike. This is not unusual in any way &#8211; in virtually any given English text &#8220;THE&#8221; will rank at number 1, whether you are looking at the Bible or <a href="http://www.amazon.com/Personal-Finance-Dummies-Fourth-Tyson/dp/0764525905/">Personal Finance for Dummies</a>. The same is the case with common function words such as prepositions, which form the basic building blocks of pretty much any text you can come across.</p>
<p>An interesting variation that I want to focus on for the moment is the distribution of the personal pronoun &#8220;I&#8221; and the possessive determiner &#8220;MY&#8221;. Both for Scobleizer and the Random Blog Comparison Group &#8220;I&#8221; ranks at number 4, well ahead of any other pronouns (for example &#8220;WE&#8221;). In the Corporate Blogs Collection &#8220;I&#8221; is at rank number 9, making it significantly less frequent. Further down the list, &#8220;MY&#8221; ranks at 14 in Scobleizer and at 28 in the Corporate Blogs Collection. Consequently, &#8220;WE&#8221; ranks higher in Corp. Blogs than it does it the other two collections.</p>
<p>Big surprise there, you might think. Obviously Scoble speaks only for himself, thus he is unlikely to use &#8220;WE&#8221; as frequently as it is used in blogs on corporate responsibility or policy, most of which are authored by a team of people. Even in those cases where there is just one author, he or she often prefers the corporate &#8220;WE&#8221;, especially when the person in question is an executive. And of course there&#8217;s the possibility of largely writing without a personal agent. What is intriguing to me, however, is just <em>how close</em> Scoble is to the Random Blogs Group in regards to &#8220;I&#8221;-use. The Random Blogs Group largely consists of blogs written by teenagers, housewives, activists and other private individuals. As with their writing, the question of personal involvement is always relevant in Scoble&#8217;s blogging &#8211; it all relates to him as an individual in some way. I find it likely that this level of involvement in turn engages his readers more strongly than a less personal (that is, &#8220;self-centric&#8221;) approach would. Telling others about yourself serves a social function; it allows them to empathize with you, to better understand your motifs. &#8220;Talking about yourself&#8221; does not necessarily always mean relating thoughts or emotions, though. Scoble very often describes <strong>where</strong> he is and <strong>what</strong> he is doing because this gives his readers a better understanding of <strong>who</strong> he is, which allows them to better judge whether they value his opinion on whatever gadget, trend or company he then proceeds to discuss. He makes a conscious effort to overcome the decisive asymmetries in the relationship with his readers: the fact that they aren&#8217;t in the same place at the same time as he is. When you&#8217;re having a chat with your friend, <strong>all or some</strong> of the following apply:</p>
<p>- you are physically in the same place, at the same time</p>
<p>- you can hear the other person&#8217;s voice</p>
<p>- you can see the other person</p>
<p>- the other person is actively addressing you</p>
<p>- you can immediately respond to what he or she is saying</p>
<p>In a real-life, face-to-face conversation all of these points usually apply. In a technically mediated interaction, whether it&#8217;s texting on AIM or talking on the phone, normally some (but not all) criteria are applicable. The more of them are, the closer the interaction resembles a &#8220;real&#8221; conversation, simply because a real-life conversation has all of these characteristics. Notice how blogs are different. Only the last point works &#8211; you can respond to a blog, but not quite immediately. A blog author is very unlikely to exclusively address just one other person; the readership is usually plural and largely unknown.</p>
<p>So what does Scoble do to overcome these limitations? He tells you where he is and what he&#8217;s doing to make the kind of communication between him and his readers seem more like a conversation. Of course you could argue that it really <em>is</em> a conversation since you can respond to him &#8211; and you&#8217;d be right -, but he aims to overcome the other impairments as well. The innovation here is that Scoble doesn&#8217;t pretend to address his readers directly (unless he really is responding to another blogger) since he doesn&#8217;t really know who they are individually. Instead he focuses on his part of the equation by making sure that you know <em>where he&#8217;s coming from</em> and <em>where he&#8217;s going with something</em> â€“ both physically and metaphorically speaking.</p>
<p>While the figures cited above are pretty vague indicators which should not be over-interpreted, I think they support the basic idea that blogs can function as time-delayed conversations and are naturally used in that manner by individuals. When organizations blog they are confronted with their inherent inability to have conversations in the same way that individuals do. The options are thus to either let individuals speak for the company â€“ which is risky for a plethora of reasons â€“ or to (mis)use blogs as a broadcast medium. I&#8217;m not even saying that the latter can&#8217;t work, just that people are likely to be very critical of such a usage, because they expect blogs to work differently.</p>
<p>One thing to always keep in mind is that you&#8217;re not real to your readers unless you have a face, name, identity and physical location. We like to think that we can relate to abstractions just as easily as we relate to concrete things, but our instincts often say otherwise.</p>
]]></content:encoded>
			<wfw:commentRss>http://corpblawg.ynada.com/2006/10/07/dissecting-robert-scoble-2/feed</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Dissecting Robert Scoble</title>
		<link>http://corpblawg.ynada.com/2006/10/02/dissecting-robert-scoble</link>
		<comments>http://corpblawg.ynada.com/2006/10/02/dissecting-robert-scoble#comments</comments>
		<pubDate>Mon, 02 Oct 2006 21:46:24 +0000</pubDate>
		<dc:creator>Cornelius</dc:creator>
				<category><![CDATA[Corporate Blogging]]></category>
		<category><![CDATA[Robert Scoble]]></category>
		<category><![CDATA[Thesis]]></category>

		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/10/02/dissecting-robert-scoble</guid>
		<description><![CDATA[Disclaimer: No bloggers were harmed in the course of this experiment.
As I&#8217;ve hinted at in the past, I&#8217;m in the process of building a textual database that contains thousands of posts culled from the RSS feeds of about a hundred corporate blogs, plus a comparison group of several â€œmiscellaneousâ€ blogs randomly picked through blogger.com and [...]]]></description>
			<content:encoded><![CDATA[<p style="margin-bottom: 0in"><strong>Disclaimer:</strong> No bloggers were harmed in the course of this experiment.</p>
<p style="margin-bottom: 0in">As I&#8217;ve hinted at in the past, I&#8217;m in the process of building a textual database that contains thousands of posts culled from the RSS feeds of about a hundred corporate blogs, plus a comparison group of several â€œmiscellaneousâ€ blogs randomly picked through <a href="http://www.blogger.com/start">blogger.com</a> and <a href="http://www.blog.com/">blog.com</a>. The corpus currently has a little under 800,000 words and is expected to reach a round million words (or tokens) in about two to three weeks time.</p>
<p style="margin-bottom: 0in">So far, I&#8217;m just calculating a few very basic statistics: post word count, post sentence count and average word/sentence/post length, along with a top 100 list of the most frequent words. Though these are very basic figures, they nevertheless give a few interesting clues about the sources in question, especially when you compare one collection of blogs with another.</p>
<p style="margin-bottom: 0in">My test subject today will be <a href="http://en.wikipedia.org/wiki/Robert_Scoble">Robert Scoble</a>&#8217;s blog, <a href="http://scobleizer.wordpress.com/">Scobleizer</a>. I&#8217;ll compare it to a) a large collection of other company blogs and b) a collection of randomly chosen non-corporate blogs. My reasons for picking Robert are pretty unspectacular. I happened to add him to the database fairly early on so that now I have a reasonable amount of data. Also, his immense popularity should make for some interesting results&#8230; note that I say â€œinterestingâ€ and not conclusive â€“ a few language statistics don&#8217;t equate to the recipe for the Scoble Special Sauce of Blogging Fame. Anyway, let&#8217;s crunch a few numbers.</p>
<p style="margin-bottom: 0in">
<p style="margin-bottom: 0in"><font size="4"><strong>Scobleizer</strong></font></p>
<p style="margin-bottom: 0in">Posts: 327</p>
<p style="margin-bottom: 0in">First Post to Last Post (FPLP): 2 August 2006, 03:26 &#8211; 30 September 2006, 22:07</p>
<p style="margin-bottom: 0in">Tokens / Types (Ratio): 17014 / 3743 (4.55)</p>
<p style="margin-bottom: 0in">Sentences (SC): 1950</p>
<p style="margin-bottom: 0in">Average Word Length (AWL): 4.9</p>
<p style="margin-bottom: 0in">Average Sentence Length (ASL): 10.1</p>
<p style="margin-bottom: 0in">Average Words per Post (AWpP): 52.9*</p>
<p style="margin-bottom: 0in">* not relevant because Scoble&#8217;s RSS doesn&#8217;t include complete posts but only summaries (the first 56 words)</p>
<p style="margin-bottom: 0in">
<p style="margin-bottom: 0in"><strong><font size="4">Corporate Blogs</font></strong></p>
<p style="margin-bottom: 0in">(Blogs: 107)</p>
<p style="margin-bottom: 0in">Posts: 4443</p>
<p style="margin-bottom: 0in">First Post to Last Post (FPLP): 2 May 2005, 00:00 &#8211; 2 October 2006, 00:50</p>
<p style="margin-bottom: 0in">Tokens / Types (Ratio): 667969 / 62230 (10.73)</p>
<p style="margin-bottom: 0in">Sentences (SC): 44350</p>
<p style="margin-bottom: 0in">Average Word Length (AWL): 5.5</p>
<p style="margin-bottom: 0in">Average Sentence Length (ASL): 15.9</p>
<p style="margin-bottom: 0in">Average Words per Post (AWpP): 155.1</p>
<p style="margin-bottom: 0in">
<p style="margin-bottom: 0in"><strong><font size="4">Random Blogs Comparison Group</font></strong></p>
<p style="margin-bottom: 0in">(Blogs: 18)</p>
<p style="margin-bottom: 0in">Posts: 576</p>
<p style="margin-bottom: 0in">First Post to Last Post (FPLP): 17 November 2004, 03:17 &#8211; 2 October 2006, 00:48</p>
<p style="margin-bottom: 0in">Tokens / Types (Ratio): 105253 / 16979 (6.2)</p>
<p style="margin-bottom: 0in">Sentences (SC): 10335</p>
<p style="margin-bottom: 0in">Average Word Length (AWL): 5.1</p>
<p style="margin-bottom: 0in">Average Sentence Length (ASL): 10.8</p>
<p style="margin-bottom: 0in">Average Words per Post (AWpP): 184.5</p>
<p style="margin-bottom: 0in">
<p style="margin-bottom: 0in"><strong>The stats</strong></p>
<p style="margin-bottom: 0in">The first thing to note is that the three collections differ significantly in terms of size. The Scobleizer collection only has a size of 17,014 tokens (words), while both the corporate blog collection (667,969 tokens) and the random blogs comparison group (105,253 tokens) are much larger. This has strong implications for the accuracy of the figures, as a larger sample is obviously more accurate. The posts indexed in my database are not the total of posts made in those blogs, but only those which have been recorded since I began indexing a few months ago. Some entries date back several years, which is simply due to the fact that some of the RSS feeds which were used go back that far.</p>
<p style="margin-bottom: 0in">You might be wondering what on earth <em>types</em> are. Don&#8217;t worry, it&#8217;s really simple: while tokens are all words in a text, types are all <em>unique</em> words. So while the sentence <em>&#8220;The cat ate the mouse&#8221;</em> has 5 tokens, it only has 4 types because<em> &#8220;the&#8221;</em> occurs twice. The token-type-ratio for that sentence would be 5:4, or 1.25. As you can imagine, a long text will have a significantly larger number of tokens than types, since function words (pronouns, articles, prepositions etc) are re-used all the time, while lexical words (something like <em>&#8220;blog&#8221;</em>, <em>&#8220;Google&#8221;</em> or <em>&#8220;greenish&#8221;</em>) occur a lot less often.</p>
<p style="margin-bottom: 0in">The other statistics are pretty straight-forward: the number of total posts in the database, the time span from the first to the last post, the total number of sentences and three averages: average word length (AWL), average sentence length (ASL) and average words per post (AWpP). AWL refers to the number of characters in a word, while ASL in turn refers to the number of words in a sentence. As mentioned above, Scoble&#8217;s AWpP value should be ignored, since his RSS feed does not include complete entries but only summaries.</p>
<p style="margin-bottom: 0in">
<p style="margin-bottom: 0in"><strong>A cautious interpretation</strong></p>
<p style="margin-bottom: 0in">The comparison shows that Robert Scoble uses shorter words and sentences than both the blogs in the random comparison group and those in the corporate blogging collection. Words are only slightly shorter (Scoble: 4.9; Corp.blogs: 5.5; Random blogs: 5.1) but it should be noted that variation in this category is normally not very strong, thus the difference between Scoble and the corporate blogs seems notable. The differences in sentence length (10.1; 15.9; 10.8) are even more pronounced: on average, the other corporate blogs have much longer sentences than Scoble, who is again a little below the average value of the random blogs. Finally, it cannot be determined if Scoble&#8217;s posts are shorter than those in the other two collections (52.9*; 155.1; 184.5) because his RSS syndicates only summaries, though my personal bet would be that they are. This is also the only category where the random group scores higher than the corporates.</p>
<p style="margin-bottom: 0in">So what does this mean? In one sentence, it means that on average Robert Scoble seems to use shorter sentences than most other corporate bloggers, and that the words he uses are also significantly shorter. Looking further, it appears that Scoble&#8217;s style â€“ only speaking in terms of word and sentence length â€“ is closer to that of non-corporate bloggers. However, these numerical statistics aren&#8217;t terribly exciting by themselves, which is why tomorrow I&#8217;ll take a peek at a list of the most frequently used words in our three source collections.</p>
<p style="margin-bottom: 0in">(to be continued)</p>
<p style="margin-bottom: 0in"><strong>Edit:</strong> My claim that Robert&#8217;s RSS feed does not contain full texts is bogus &#8211; my indexing tool was simply looking in the wrong place. I&#8217;ll correct the problem asap. Mea culpa.</p>
]]></content:encoded>
			<wfw:commentRss>http://corpblawg.ynada.com/2006/10/02/dissecting-robert-scoble/feed</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>
