<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Screenshots of the Corporate Blogging Corpus</title>
	<atom:link href="http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus/feed" rel="self" type="application/rss+xml" />
	<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus</link>
	<description>Cornelius Puschmann on corporate and institutional blogging, linguistics, open access and other things that interest him.</description>
	<pubDate>Sat, 22 Nov 2008 05:32:52 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.6.2</generator>
		<item>
		<title>By: Cornelius</title>
		<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-9099</link>
		<dc:creator>Cornelius</dc:creator>
		<pubDate>Mon, 18 Dec 2006 14:58:10 +0000</pubDate>
		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-9099</guid>
		<description>Hey Dan, sorry for not getting back to you! You are exactly right, I went from something else because I couldn't get the results I was looking for. I ended up using TreeTagger (http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/), a very nice statistical tagger from the University of Stuttgart. It works as follows: I write a text from my database to a file buffer, then call TreeTagger from PHP  (it's a perl binary), let it tag the text and then write the result back to the DB. It is quite precise and very fast and I'm really happy I managed to integrate it.

Kudos to you for your efforts regaring a PHP-based tagger. That would probably have been the cleanest solution, but I'm happy with the way it works for now.</description>
		<content:encoded><![CDATA[<p>Hey Dan, sorry for not getting back to you! You are exactly right, I went from something else because I couldn&#8217;t get the results I was looking for. I ended up using TreeTagger (http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/), a very nice statistical tagger from the University of Stuttgart. It works as follows: I write a text from my database to a file buffer, then call TreeTagger from PHP  (it&#8217;s a perl binary), let it tag the text and then write the result back to the DB. It is quite precise and very fast and I&#8217;m really happy I managed to integrate it.</p>
<p>Kudos to you for your efforts regaring a PHP-based tagger. That would probably have been the cleanest solution, but I&#8217;m happy with the way it works for now.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Zarrella</title>
		<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-9098</link>
		<dc:creator>Dan Zarrella</dc:creator>
		<pubDate>Mon, 18 Dec 2006 14:47:56 +0000</pubDate>
		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-9098</guid>
		<description>I take it you didn't need to use my script, looks pretty cool anyways.</description>
		<content:encoded><![CDATA[<p>I take it you didn&#8217;t need to use my script, looks pretty cool anyways.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: CorpBlawg &#187; We mourn the passing of these corporate blogs</title>
		<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-8613</link>
		<dc:creator>CorpBlawg &#187; We mourn the passing of these corporate blogs</dc:creator>
		<pubDate>Fri, 15 Dec 2006 16:17:08 +0000</pubDate>
		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-8613</guid>
		<description>[...] The only one on the list that has not just been abandoned, but deleted. See archive.org for proof of its passing. No worries Dan, your five posts are safe for posterity in my indestructible linguistic database. Your blog on &#8220;HP&#8217;s industry leading support services which provide innovative support of HP products and also help customers manage their IT environment operations more efficiently across all vendor platforms&#8221; may be gone, but it is not forgotten. And believe me, in my statistics all those juicy adjectives make a nice dent under &#8220;suasive language&#8221;. [...]</description>
		<content:encoded><![CDATA[<p>[...] The only one on the list that has not just been abandoned, but deleted. See archive.org for proof of its passing. No worries Dan, your five posts are safe for posterity in my indestructible linguistic database. Your blog on &#8220;HP&#8217;s industry leading support services which provide innovative support of HP products and also help customers manage their IT environment operations more efficiently across all vendor platforms&#8221; may be gone, but it is not forgotten. And believe me, in my statistics all those juicy adjectives make a nice dent under &#8220;suasive language&#8221;. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Cass</title>
		<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-6997</link>
		<dc:creator>John Cass</dc:creator>
		<pubDate>Fri, 24 Nov 2006 18:52:27 +0000</pubDate>
		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-6997</guid>
		<description>interesting results.</description>
		<content:encoded><![CDATA[<p>interesting results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Cornelius</title>
		<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-6644</link>
		<dc:creator>Cornelius</dc:creator>
		<pubDate>Mon, 20 Nov 2006 09:54:05 +0000</pubDate>
		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-6644</guid>
		<description>Thanks for stopping by, John. There's a great deal of variation in terms of style, but overall a significant number of business bloggers writes in a very involved and immediate way. This is signalled by a high frequency of personal pronouns and certain constructions that are not traditionally used in written language, such as "I think" or "I guess" to qualify what you're saying, directly addressing the readers ("What's your take on this?"), incomplete sentences, informal expressions, etc. However, they tend to use longer and more complex sentences than private bloggers and variation in spelling ("c u l8er") is not too common.

I'll be looking at stylistic differences between individual blogs soon... stay tuned.</description>
		<content:encoded><![CDATA[<p>Thanks for stopping by, John. There&#8217;s a great deal of variation in terms of style, but overall a significant number of business bloggers writes in a very involved and immediate way. This is signalled by a high frequency of personal pronouns and certain constructions that are not traditionally used in written language, such as &#8220;I think&#8221; or &#8220;I guess&#8221; to qualify what you&#8217;re saying, directly addressing the readers (&#8221;What&#8217;s your take on this?&#8221;), incomplete sentences, informal expressions, etc. However, they tend to use longer and more complex sentences than private bloggers and variation in spelling (&#8221;c u l8er&#8221;) is not too common.</p>
<p>I&#8217;ll be looking at stylistic differences between individual blogs soon&#8230; stay tuned.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Cass</title>
		<link>http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-6632</link>
		<dc:creator>John Cass</dc:creator>
		<pubDate>Mon, 20 Nov 2006 06:43:47 +0000</pubDate>
		<guid isPermaLink="false">http://corpblawg.ynada.com/2006/11/08/screenshots-of-the-corporate-blogging-corpus#comment-6632</guid>
		<description>great post, any interesting analysis you can tell us, without revealing too much of course?</description>
		<content:encoded><![CDATA[<p>great post, any interesting analysis you can tell us, without revealing too much of course?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
