<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
    "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
	<meta http-equiv="content-type" content="text/html; charset=utf-8" />
	<meta http-equiv="Expires" content="Mon, 01 Jan 1990 01:00:00 GMT" />
	<meta name="generator" content="Squib/0.4.0.282" />
	<meta name="author" content="Matt Mower" />
	<meta name="keywords" content="matt mower,london,paoga,squib" />
	<meta name="description" content="Curiouser and Curiouser is the weblog of Matt Mower a London based technical marketing manager for software company PAOGA. In his spare time Matt Mower enjoyes developing software applications including this weblog application Squib." />
	<title>Curiouser and Curiouser!</title>
	<link href="http://matt.blogs.it/rss.xml" rel="alternate" title="RSS" type="application/rss+xml" />
<link href="http://matt.blogs.it/themes/fragen3.14/styles/theme_candc.css" media="screen" rel="stylesheet" type="text/css" />

</head>
<body>
<div id="page">
<div id="banner">
    <h1>Curiouser and Curiouser!</h1>
    <em>'Where shall I begin, please your Majesty?' He asked. 'Begin at the beginning,'
the King said, very gravely, 'and go on till you come to the end: then stop.'</em>
</div>
<div id="nav">
    
<div class="box" id="box_about">
<p><strong>About</strong></p>

<p>Wherein Matt Mower (aka rubymatt on FreeNode) rambles about technology, the love of a good MacTop, ruby coding, rails, topics, knowledge management and learning, and politics.</p>
</div>


    
<div class="box" id="box_navigation">
<p><strong>Navigation</strong></p>

<ul>
<li><a href="http://matt.blogs.it/">Home</a></li>
<li><a href="http://matt.blogs.it/all_posts.html">All Posts by Title</a></li>
<li><a href="http://matt.blogs.it/all_archives.html">Monthly Archives</a></li>
<li><a href="http://matt.blogs.it/topics/index.html">Topics</a></li>
</ul>
</div>


    
    
<div class="box" id="box_blogroll">
<strong>Blogroll</strong><ul class="blogroll"><li><a class="orangeButton" href="http://www.tapestrycomics.com/dilbert.xml">XML</a> <a href="http://www.dilbert.com/">Dilbert</a></li>
<li><a class="orangeButton" href="http://www.tapestrycomics.com/getfuzzy.xml">XML</a> <a href="http://www.comics.com//comics/getfuzzy/">Get Fuzzy</a></li>
<li><a class="orangeButton" href="http://www.tapestrycomics.com/liberty.xml">XML</a> <a href="http://www.comics.com/creators/liberty/">Liberty Meadows</a></li>
<li><a class="orangeButton" href="http://www.phoenyx.net/feeds/comics/hedge.rdf">XML</a> <a href="http://www.comics.com/comics/hedge/">Over the Hedge</a></li>
<li><a class="orangeButton" href="http://www.tapestrycomics.com/peanuts.xml">XML</a> <a href="http://www.comics.com//comics/peanuts/">Peanuts</a></li>
<li><a class="orangeButton" href="http://atheos.de/funnies/pvp.rdf">XML</a> <a href="http://www.pvponline.com/">PvP Online</a></li>
<li><a class="orangeButton" href="http://rss.xiffy.nl/xml.php?channel=391">XML</a> <a href="http://www.userfriendly.org/">User Friendly the Comic Strip. by Illiad</a></li>
<li><a class="orangeButton" href="http://www.tapestrycomics.com/wizardofid.xml">XML</a> <a href="http://www.comics.com/creators/wizardofid/">Wizard of Id</a></li>
<li><a class="orangeButton" href="http://matt.blogs.it/rss.xml ">XML</a> <a href="http://matt.blogs.it/">Curiouser and Curiouser!</a></li>
<li><a class="orangeButton" href="http://www.pubsub.com/site_stats_feed.php?site=matt.blogs.it">XML</a> <a href="http://www.pubsub.com/linkcounts.php">PubSub PubStats for matt.blogs.it</a></li>
<li><a class="orangeButton" href="http://www.technorati.com/watchlists/rss.html?wid=2122">XML</a> <a href="http://www.technorati.com/search/matt.blogs.it">Technorati Search for: Curiouser and curiouser!</a></li>
<li><a class="orangeButton" href="http://www.terryfrazier.com/weblog/rss.xml">XML</a> <a href="http://www.terryfrazier.com/weblog/index">b.cognosco</a></li>
<li><a class="orangeButton" href="http://www.bethlet.net/atom.xml">XML</a> <a href="http://www.bethlet.net/">bethlet.net</a></li>
<li><a class="orangeButton" href="http://del.icio.us/rss/devzero/osx">XML</a> <a href="http://del.icio.us/devzero/osx">del.icio.us/devzero/osx</a></li>
<li><a class="orangeButton" href="http://del.icio.us/rss/tag/osx">XML</a> <a href="http://del.icio.us/tag/osx">del.icio.us/tag/osx</a></li>
<li><a class="orangeButton" href="http://weblog.infoworld.com/foster/rss.xml">XML</a> <a href="http://weblog.infoworld.com/foster/">Ed Foster's Radio Weblog</a></li>
<li><a class="orangeButton" href="http://blog.grahamsadd.com/atom.xml">XML</a> <a href="http://blog.grahamsadd.com/">Graham Sadd's Weblog</a></li>
<li><a class="orangeButton" href="http://feeds.feedburner.com/letTheGoodTimesRollByGuyKawasaki">XML</a> <a href="http://blog.guykawasaki.com/">Let the Good Times Roll by Guy Kawasaki</a></li>
<li><a class="orangeButton" href="http://blog.mathemagenic.com/rss.xml">XML</a> <a href="http://blog.mathemagenic.com/">Mathemagenic</a></li>
<li><a class="orangeButton" href="http://maxblumberg.typepad.com/dailymusings/atom.xml">XML</a> <a href="http://maxblumberg.typepad.com/dailymusings/">Max Blumberg Positioning Game</a></li>
<li><a class="orangeButton" href="http://www.drmartinhall.com/index.rdf">XML</a> <a href="http://www.drmartinhall.com/">Minessence -- Doc Martin's Musings</a></li>
<li><a class="orangeButton" href="http://theobvious.typepad.com/blog/atom.xml">XML</a> <a href="http://theobvious.typepad.com/blog/">The Obvious?</a></li>
<li><a class="orangeButton" href="http://onlyagame.typepad.com/only_a_game/atom.xml">XML</a> <a href="http://onlyagame.typepad.com/only_a_game/">Only a Game</a></li>
<li><a class="orangeButton" href="http://paolo.evectors.it/rss.xml">XML</a> <a href="http://paolo.evectors.it/">Paolo Valdemarin: Paolo's Weblog</a></li>
<li><a class="orangeButton" href="http://bash.org/xml/">XML</a> <a href="http://www.bash.org">QDB: Quote Database</a></li>
<li><a class="orangeButton" href="http://ross.typepad.com/blog/index.rdf">XML</a> <a href="http://ross.typepad.com/blog/">Ross Mayfield's Weblog</a></li>
<li><a class="orangeButton" href="http://www.scripting.com/rss.xml">XML</a> <a href="http://www.scripting.com/">Scripting News</a></li>
<li><a class="orangeButton" href="http://www.myelin.co.nz/post/rss.xml">XML</a> <a href="http://www.myelin.co.nz/post/">Second p0st</a></li>
<li><a class="orangeButton" href="http://www.synesthesia.co.uk/blog/feed/rss2/">XML</a> <a href="http://www.synesthesia.co.uk/blog">Synesthesia</a></li>
<li><a class="orangeButton" href="http://the.taoofmac.com/space/RecentChanges?filter=blog&amp;amp;format=rss">XML</a> <a href="http://the.taoofmac.com/space">The Tao of Mac</a></li>
<li><a class="orangeButton" href="http://anjo.blogs.com/metis/index.rdf">XML</a> <a href="http://anjo.blogs.com/metis/">Anjo Anjewierden</a></li>
<li><a class="orangeButton" href="http://www.beyondbullets.com/index.rdf">XML</a> <a href="http://www.beyondbullets.com/">beyond bullets</a></li>
<li><a class="orangeButton" href="http://bps-research-digest.blogspot.com/atom.xml">XML</a> <a href="http://bps-research-digest.blogspot.com">BPS Research Digest</a></li>
<li><a class="orangeButton" href="http://chocnvodka.blogware.com/blog/index.xml">XML</a> <a href="http://chocnvodka.blogware.com/blog">Chocolate and Vodka</a></li>
<li><a class="orangeButton" href="http://feeds.feedburner.com/Corporatebloggingblog">XML</a> <a href="http://www.corporateblogging.info/">CorporateBloggingBlog</a></li>
<li><a class="orangeButton" href="http://del.icio.us/rss/tag/lifehacks">XML</a> <a href="http://del.icio.us/tag/lifehacks">del.icio.us/tag/lifehacks</a></li>
<li><a class="orangeButton" href="http://www.firstadopter.com/index.rdf">XML</a> <a href="http://www.firstadopter.com/">FirstAdopter.com</a></li>
<li><a class="orangeButton" href="http://homepage.mac.com/dave_rogers/news.rss">XML</a> <a href="http://homepage.mac.com/dave_rogers/">Groundhog Day</a></li>
<li><a class="orangeButton" href="http://cgi.pbs.org/cgi-registry/cringely/cringelyrdf.pl">XML</a> <a href="http://www.pbs.org/cringely/">I, Cringely @ PBS.org</a></li>
<li><a class="orangeButton" href="http://marktsinfoblog.blogspot.com/atom.xml">XML</a> <a href="http://marktsinfoblog.blogspot.com">Mark T's information blog</a></li>
<li><a class="orangeButton" href="http://maxblumberg.typepad.com/maxwellbeing/index.rdf">XML</a> <a href="http://maxblumberg.typepad.com/maxwellbeing/">MaxWellBeing</a></li>
<li><a class="orangeButton" href="http://dev.metavalues.com/metavalues/timeline?daysback=90&amp;amp;max=50&amp;amp;wiki=on&amp;amp;ticket=on&amp;amp;changeset=on&amp;amp;milestone=on&amp;amp;format=rss">XML</a> <a href="http://bidwell.textdrive.com:9009/metavalues/timeline">MetaValues: Timeline</a></li>
<li><a class="orangeButton" href="http://blog.monkeymethods.org/atom.xml">XML</a> <a href="http://blog.monkeymethods.org/">monkey methods</a></li>
<li><a class="orangeButton" href="http://googleblog.blogspot.com/atom.xml">XML</a> <a href="http://googleblog.blogspot.com">Official Google Blog</a></li>
<li><a class="orangeButton" href="http://presentationzen.blogs.com/presentationzen/atom.xml">XML</a> <a href="http://presentationzen.blogs.com/presentationzen/">Presentation Zen</a></li>
<li><a class="orangeButton" href="http://simon.incutio.com/syndicate/rss1.0">XML</a> <a href="http://simon.incutio.com/">Simon Willison's Weblog</a></li>
<li><a class="orangeButton" href="http://www.unstruct.org/wp-rdf.php">XML</a> <a href="http://www.unstruct.org">unstruct.org</a></li>
<li><a class="orangeButton" href="http://www.wingedpig.com/index.rdf">XML</a> <a href="http://www.wingedpig.com/">wingedpig.com - Mark Fletcher's Blog</a></li>
<li><a class="orangeButton" href="http://feeds.feedburner.com/Wonderland">XML</a> <a href="http://crystaltips.typepad.com/wonderland/">Wonderland</a></li>
<li><a class="orangeButton" href="http://psychcentral.com/blog/feed/">XML</a> <a href="http://psychcentral.com/blog">World of Psychology</a></li>
<li><a class="orangeButton" href="http://www.slash7.com/xml/rss/feed.xml">XML</a> <a href="http://www.slash7.com/">(24)slash7</a></li>
<li><a class="orangeButton" href="http://www.artima.com/rubycs/feeds/rubycs.rss">XML</a> <a href="http://www.artima.com/">Articles published in Ruby Code &amp; Style</a></li>
<li><a class="orangeButton" href="http://www.chadfowler.com/index.cgi?rss">XML</a> <a href="http://www.chadfowler.com/index.cgi">ChadFowler.com</a></li>
<li><a class="orangeButton" href="http://feeds.feedburner.com/curthibbs">XML</a> <a href="http://blog.curthibbs.us/articles">Curt's Comments</a></li>
<li><a class="orangeButton" href="http://eigenclass.org/hiki.rb?c=rss;tags=blog">XML</a> <a href="http://eigenclass.org/hiki.rb?c=recent">Eigenclass (blog)</a></li>
<li><a class="orangeButton" href="http://www.livejournal.com/users/drbrain/data/rss">XML</a> <a href="http://www.livejournal.com/users/drbrain/">Eric Hodel</a></li>
<li><a class="orangeButton" href="http://tomcopeland.blogs.com/juniordeveloper/rss.xml">XML</a> <a href="http://tomcopeland.blogs.com/juniordeveloper/">Junior developer</a></li>
<li><a class="orangeButton" href="http://www.koziarski.net/feed/atom/">XML</a> <a href="http://www.koziarski.net">Koz Speaks</a></li>
<li><a class="orangeButton" href="http://www.loudthinking.com/index.rdf">XML</a> <a href="http://www.loudthinking.com/">Loud Thinking</a></li>
<li><a class="orangeButton" href="http://www.mad4milk.net/feeds/tag/moo.fx/weblog">XML</a> <a href="http://www.mad4milk.net/tag/weblog/moo.fx">mad4milk feed for tag moo.fx in weblog section</a></li>
<li><a class="orangeButton" href="http://www.magpiebrain.com/index_full.xml">XML</a> <a href="http://www.magpiebrain.com/">magpiebrain</a></li>
<li><a class="orangeButton" href="http://mir.aculo.us/xml/rss/feed.xml">XML</a> <a href="http://mir.aculo.us/articles">mir.aculo.us</a></li>
<li><a class="orangeButton" href="http://jroller.org/rss/obie">XML</a> <a href="http://jroller.com/page/obie">Obie Fernandez</a></li>
<li><a class="orangeButton" href="http://outside-thoughts.octopod.info/xml/atom/feed.xml">XML</a> <a href="http://outside-thoughts.octopod.info/">Octoblog</a></li>
<li><a class="orangeButton" href="http://blog.zenspider.com/index.rdf">XML</a> <a href="http://blog.zenspider.com/">Polishing Ruby</a></li>
<li><a class="orangeButton" href="http://blogs.pragprog.com/cgi-bin/pragdave.cgi/index.rss">XML</a> <a href="http://blogs.pragprog.com/cgi-bin/pragdave.cgi">PragDave</a></li>
<li><a class="orangeButton" href="http://feeds.feedburner.com/projectionist">XML</a> <a href="http://project.ioni.st/">Projectionist</a></li>
<li><a class="orangeButton" href="http://feeds.feedburner.com/raganwald">XML</a> <a href="http://www.braithwaite-lee.com/weblog/">Raganwald</a></li>
<li><a class="orangeButton" href="http://railsexpress.de/blog/xml/rss20/feed.xml">XML</a> <a href="http://railsexpress.de/blog/">RailsExpress.blog</a></li>
<li><a class="orangeButton" href="http://onestepback.org/gemwatch.rss">XML</a> <a href="">Recent Gems</a></li>
<li><a class="orangeButton" href="http://redhanded.hobix.com/index.xml">XML</a> <a href="http://redhanded.hobix.com">RedHanded</a></li>
<li><a class="orangeButton" href="http://weblog.rubyonrails.com/feed/rss2/">XML</a> <a href="http://weblog.rubyonrails.com/">Riding Rails</a></li>
<li><a class="orangeButton" href="http://rubyweeklynews.org/index.xml">XML</a> <a href="http://www.rubyweeklynews.org">Ruby Weekly News</a></li>
<li><a class="orangeButton" href="http://blog.xeraph.org/feed/rss2/">XML</a> <a href="http://blog.xeraph.org">Slave To The Machine</a></li>
<li><a class="orangeButton" href="http://split-s.blogspot.com/atom.xml">XML</a> <a href="http://split-s.blogspot.com">split-s</a></li>
<li><a class="orangeButton" href="http://techno-weenie.net/blog/?rss=1">XML</a> <a href="http://techno-weenie.net/blog/">techno weenie</a></li>
<li><a class="orangeButton" href="http://tech.rufy.com/feed/rss2/">XML</a> <a href="http://tech.rufy.com">Technoblog</a></li>
<li><a class="orangeButton" href="http://jamis.jamisbuck.org/blog.cgi/index.rss">XML</a> <a href="http://jamis.jamisbuck.org/">the { buckblogs :here }</a></li>
<li><a class="orangeButton" href="http://onestepback.org/index.cgi/index.rss">XML</a> <a href="http://onestepback.org/index.cgi">{ | one, step, back | }</a></li>
<li><a class="orangeButton" href="http://habtm.com/xml/atom/feed.xml">XML</a> <a href="http://habtm.com/">~:caboose</a></li>
<li><a class="orangeButton" href="http://www.decafbad.com/atom.xml">XML</a> <a href="http://www.decafbad.com/">0xDECAFBAD</a></li>
<li><a class="orangeButton" href="http://www.alistapart.com/rss.xml">XML</a> <a href="http://www.alistapart.com/">A List Apart</a></li>
<li><a class="orangeButton" href="http://www.ajaxian.com/index.xml">XML</a> <a href="http://www.ajaxian.com/">Ajaxian</a></li>
<li><a class="orangeButton" href="http://www.backpackit.com/weblog/index.xml">XML</a> <a href="http://backpackit.com/weblog/">Backpack Weblog</a></li>
<li><a class="orangeButton" href="http://blog.monstuff.com/index.xml">XML</a> <a href="http://blog.monstuff.com/">Curiosity is bliss</a></li>
<li><a class="orangeButton" href="http://homepages.ihug.com.au/~naseby/rss.xml">XML</a> <a href="http://homepages.ihug.com.au/~naseby/">David Naseby's World</a></li>
<li><a class="orangeButton" href="http://www.docuverse.com/blog/donpark/rss.xml">XML</a> <a href="http://www.docuverse.com/blog/donpark/">Don Park's Daily Habit</a></li>
<li><a class="orangeButton" href="http://epeus.blogspot.com/atom.xml">XML</a> <a href="http://epeus.blogspot.com">Epeus' epigone</a></li>
<li><a class="orangeButton" href="http://dev.r.tucows.com/blog/index.xml">XML</a> <a href="http://farm.tucows.com/blog">The Farm: The Tucows Developers' Hangout</a></li>
<li><a class="orangeButton" href="http://grahamglass.blogs.com/main/atom.xml">XML</a> <a href="http://grahamglass.blogs.com/main/">Graham Glass, etc.</a></li>
<li><a class="orangeButton" href="http://haoli.dnsalias.com/index.xml">XML</a> <a href="http://haoli.dnsalias.com">h a o l i</a></li>
<li><a class="orangeButton" href="http://hypermetrics.com:3000/xml/rss/feed.xml">XML</a> <a href="http://hypermetrics.com:3000/">Hal-lucinations</a></li>
<li><a class="orangeButton" href="http://www.joelonsoftware.com/rss.xml">XML</a> <a href="http://www.joelonsoftware.com">Joel on Software</a></li>
<li><a class="orangeButton" href="http://weblog.infoworld.com/udell/rss.xml">XML</a> <a href="http://weblog.infoworld.com/udell/">Jon's Radio</a></li>
<li><a class="orangeButton" href="http://martinfowler.com/bliki/bliki.rss">XML</a> <a href="http://martinfowler.com/bliki">Martin Fowler's Bliki</a></li>
<li><a class="orangeButton" href="http://minimsft.blogspot.com/atom.xml">XML</a> <a href="http://minimsft.blogspot.com">Mini-Microsoft</a></li>
<li><a class="orangeButton" href="http://www.redhillconsulting.com.au/blogs/simon/index.xml">XML</a> <a href="http://www.redhillconsulting.com.au/blogs/simon/">My hovercraft is full of eels</a></li>
<li><a class="orangeButton" href="http://blogs.osafoundation.org/news/index.xml">XML</a> <a href="http://www.osafoundation.org/">OSAF News</a></li>
<li><a class="orangeButton" href="http://peterkaminski.com/index.rdf">XML</a> <a href="http://peterkaminski.com/">Peter Kaminski</a></li>
<li><a class="orangeButton" href="http://www.cincomsmalltalk.com/rssBlog/ralph-rss.xml">XML</a> <a href="http://www.cincomsmalltalk.com/userblogs/ralph/blogView">Ralph Johnson - Blog</a></li>
<li><a class="orangeButton" href="http://www.intertwingly.net/blog/index.rss2">XML</a> <a href="http://www.intertwingly.net/blog/">Sam Ruby</a></li>
<li><a class="orangeButton" href="http://37signals.com/svn/index_full.rdf">XML</a> <a href="http://37signals.com/svn/">Signal vs. Noise</a></li>
<li><a class="orangeButton" href="http://mmower.backpackit.com/feed/580c59a670b1f7c852e0901b7976e0e8">XML</a> <a href="http://mmower.backpackit.com/account/start">Backpack</a></li>
<li><a class="orangeButton" href="http://www.choof.org/MT/index.xml">XML</a> <a href="http://www.choof.org/MT/">choof.org</a></li>
<li><a class="orangeButton" href="http://www.idealgovernment.com/index.php/weblog/rss_2.0/">XML</a> <a href="http://www.idealgovernment.com/index.php/weblog/index/">Ideal Government</a></li>
<li><a class="orangeButton" href="http://www.idcorner.org/wp-rss2.php">XML</a> <a href="http://www.idcorner.org">The Identity Corner</a></li>
<li><a class="orangeButton" href="http://www.identityblog.com/rss.xml">XML</a> <a href="http://www.identityblog.com/">Kim Cameron's Identity Weblog</a></li>
<li><a class="orangeButton" href="http://danielsolove.blogspot.com/atom.xml">XML</a> <a href="http://danielsolove.blogspot.com">The Solove Chronicles</a></li>
<li><a class="orangeButton" href="http://www.technorati.com/watchlists/rss.html?wid=64358">XML</a> <a href="http://www.technorati.com/cosmos/search.html?url=paoga">Technorati Search for: paoga</a></li>
<li><a class="orangeButton" href="http://blogs.sun.com/roller/rss/wizidm">XML</a> <a href="http://blogs.sun.com/roller/page/wizidm">Wizard of IdM</a></li>
</ul>
</div>


    
<div class="box" id="box_syndication">
<strong>Syndication</strong>
<div id="syndication">
<ul>
	<li><a class="orangeButton" href="http://matt.blogs.it/rss.xml">XML</a></li>
	<li><script type="text/javascript">eval(unescape('%64%6f%63%75%6d%65%6e%74%2e%77%72%69%74%65%28%27%3c%61%20%68%72%65%66%3d%22%6d%61%69%6c%74%6f%3a%73%65%6c%66%40%6d%61%74%74%6d%6f%77%65%72%2e%63%6f%6d%22%3e%45%6d%61%69%6c%20%4d%65%3c%2f%61%3e%27%29%3b'))</script></li>
</ul>
</div>
</div>


</div>
<div id="wrapper">
	<div id="content">
		<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:ent="http://www.purl.org/NET/ENT/1.0/">
  <channel>
    <title>Curiouser and Curiouser! on bayesian-classification</title>
    <link>http://matt.blogs.it/</link>
    <description>RSS feed for topic bayesian-classification</description>
    <copyright>Copyright 2006 Matt Mower</copyright>
    <generator>Squib/0.1</generator>
    <managingEditor>self@mattmower.com</managingEditor>
    <webMaster>self@mattmower.com</webMaster>
    <language>en-gb</language>
    <item>
      <title>K-Collector &amp; Bayesian filtering</title>
      <pubDate>Sun, 17 Aug 2003 18:59:05 +0100</pubDate>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;A href="http://www.decafbad.com/blog/geek/bayes_agg_one.phtml"&gt;Issues in using SpamBayes to filter news items&lt;/A&gt;. &lt;/P&gt;
&lt;P&gt;Despite a reading &lt;A href="http://www.srijith.net/trinetre/archives/2003/08/11/index.shtml#000373"&gt;an entry by Srijith&lt;/A&gt; discussing Bayes-based classification as unsuitable for use in news aggregators, I tied &lt;A href="http://www.spambayes.org"&gt;SpamBayes&lt;/A&gt; into my &lt;A href="http://www.decafbad.com/viewcvs.cgi/dbagg/"&gt;homebrew news aggregator&lt;/A&gt; and have been trying it out this week. I know Ive been &lt;A href="http://www.decafbad.com/blog/geek/syndicated_whuffie.phtml"&gt;talking about it&lt;/A&gt; for awhile, but procrastination and being busy all round kept me from getting to it. Funny thing is, when I finally got a chance to really check things out, the integration was a snap. Id anticipated a bit of work, but was pleasantly surprised. I doubt that any other aggregator written in Python would have a hard time with it.&lt;/P&gt;
&lt;P&gt;If, that is, anyone else wants to do it. I already knew it wasnt &lt;A href="http://diveintomark.org/archives/2003/07/07/linkdumps_are_like_sex"&gt;magic pixy dust&lt;/A&gt; but I figured it might be worth a try. I will be eating my dogfood for awhile with this, but Im thinking already that whats good for spam might not be so good for news aggregators.&lt;/P&gt;
&lt;P&gt;Srijiths &lt;A href="http://www.srijith.net/trinetre/archives/2003/08/11/index.shtml#000373"&gt;post&lt;/A&gt; mentions some snags in ignoring some of the semantics of a news item, such as whether a word appears in the items title or information about the items source. I dont think that this completely applies to how Im doing classification, since SpamBayes appears to differentiate between words found in email headers and the body itself. When I feed an item to SpamBayes for training and scoring, I represent it as something like an email message, with headers like date, subject, from, and an X-Link header for the link. However, even with this, I think Srijiths got a point when he writes that this method will miss a lot of available clues for classification.&lt;/P&gt;
&lt;P&gt;Unlike Srijiths examples, though, Im not trying to train my aggregator to sift entries into any specific categories. So far, Ive been trying to get it to discriminate between what I really want to read, and what Im not so interested in. So, I figured that something which can learn the difference between spam and normal email could help. But, although its early, Im noticing a few things about the results and Ive had a few things occur to me.&lt;/P&gt;
&lt;P&gt;See, in the case of ham vs spam, I really want all the ham and none of the spam. A method to differentiate between these two should be optimized toward one answer or the other. SpamBayes offers I dont know as a third answer, but its not geared toward anything else in-between. However, in measuring something like interest, inbetween answers are useful. I want all of the interesting stuff, some of the sort-of interesting stuff, and a little of the rest.&lt;/P&gt;
&lt;P&gt;This is also a problem for me in deciding to what I should give a thumbs up and what gets the thumbs down. Even though Ive subscribed to a little over 300 feeds, every item from each of them is somewhat interesting to me. I wouldnt have subscribed to the feed if there wasnt anything of interest there, so Ive already biased the content of what I receive. Some items are more interesting than others, but the difference between them is nowhere near the difference of wanted ham vs unsolicited spam. So, I find myself giving the nod to lots of items, but only turning down a few. SpamBayes would like equal examples of both, if possible.&lt;/P&gt;
&lt;P&gt;Ill still be playing with this for awhile, but I need to look around at other machine learning tech. Im just hacking around, but the important thing is to try to understand the algorithms better and know how they work and why. Bayes is in vogue right now, but as Mark Pilgrim intimated, its not magic. Its just advanced :)&lt;/P&gt;
&lt;P&gt;In the immortal words of &lt;A href="http://www.spidereyeballs.com/os6/set3/small_os6_d3_3596_sm.html"&gt;Mark Jason Dominus&lt;/A&gt;: You cant just make shit up and expect the computer to know what you mean, retardo! &lt;/P&gt;
&lt;P&gt;[&lt;A href="http://www.decafbad.com/blog"&gt;0xDECAFBAD&lt;/A&gt;]&lt;/P&gt;&lt;/BLOCKQUOTE&gt;
&lt;P&gt;Interesting.&amp;nbsp; Within &lt;A href="http://w4.evectors.it/"&gt;k-collector&lt;/A&gt; we already have a method for selecting a level of interest more granular than the feed and that is the topic (and, soon, groups of related topics).&lt;/P&gt;
&lt;P&gt;This allows you to say &lt;EM&gt;&lt;FONT color=maroon&gt;I'm interested in Java&lt;/FONT&gt;&lt;/EM&gt; rather than &lt;EM&gt;&lt;FONT color=maroon&gt;I want to read these 200 blogs where they talk about Java sometimes&lt;/FONT&gt;&lt;/EM&gt;.&amp;nbsp; Then within this view you could start to say &lt;EM&gt;&lt;FONT color=maroon&gt;well, so and so&amp;nbsp;is more interesting than Matt on this topic&lt;/FONT&gt;&lt;/EM&gt;.&amp;nbsp; But again, you are only dealing with the topic at hand.&amp;nbsp; You might still think I'm more interesting about something else even if I struggle for an example!&lt;/P&gt;
&lt;P&gt;We're already looking at interesting things we can do with this approach, maybe Bayesian filtering is something we should be thinking about.&lt;/P&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00001543.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>Crime is bad Reverend</title>
      <pubDate>Tue, 19 Aug 2003 10:01:29 +0100</pubDate>
      <description>&lt;P&gt;&lt;A href="http://osoft.us/Members/jeff/permanent/b-2003Feb22-104PM/book_view"&gt;Jeffrey Hicks&lt;/A&gt; sent me a comment in response to my &lt;A href="http://matt.blogs.it/2003/08/17.html#a1073"&gt;recent post&lt;/A&gt; thinking about introducing Bayesian filtering into &lt;A href="http://w4.evectors.it/"&gt;K-Collector&lt;/A&gt;.&amp;nbsp; He pointed me at an application he is demoing called &lt;A href="http://jrhicks.net/reverend"&gt;Reverend&lt;/A&gt;.&amp;nbsp; It's combines a&amp;nbsp;Bayesian filter with the &lt;A href="http://www.cogsci.princeton.edu/~wn/index.shtml"&gt;WordNet&lt;/A&gt;&amp;nbsp;lexical classifier to allow people to train it about whether words are good or bad.&lt;/P&gt;
&lt;P&gt;The result is that it can tell you that&amp;nbsp;crime is bad and, by inference, that bribery, racketeering, larceny and theft are bad too!&lt;/P&gt;
&lt;P&gt;Somewhat reminiscent of&amp;nbsp;some of the &lt;A href="http://www.google.com/search?hl=en&amp;ie=UTF-8&amp;oe=UTF-8&amp;q=cyc"&gt;Cyc&lt;/A&gt; inference examples.&lt;/P&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00001555.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>Bayes makes topics</title>
      <pubDate>Tue, 19 Aug 2003 19:57:08 +0100</pubDate>
      <description>&lt;P&gt;Doing a bit of digging into Bayesian filtering.&amp;nbsp; Although it may have other uses in K-Collector later on, my initial thoughts are using a Bayesian filter to automatically suggest topics for weblog posts via K-Collector client.&amp;nbsp; If it's good enough we might even be able to skip the user having to approve some topics.&lt;/P&gt;
&lt;P&gt;At the moment the client is using a simple keyword stemmer which is effective, up to&amp;nbsp;a point, but suggests a lot of false positives.&amp;nbsp;&amp;nbsp; I'm hoping that a trained Bayesian classifier will do a lot better.&amp;nbsp; Of course this raises the issue of how it gets trained but that is a bridge of a very different colour.&lt;/P&gt;
&lt;P&gt;Some resources I have come across:&lt;/P&gt;
&lt;UL&gt;
&lt;LI&gt;&lt;A href="http://www.paulgraham.com/spam.html"&gt;A Plan for Spam &lt;Paul Graham&gt;&lt;/A&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;A href="http://sourceforge.net/docman/display_doc.php?docid=13334&amp;group_id=63137"&gt;Project POPFile&lt;/A&gt;&lt;/LI&gt;&lt;/UL&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00001558.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>Experienced classifier wanted</title>
      <pubDate>Tue, 19 Aug 2003 20:18:41 +0100</pubDate>
      <description>One question bubbles up in my mind reading about Bayesian classifiers.&amp;nbsp; They all seem to be &lt;EM&gt;naive&lt;/EM&gt;.&amp;nbsp; So, what does an experienced Bayesian classifier look like?</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00001565.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>NewsBayes</title>
      <pubDate>Thu, 20 Nov 2003 16:23:49 +0000</pubDate>
      <description>&lt;div style="margin-left: 40px;"&gt;&lt;a href="http://weblog.infoworld.com/udell/2003/11/20.html#a851"&gt;Working with Bayesian categorizers&lt;/a&gt;. &lt;/div&gt;
&lt;blockquote cite="O'Reilly Network" style="margin-left: 40px;"&gt; There's
been some discussion in the blog world about using a Bayesian
categorizer to enable a person to discriminate along various
interest/non-interest axes. I took a run at this recently and, although
my experiments haven't been wildly successful, I want to report them
because I think the idea may have merit. [Full story: &lt;a href="http://www.xml.com/pub/a/2003/11/19/udell.html"&gt;O'Reilly Network: Working with Bayesian Categorizers&lt;/a&gt;]
&lt;/blockquote&gt;
&lt;div style="margin-left: 40px;"&gt;This month's O'Reilly Network column
was a struggle because categorization itself is a struggle. I remain
convinced that the automated classifiers that are doing such a good job
beating back the tide of spam will also turn out to be more generally
useful. But finding the right synergy between an automated assistant
and a human overseer is a subtle and tricky thing. &lt;b&gt;...&lt;/b&gt; [&lt;a href="http://weblog.infoworld.com/udell/"&gt;Jon's Radio&lt;/a&gt;]&lt;br&gt;
&lt;/div&gt;
&lt;br&gt;
Great write-up by Jon of his experiments trying to hotwire Bayesian categorizers for auto-classifying blog posts.&lt;br&gt;
&lt;br&gt;
I think his conclusions demonstrate that there is value in the
approach, certainly as Jon notes, for auto-classification of incoming
items as in a news aggregator.&amp;nbsp; Of course this is exactly what we
are doing with &lt;a href="http://w4.evectors.it/"&gt;K-Collector&lt;/a&gt; and I
have mused for a while about whether we could use a Bayesian classifier
to improve how we do that (at the moment we are using some pretty
simple keyword analysis).&lt;br&gt;
&lt;br&gt;
I think one of the biggest obstactles is the training.&amp;nbsp; We have
somewhere around 400 topics and many thousands of posts.&amp;nbsp; How hard
would it be to train a classifier?&amp;nbsp; How long would it take?&amp;nbsp;
Could you sell it?&lt;br&gt;
</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00001744.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>Whose personality do you want today?</title>
      <pubDate>Fri, 21 Nov 2003 17:47:08 +0000</pubDate>
      <description>&lt;a href="http://www.decafbad.com/blog/"&gt;l.m.orchard&lt;/a&gt; &lt;a href="http://radiocomments.userland.com/comments?u=0107808&amp;p=1243"&gt;commented&lt;/a&gt; regarding the &lt;a href="http://matt.blogs.it/2003/11/20.html#a1243"&gt;using Bayesian analysis on news&lt;/a&gt;.  In fact, as soon as I saw it I remembered, I had read his &lt;a href="http://www.decafbad.com/blog/geek/bayes_agg_one.html"&gt;piece&lt;/a&gt; already.  It was probably his writing that triggered my initial interest in using a Bayesian classifier in &lt;a href="http://w4.evectors.it/"&gt;K-Collector&lt;/a&gt;.&lt;br&gt;
&lt;br&gt;
Re-reading that piece I got an interesting different angle since his
approach was to blend a Bayesian classifier with his news aggregator to
try and have it prioritize news he would find interesting and not to
categorize it by topic.  I think this is a much more scalable
task, from a K-Collector perspective, than what &lt;a href="http://weblog.infoworld.com/udell/2003/11/20.html#a851"&gt;Jon&lt;/a&gt;
is experimenting with.  I think the efforts of training a
system-wide recognizer to differentiate between topics would be too
much for most users of the product to bear.&lt;br&gt;
&lt;br&gt;
Our product roadmap for K-Collector already includes allowing users to
personalize the system.  For example we think that people should
be able to say which feeds they think are relevant on different
topics.  Notice that this is a much very granular relationship
since it means that I can say "Matt Mower is a real expert on the topic
sock puppets" but that this says nothing about how relevant I am on
"dating." or any other topic.  Indeed each user might rate
the exact same sources differently over a wide range of topics.&lt;br&gt;
&lt;br&gt;
What might be interesting is if people could "share" and "subscribe to"
preference maps.  As a new user of the system you might not really
know who is relevant on any particular topic.  But imagine you
worked with &lt;a href="http://www.hyperorg.com/blogger/"&gt;David Weinberger&lt;/a&gt;, &lt;a href="http://www.dijest.com/aka/"&gt;Phil Wolff&lt;/a&gt;, or &lt;a href="http://weblog.siliconvalley.com/column/dangillmor/"&gt;Dan Gillmor&lt;/a&gt;. 
If you knew them and trusted their judgement you could pick one of
their preference maps as a starting point and immediately gain a
usseful insight into the data as it is structured by topic.  You
might even switch between personalities to get more perspective!&lt;br&gt;
&lt;br&gt;
Thanks to l.m.'s piece I am now wondering also about whether a Bayesian
classifier might be more use in helping users to establish their own
preference maps about which content is most relevant to them.&lt;br&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00001756.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
        <ent:topic ent:href="http://matt.blogs.it/topics/ipodder.xml" ent:id="ipodder" ent:classification="user"/>
        <ent:topic ent:href="http://matt.blogs.it/topics/podcasting.xml" ent:id="podcasting" ent:classification="user"/>
      </ent:cloud>
    </item>
    <item>
      <title>Tar'n'feather spammers</title>
      <pubDate>Sun, 10 Apr 2005 10:24:10 +0100</pubDate>
      <description>&lt;p&gt;I've been looking for a good corpus to train a Bayesian classifier on (still looking).  In my quest I've come across some interesting ideas and work.  Today I came across one which I thought was pretty good indeed.&lt;/p&gt;
&lt;p&gt;Marty Lamb of &lt;a href="http://www.martiansoftware.com/"&gt;Martian Software&lt;/a&gt; (who seem to be a small Java software house) has published an idea to &lt;a href="http://www.martiansoftware.com/articles/spammerpain.html"&gt;use statistics to cause spammers pain&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;Philosophically the idea strikes me as quite similar to &lt;a href="http://www.hashcash.org/"&gt;HashCash&lt;/a&gt; which is a system for preventing denial of service attacks by forcing a client to do work before it can use resources.  For normal users this work has little impact but when mounting a DoS attack it (theoretically) becomes prohibitive.&lt;/p&gt;
&lt;p&gt;Marty's idea is to bake a Bayesian filter into SMTP servers and have them analyze a message token-by-token as it arrives.  With each token a spam probability is calculated and used to vary the the speed of the connection.  As the server reaches the conclusion that a message is spam the connection slows to a crawl and it takes the spammer longer and longer to deliver the message.  Marty describe it thus:
&lt;blockquote&gt;&lt;p&gt;If an incoming message looks like spam, the connection could be slowed dramatically, consuming the spammer's resources and wasting their time. This would transform the server into a sort of dynamic tarpit, in which the spamminess of the incoming message affects the viscosity of the tar. As the spam probability goes up, the socket speed goes down.&lt;/p&gt;
&lt;p&gt;If enough of these dynamic tarpits were in place (or just a handful were placed in the right places), the spammers' mail software would bog down, reducing the rate at which they can send messages, in turn reducing the fees they can charge their customers. If these tarpits were ubiquitous, they could completely change the economics of spam, creating a scarcity of bandwidth experienced only by spammers.&lt;/p&gt;
&lt;p&gt;Ouch. :)&lt;/p&gt;
&lt;p&gt;Of course, this doesn't eliminate the problem presented by careless or uninformed sysadmins who provide open relays; spammers going through open relays wouldn't feel a thing. But the sysadmins that provide those services certainly would. It isn't too far-fetched to imagine that they might start to notice the effect on their server when all of its outbound SMTP connections are slowed to a crawl.&lt;/p&gt;&lt;/blockquote&gt;&lt;/p&gt;
&lt;p&gt;If this works then all we need to do is add feathers!&lt;/p&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00002486.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>A Bishop and a Reverend walk into a bar</title>
      <pubDate>Sun, 24 Apr 2005 21:59:29 +0100</pubDate>
      <description>&lt;p&gt;Last weekend I ported the &lt;a href="http://www.divmod.org/projects/reverend"&gt;Reverend&lt;/a&gt; bayesian classifier from Python to Ruby (thanks &lt;a href="http://homepages.unl.ac.uk/~alsburyj/"&gt;Jon&lt;/a&gt; for the original pointer to Reverend).  What I didn't realise was that &lt;a href="http://tech.rufy.com/"&gt;Lucas Carlson&lt;/a&gt; was, simultaneously, working a similar library... ah well, there's a good chance we'll roll my work into his more general classifier library.&lt;/p&gt;
&lt;p&gt;Anyway for what it's worth my port, &lt;a href="http://rubyforge.org/projects/bishop/"&gt;Bishop&lt;/a&gt;, is available for download and as a RubyGem.  I anticipate at least one more release before deciding what to do with it.&lt;/p&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00002525.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
    <item>
      <title>Bayesian sillyness on a Sunday afternoon</title>
      <pubDate>Sun, 24 Apr 2005 22:25:29 +0100</pubDate>
      <description>&lt;p&gt;&lt;a href="http://www.memescope.com/"&gt;MemeScope&lt;/a&gt; is a project i've been working on for a little while now.  The aim is to research ideas  about RSS, tagging, and information management within a personal context.  Working in &lt;a href="http://www.ruby-lang.org/"&gt;Ruby&lt;/a&gt; and utilizing the &lt;a href="http://www.rubyonrails.com/"&gt;Rails&lt;/a&gt; framework allowed me to make quick progress on the basic application however, lately, I've found it hard to make time with work and study (exams in May -- groan) pressing in on me.  Today though I made time to do a little experiment using &lt;a href="http://rubyforge.org/projects/bishop/"&gt;Bishop&lt;/a&gt;.  This has been done very quickly and crudely but I hope it will at least indicate a useful path to follow.&lt;/p&gt;
&lt;p&gt;Searching my bloglines subscription page I grabbed the first bunch of RSS2.0 feeds I could lay my hands on.  They were (in the order in which I could remember whether these people used vanilla RSS2.0): &lt;a href="http://paolo.evectors.it/"&gt;Paolo&lt;/a&gt;, &lt;a href="http://matt.blogs.it/"&gt;me&lt;/a&gt;, &lt;a href="http://radio.weblogs.com/0001011/"&gt;Scoble&lt;/a&gt;,
&lt;a href="http://www.scripting.com/"&gt;Dave Winer&lt;/a&gt;, &lt;a href="http://www.hyperorg.com/blogger/"&gt;David Weinberger&lt;/a&gt;, &lt;a href="http://marc.blogs.it/"&gt;Marc Canter&lt;/a&gt;, &lt;a href="http://www.jackvinson.com/"&gt;Jack Vinson&lt;/a&gt;, &lt;a href="http://www.terryfrazier.com/weblog/"&gt;Terry&lt;/a&gt;, and &lt;a href="http://blog.mathemagenic.com/"&gt;Lilia&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;I created a classifier and trained it with the feed of each person, using their name as a pool.  Then I fed another couple of blogs through and looked at the guessing results.&lt;/p&gt;
&lt;p&gt;&lt;a href="http://www.problogger.net/"&gt;Problogger&lt;/a&gt;:&lt;ul&gt;
&lt;li&gt;Jack = 0.468318829421744&lt;/li&gt;&lt;li&gt;Lilia = 0.00342010541710014&lt;/li&gt;&lt;li&gt;Marc = 0.282793689816727&lt;/li&gt;&lt;li&gt;Me = 0.393100714237438&lt;/li&gt;&lt;li&gt;Paolo = 0.346225200485668&lt;/li&gt;&lt;li&gt;Scoble = 0.253258334183292&lt;/li&gt;&lt;li&gt;Terry = 0.22771214945352&lt;/li&gt;&lt;li&gt;Weinberger = 0.0167048658202115&lt;/li&gt;&lt;li&gt;Winer = 0.289825786625603&lt;/li&gt;&lt;/ul&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://radio.weblogs.com/0110772/"&gt;Seb Paquet&lt;/a&gt;:&lt;ul&gt;&lt;li&gt;Jack = 0.576259503998576&lt;/li&gt;&lt;li&gt;Lilia = 0.235407686747673&lt;/li&gt;&lt;li&gt;Marc = 0.417682412318975&lt;/li&gt;&lt;li&gt;Me = 0.30854512908974&lt;/li&gt;&lt;li&gt;Paolo = 0.397301832715516&lt;/li&gt;&lt;li&gt;Scoble = 0.324509204588817&lt;/li&gt;&lt;li&gt;Terry = 0.291899188854501&lt;/li&gt;&lt;li&gt;Weinberger = 0.283141685408383&lt;/li&gt;&lt;li&gt;Winer = 0.108271483644474&lt;/li&gt;&lt;/ul&gt;&lt;/p&gt;
&lt;p&gt;&lt;a href="http://seblogging.cognitivearchitects.com/"&gt;Sebastian Fiedler&lt;/a&gt;:&lt;ul&gt;&lt;li&gt;Jack = 0.5&lt;/li&gt;&lt;li&gt;Lilia = 0.24819059477181&lt;/li&gt;&lt;li&gt;Marc = 0.5&lt;/li&gt;&lt;li&gt;Me = 0.234146370920656&lt;/li&gt;&lt;li&gt;Paolo = 0.326586572303767&lt;/li&gt;&lt;li&gt;Scoble = 0.317525809666623&lt;/li&gt;&lt;li&gt;Terry = 0.344058594866734&lt;/li&gt;&lt;li&gt;Weinberger = 0.208336788611639&lt;/li&gt;&lt;li&gt;Winer = 0.203113831786535&lt;/li&gt;&lt;/ul&gt;&lt;/p&gt;
&lt;p&gt;The numbers here are probabilities indicating (as far as the classifier is concerned) the likely hood of the guessed feeds being the same as the sample feeds.  There is probably far too little data here to determine anything useful however, given the people concerned, I do find something suggestive about the results.  Enough to make me want to look harder.&lt;/p&gt;
&lt;p&gt;For calibration here are two other results.  Feeding Paolo's feed back in for guessing:&lt;ul&gt;
&lt;li&gt;Jack = 0.162912565311238&lt;/li&gt;&lt;li&gt;Lilia = 0.00398573502931349&lt;/li&gt;&lt;li&gt;Marc = 0.0498689666595031&lt;/li&gt;&lt;li&gt;Me = 0.00988167948078622&lt;/li&gt;&lt;li&gt;Paolo = 0.5&lt;/li&gt;&lt;li&gt;Scoble = 0.0404166297297532&lt;/li&gt;&lt;li&gt;Terry = 0.0308895504863602&lt;/li&gt;&lt;li&gt;Weinberger = 0.0169976173877616&lt;/li&gt;&lt;li&gt;Winer = 0.0083263139102045&lt;/li&gt;&lt;/ul&gt; and for my own feed:&lt;ul&gt;&lt;li&gt;Jack = 0.355604304977086&lt;/li&gt;&lt;li&gt;Lilia = 0.00691730756404335&lt;/li&gt;&lt;li&gt;Marc = 0.148014510140305&lt;/li&gt;
&lt;li&gt;Me = 0.5&lt;/li&gt;&lt;li&gt;Paolo = 0.0873206208996655&lt;/li&gt;&lt;li&gt;Scoble = 0.0863911457566834&lt;/li&gt;&lt;li&gt;Terry = 0.105989792649879&lt;/li&gt;&lt;li&gt;Weinberger = 0.0781196567440771&lt;/li&gt;&lt;li&gt;Winer = 0.00582495354516671&lt;/li&gt;&lt;/ul&gt;&lt;/p&gt;
&lt;p&gt;Next time round I'm going to be more thorough, allow the use of RSS1.0 and Atom feeds, think about adjustments for text quantity and experiment with the use of the Robinson-Fischer algorithm for combining probabilities.&lt;/p&gt;</description>
      <guid isPermaLink="true">http://matt.blogs.it/entries/00002528.html</guid>
      <ent:cloud ent:href="http://matt.blogs.it/topics/">
      </ent:cloud>
    </item>
  </channel>
</rss>

	</div>
</div>
<div id="footer">
	<div class="info">
Copyright 2006 Matt Mower -- <a href='http://squib.rubyforge.org/'>Squib</a> Version 0.4.0 (Release 282)&nbsp;&nbsp;Updated: 19/01/2006 18:45
	</div>
</div>
</div>
</body>
</html>
