ARCHIVE: http://davidjanesblognews.blogspot.com/2002_12_01_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2002_12_08_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2002_12_15_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2002_12_22_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2002_12_29_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_01_05_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_01_12_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_01_19_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_01_26_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_02_02_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_02_09_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_02_16_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_02_23_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_03_02_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_03_09_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_03_16_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_03_30_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2003_04_06_archive.html ARCHIVE: http://davidjanesblognews.blogspot.com/2006_10_15_archive.html AUTHOR: David P. Janes DATE: 1:21 PM ----- BODY:
Testing
-------- AUTHOR: David P. Janes DATE: 4:33 AM ----- BODY:
We're up

I need to integrate the two blogs properly into the system, but besides that we're off and running. Our new home is at www.blogmatrix.com.

-------- AUTHOR: David P. Janes DATE: 6:59 AM ----- BODY:
Bigger problems

We're down, at least for the rest of the day (Sunday), due to an upgrade of the db at hostmatters that has broken my libraries. I have the replacement system ready to go and it will be available at www.blogmatrix.com in the next 12 to 24 hours. Old URLs with automatically be redirected, though hopefully you'll all update your URLs.

-------- AUTHOR: David P. Janes DATE: 4:28 AM ----- BODY:
Big Technical Problems

Very little is going to be updated for the next two or three days until I can get a new server configured and running. Service should return to normal by Thursday or Friday morning. Our apologies.

-------- AUTHOR: David P. Janes DATE: 4:32 AM ----- BODY:
Lupy

Lupy is a port of Lucene to Python. Yay! Via Don Park.

-------- AUTHOR: David P. Janes DATE: 4:08 AM ----- BODY:
10,000

We've crossed the 10,000 blog mark. The only real problem we're having is that disk space, even after upgrades, is still short. I may have to host this myself somewhere soon.

-------- AUTHOR: David P. Janes DATE: 3:37 AM ----- BODY:
Recent News

There's lots of new little things happening:

-------- AUTHOR: David P. Janes DATE: 3:53 AM ----- BODY:
Sorry

For any apparently flakiness recently. Hostmatters has been fairly unreliable, plus I've been working on a new "satellite" system for download and processing blogs, to distribute the load across the Internet.

-------- AUTHOR: David P. Janes DATE: 3:48 AM ----- BODY:
Bugs

Ah, I see the permalinks reports are going to need a little more work.

-------- AUTHOR: David P. Janes DATE: 3:45 AM ----- BODY:
Updates

I've done a couple of updates to Janes' Blogosphere this morning, cleaning out a queue of work items I've had for a few weeks. Among the chanegs:

-------- AUTHOR: David P. Janes DATE: 5:19 AM ----- BODY:
Update
ThreadTrack is now official. Put it on your blog today.
-------- AUTHOR: David P. Janes DATE: 4:57 AM ----- BODY:
Flying blind

There's now isn't any hit counter on the pages anymore.

-------- AUTHOR: David P. Janes DATE: 5:13 PM ----- BODY:
Bug

The template rewriter was making validating XHTML sites non-valid. I've fixed this. If you don't use XHTML, there's nothing to worry about!

-------- AUTHOR: David P. Janes DATE: 1:02 PM ----- BODY:
Update

Sorry, there's been a bagload of improvements to this site over to this site of the last two weeks which I haven't documented here. I've added a new navbar to the right (css rules yet to be pinned down), removed the old navbar from the top, added support for RSS 3.0 (dubbed R3 here so our naive users will not get confused), changed the entire markup language, added a "technology" blog (so I won't bore you with that here), and added many tools for making the underlying database more reliable and faster.

I think that's it.

-------- AUTHOR: David P. Janes DATE: 5:25 AM ----- BODY:
Endorsement

Your very own RSS feed Technology

Now that I have my own list of feeds that I read, I don't rarely visit blogs directly. Now I just browse my list and go there if the title looks interesting. But, there are a lot of blogs that I wish had their own RSS feed.

Well, I found Janes' Blogosphere. It allows anyone to create their own RSS feed with it's screen scraping service, no matter how the blog was created. Now nobody has an excuse to not have their own feed.

ostang, on the web

-------- AUTHOR: David P. Janes DATE: 2:14 AM ----- BODY:
Endorsement

David Bigwood is tracking David Janes' efforts to jumpstart a viable weblog metadata initiative. Double the Davids, double the potential!

The Shifted Librarian

-------- AUTHOR: David P. Janes DATE: 2:14 AM ----- BODY:
Endorsement

From Janes' Blogosphere Technology This looks promising, tools and standards. I'll have to look at it in more detail.

David Bigwood, Catalogablog

-------- AUTHOR: David P. Janes DATE: 2:14 AM ----- BODY:
Endorsement

JANES ADDITION: Besides the blogger RSS feed, available on the upper left-hand side of the screen, we've just added a feed produced by David Janes' RSS scraper. It looks like it'll be more reliable than the blogger-produced version; you can test it here, using SOAPClient. Thanks again to David Janes for providing this and other excellent free blog-handling tools.

Eugene Volokh

-------- AUTHOR: David P. Janes DATE: 10:55 AM ----- BODY:
Technology Blog

All future posts about the workings of Janes' Blogosphere will now be over here, at the Technology blog.

The first order of business is a proposal for the Standardization of Weblog Metadata. Standardized Weblog Metadata is what makes it easy for me to produce RSS feeds for your blog, and in the future, will make it easier for other tools (such as the various Ecosystems) to understand your blog also.

-------- AUTHOR: David P. Janes DATE: 1:55 AM ----- BODY:
Busy...

I'm reworking the QSM format. See this blog's raw HTML for details, and more announcements coming soon.

-------- AUTHOR: David P. Janes DATE: 2:57 PM ----- BODY:
Updates

-------- AUTHOR: David P. Janes DATE: 1:25 PM ----- BODY:
Update

I've changed the navigation bar at the top, once again, to make it easier to get to the News and to Add a blog (formerly "Join"). They're both in bold right now, till people get used to seeing them. "Login/Settings" or "Logout" has now become "My Login" all the time. I've fixed up the presentation, but not the content, of the Help and FAQ pages. Finally, I've made what you've seen in BlogTrack persist over multiple sessions -- a badly needed feature.

-------- AUTHOR: David P. Janes DATE: 3:42 AM ----- BODY:
Outage

If you noticed problems yesterday, it's because of this problem at HostMatters.

-------- AUTHOR: David P. Janes DATE: 2:31 PM ----- BODY:
Bug

Sorry for the flakiness of searching -- it needs a major overhaul, and I'm stretch very thin these days. It works, but not always. Soon!

-------- AUTHOR: David P. Janes DATE: 2:25 PM ----- BODY:
Update

I made a major upgrade to the presentation of BlogTrack, BlogInfo and Search in the left pane, making extensive use of style sheets. I've testes this on IE and Phoenix (Mozilla), but if you see a problem with these or with some other browser, please send me a note!

-------- AUTHOR: David P. Janes DATE: 5:20 AM ----- BODY:
Endorsement

Finally, I've debated about whether or not to post a blogroll. Since I wouldn't use it myself, the value to me would be minimal; plus, I'd rather not be in a position to be accused of “playing favorites.” There are plenty of great blogs out there, and you don't need me pointing you in the right direction. (I have put some links to some services I've found useful, however; GeoURL and Janes' Blogosphere are both neat tools that I recommend heartily.)

Chris Lawrence

-------- AUTHOR: David P. Janes DATE: 3:34 AM ----- BODY:
Endorsement

I'm cleaning up in the multilingual category this week!

Moins de 12 heures après avoir enregistrer son site et modifier la template, le site de Melanie a maintenant un fil RSS qui fonctionne. C'est limité au sens que ça ne publie pas tout le message mais c'est assez pour générer un peu de traffic de la part des autres bloggers. Merci encore à David Jane, pour son outil sur Blogosphere.

Procrastinations

-------- AUTHOR: David P. Janes DATE: 3:02 AM ----- BODY:
FAQ: what is QSM?

QSM is:

Sub-questions:

-------- AUTHOR: David P. Janes DATE: 12:08 PM ----- BODY:
Metadata and Blog Directories

Here's another blog directory doing Metadata, called Blizg. We really need to coordinate this! Expect a lot more messages about QSM from me in the near future.

-------- AUTHOR: David P. Janes DATE: 10:19 AM ----- BODY:
Endorsement

Check out Janes' Blogosphere, by David Janes. He describes it to me as a "web based aggregator and search engine for blogs that works against RSS feeds and a large number of scraped blogs.

Pretty cool.

Here's the one for this very blog, and here's a search for its author.

Jim Flowers

-------- AUTHOR: David P. Janes DATE: 10:18 AM ----- BODY:
Endorsement

Janes' Blogosphere Newsfeeds for Blogspot, etc.? Thanks a lot, David. I've missed out on a lot of great writing due to last year's archive problems with BlogSpot and the lack of RSS/XML. Beta or not, a great service

Tenorman

-------- AUTHOR: David P. Janes DATE: 2:39 AM ----- BODY:
Endorsement

A excellent source for finding weblogs is Daypop, a news search engine that allows you to restrict your search to weblogs. Also try Janes' Blogosphere. (That link finds weblog commentary on "space shuttle.")

Use the Net for Alternative Coverage, Steve Outing, Poynter.org

-------- AUTHOR: David P. Janes DATE: 2:51 PM ----- BODY:
Thank you

I'd like to thank everyone who's sending me bug reports, "stupid" questions (they're not), and requests for features. I need your feedback! Keep it coming please.

-------- AUTHOR: David P. Janes DATE: 2:44 PM ----- BODY:
Bug fixes

-------- AUTHOR: David P. Janes DATE: 12:12 PM ----- BODY:
FAQ: what am I supposed to do with the QSM:BLOGROLL ... tags

This is the first in a series of of entries that will be explaining some of the technology behind Janes' Blogosphere. I may edit these from time to time, as my understanding (or ability to explain) improves over time. There is no particular order to how I'm adding theses to the news blog, except to note that it's generally user question driven.

The Template Rewriter adds two lines (among many others) to your Blog's Template, the first one generally directly after the "<body>" tag and the second one just before the place where your blog's entries go:

<!-- QSM:BLOGROLL -->
...
<!-- QSM:END BLOGROLL -->

The purpose of these tags is to indicate to scrapers (such as Janes' Blogosphere) where the blogroll is in your blog. What your blogroll is is entirely up to you, but generally

-------- AUTHOR: David P. Janes DATE: 11:59 AM ----- BODY:
Endorsement

Merci à Emmanuelle qui m'a fait connaître le site de Jane's Blogosphere. Les outils de David Janes m'ont enfin permis de créer un fil RSS, ce qui n'était pas facile auparavant avec Blogger. Je l'ai placé sous la rubrique "Veux-tu" dans la colonne de droite de ce blogue/carnet. Je n'utilise pas ces fils moi-même, mais si vous vous servez du mien, faites-moi signe pour me dire si j'ai bien fait ça!

ni vu ni connu

-------- AUTHOR: David P. Janes DATE: 5:44 AM ----- BODY:
Endorsement

Here's Kevin Steel's RSS feed. You'll note that there's nothing in it. Why, you ask? Because my RSS feeder only shows the last 48 hours of entries. I may adjust this to be 24 hours + guaranteed one entry, just to show that it's working.

I was talking about my weblog, on the way home from work, with Kevin Steel, and for some reason I happened to mention that I didn't expect I'd have to add too many bells and whistles to my hand-coded site. Approximate quote: "By the time XML/RSS becomes essential for weblogs--or indeed acquires any observable importance at all--I'm sure someone will have figured out a way to automatically translate my source code into RSS output. Figuring out how to add RSS to my site would be a source of great grief and annoyance to me, and of none at all to some clever coder. So I'll wait for that."

Approximate length of wait: five hours. Now that's Internet time! I just now discovered, by chance, that David Janes' Blogosphere is offering RSS feeds for the weblogs it "scrapes", including my site. If you want yours added, all you need to do is follow his instructions. You can view the Janesian RSS output for ColbyCosh.com even with a regular web browser. It's a bit clumsy: Janes' code hasn't learned to tell my entry titles apart from my body text. But that's a lot to ask, and I can't imagine who'd complain (plus it sounds like I could solve it myself with an HTML tweak--maybe I'll look into that on some day when I'm feeling customer-service-y). Some aggregators may only read RSS feeds, but I doubt any actual humans limit their weblog-viewing in this way. Thanks to David for the free benefit; I'll add the appropriate button to my sidebar shortly.

Colby Cosh

-------- AUTHOR: David P. Janes DATE: 3:50 AM ----- BODY:
Endorsement

My first bilingual endorsement. Thanks, Emmanuelle!

Merci, Dave. Pour ceux qui veulent installer un bouton feed RSS sur leur site, fastoche, grâce à David Janes' blogosphere. (Via the Volokh conspiracy) / Thanks, Dave. For those who want to put a RSS feed button on their site, in two seconds, thanks to David Janes' blogosphere. (Via the Volokh conspiracy)

Emmanuelle Richard, emmanuelle.net

-------- AUTHOR: David P. Janes DATE: 3:26 AM ----- BODY:
Endorsement

New Blogosphere Feature. From the hard-working operator of the Ranting and Roaring blog: "(David) Janes' Blogosphere is now providing RSS feeds for all ... "

Izzy Lyman, icky.blogspot.com

-------- AUTHOR: David P. Janes DATE: 10:27 AM ----- BODY:
New Feature: RSS feeds

Janes' Blogosphere is now providing RSS feeds for all the blogs it scrapes that don't have already provide an RSS feed (over 900). Why do you care? Because many folks out there, such as Radio Userland, AmphetaDesk, and other aggregators, only read blogs that provide RSS updates. Unfortunately, many blogging programs -- particularly Blogger and Greymatter -- don't provide RSS feeds. Blogger Pro provides the ability to do this but is incorrectly implemented -- i.e. it doesn't work.

To get your RSS feed, complete the following steps:

Your blog's not listed? Do the following:

We'll be adding a "http://www.weblogs.com" like page soon, which we'll announce here when it's ready (probably by the weekend).

-------- AUTHOR: David P. Janes DATE: 5:17 AM ----- BODY:
We're number 5

Janes' Blogosphere is now number 5 in Google listings. It's some pretty good competition ahead of me, so moving up's going to be a little tough. We'll do our best though!

-------- AUTHOR: David P. Janes DATE: 9:17 AM ----- BODY:
Update

The "all blogs" listing function (from BlogTrack, especially) now moves "The" and "A" to the end of the blog's title, so too many entries don't collection under "T" and "A".

-------- AUTHOR: David P. Janes DATE: 8:58 AM ----- BODY:
Update

If you select "Custom Blogroll" from within BlogTrack, the blogs will now be listed in the left frame. This is the first step in a reformation of the Blogroll functions. We are also going to add the ability to add your own, non-Blogosphere blogs into the list.

-------- AUTHOR: David P. Janes DATE: 1:42 PM ----- BODY:
Update

There's many small incremental changes happening, but they're almost all on the back-end. Go over to my home page and check out ThreadTrack, which you'll see listed on every entry.

-------- AUTHOR: David P. Janes DATE: 2:18 PM ----- BODY:
Update

(Hopefully) fixed a bug that stopped search requests from getting the most recent data.

-------- AUTHOR: David P. Janes DATE: 1:35 PM ----- BODY:
Update

Fixed a BlogInfo bug that caused frames to become nested.

-------- AUTHOR: David P. Janes DATE: 12:10 PM ----- BODY:
Back up from backups!

I had to restore the "entry" and "entrylink" database tables from yesterday morning. Sigh. It's a good think I back up every morning.

-------- AUTHOR: David P. Janes DATE: 5:56 AM ----- BODY:
Down today

The MYSQL database on Hostmatters has crashed in such a way to lose all of the data. Grrr. I won't be able to recover the data until later today.

-------- AUTHOR: David P. Janes DATE: 3:39 AM ----- BODY:
Endorsement

I thought I had the only one good use for frames!

One more related note: if you haven't already done so, check out David Janes' nifty new blog reading and searching tool, Janes' Blogosphere. I'm finding it quite useful even though it employs that annoying, vile, and evil HTML construct: frames.

Mark Wickens

-------- AUTHOR: David P. Janes DATE: 4:50 AM ----- BODY:
Testing

Cross-link #2.

-------- AUTHOR: David P. Janes DATE: 4:50 AM ----- BODY:
Testing

Cross-link #1.

-------- AUTHOR: David P. Janes DATE: 3:14 AM ----- BODY:
Update

There's been a significant set of changes made to the "back-end" of the system over the weekend, both to improve quality and speed. I've stopped "discovery" of new blogs, though those joining will still get added in the normal way (i.e. every morning around 5:00 AM, when I wake up). The reason for this is I want to take a pause while I examine the data I have on the system in-depth, for the amount because totally unreasonable. I'm adding a significant new function called "ThreadTrack" that will function like a comments system for multi-blog conversations. Hopefully this will be up and running by mid-week.

-------- AUTHOR: David P. Janes DATE: 3:59 PM ----- BODY:
Update

I'm doing a significant back-end update to the software. Things may be flaky until tomorrow.

-------- AUTHOR: David P. Janes DATE: 2:50 PM ----- BODY:
Update

I have the comment system working again. D'oh!

-------- AUTHOR: David P. Janes DATE: 10:51 AM ----- BODY:
Update

The left hand frame of BlogTrack now displays all the blogs that don't have current entries. This is part of a move to allow users to add arbitrary, non-tracked, blogs to their blogrolls.

-------- AUTHOR: David P. Janes DATE: 4:20 AM ----- BODY:
Bugs

-------- AUTHOR: David P. Janes DATE: 4:18 AM ----- BODY:
Update

BlogTrack displays old entries (i.e. before your last checkpoint) if there's an followups. I've modified the code that does this such that it will display a dash-lined between the new entries and the old entries.

-------- AUTHOR: David P. Janes DATE: 3:38 AM ----- BODY:
Endorsement

Thanks, Doc!

A sphere of ends
Check out Janes' Blogosphere, by David Janes. He describes it to me as a "web based aggregator and search engine for blogs that works against RSS feeds and a large number of scraped blogs. Pretty cool. Here's the one for this very blog, and here's a search for its author.

"Doc Searls"

-------- AUTHOR: David P. Janes DATE: 2:08 AM ----- BODY:
Welcome, Doc Searls readers

Cross-posted from my day blog.

Is anyone interested in RSS feeds from the blogs I am scraping? Please mail me if you are.

-------- AUTHOR: David P. Janes DATE: 9:41 AM ----- BODY:
Endorsement

Best Weblog Directory or Update Monitor
1. Janes' Blogosphere

"Patrick Ruffini"

-------- AUTHOR: David P. Janes DATE: 9:36 AM ----- BODY:
Endorsement

JUST WHEN YOU NEED TO KEEP TRACK OF STUFF This is a cool little tool. It lets you keep track of a bunch of different blogs at once so you can see when they're updated. It also displays recent posts with about the first sentence so you can jump right to them. Cool.

"Alex Knapp", Heretical Ideas

-------- AUTHOR: David P. Janes DATE: 4:53 AM ----- BODY:
Janes' Blogosphere, full-text searching, and a plea for help

This is cross-posted from my day-blog.

First, the good news: Janes' Blogosphere now allows "full-text" (i.e. google- or altavista-like) searching of blogs for content. For example, try the following likes (all will open in a new window)

OK, you get the idea. The database of text only goes back about a week right now, but it's being updated hourly with the latest posts and I'm going to start adding archives of blogs that have signed up with me.

Now, the plea: If you like this, please give me a mention in your blog, or better yet, add a Janes's Blogosphere button to your links. I've done a lot of work on this project and I think it's pretty damn useful. And I'd like more people to be using it!

If you haven't used Janes' Blogosphere before, please try out BlogTrack, which I think is the "core" function of the system: it tracks updates to the blogs you like to read and lets you read them all very quickly. I've found that it saves me an incredible amount of time per day in reading blogs.

And... if you really love me, nominate Janes' Blogosphere for a Bloggie under the category "best weblog directory or update monitor". There's only a couple of days left, so get going. I'd nominate you! In fact, I may have already!

-------- AUTHOR: David P. Janes DATE: 12:05 PM ----- BODY:
System up

We've been up since 7 this morning! I've been busy work on big changes which I hope to reveal later today or tomorrow at the latest.

-------- AUTHOR: David P. Janes DATE: 12:57 AM ----- BODY:
System down

There's something broken with the server's database. Hopefully, we'll be back online in a few hours.

-------- AUTHOR: David P. Janes DATE: 11:09 AM ----- BODY:
Update

There's a PayPal button on the front page now, just in case you have a few dollars you don't need anymore!

-------- AUTHOR: David P. Janes DATE: 11:05 AM ----- BODY:
Endorsement

Here's a handy resource for bloggers: Janes' Blogosphere lists and displays blogs as they are updated, letting you read many blogs quickly. Do a search on a URL here.

"Ad Orientem"

-------- AUTHOR: David P. Janes DATE: 1:14 PM ----- BODY:
Update

The "All Blogs" report has been improved. In particular, special character starting letters for blog titles are ignored and the report has been divided into multiple pages (since we're now tracking almost 2000 blogs!)

-------- AUTHOR: David P. Janes DATE: 8:37 AM ----- BODY:
Update

When BlogInfo ([bi]) is selected from within BlogTrack, the result now displays in the right frame rather than a new window.

-------- AUTHOR: David P. Janes DATE: 8:37 AM ----- BODY:
Update

The "Login/Logout/Settings" page has been changed significantly -- especially code-wise. The Settings (for adjusting things such as the frame split) has no been merged into this page. You can no longer add your blog when your create a login: you must go through the Add Blog pages. The Change Password section should now work!

-------- AUTHOR: David P. Janes DATE: 8:37 AM ----- BODY:
Bug fix

Thank you to Christi Turner for pointing out a nasty little crash in BlogTrack. This is now fixed.

-------- AUTHOR: David P. Janes DATE: 8:36 AM ----- BODY:
Happy New Year
-------- AUTHOR: David P. Janes DATE: 6:47 AM ----- BODY:
Update

I am now aggressively adding new blogs to the system, on the order of several hundred per day. These blogs are all RSS feeds (of varying sorts), so there's very little overhead on the system -- except for disk space.

-------- AUTHOR: David P. Janes DATE: 1:45 AM ----- BODY:
Update

There is now a User Settings page, from which you can adjust the frame division in BlogTrack. Beyond that, there's not too much yet.

-------- AUTHOR: David P. Janes DATE: 4:35 AM ----- BODY:
Update

I've reduced the "cycle" to checking once every 4 hours until the 26th, since we don't expect a lot of activity in the Blogosphere in the next few days.

Merry Christmas, all!

-------- AUTHOR: David P. Janes DATE: 5:59 AM ----- BODY:
Bug fix

The Ambler is now in the correct city, correctly spelled.

-------- AUTHOR: David P. Janes DATE: 3:16 AM ----- BODY:
Bug

Deleted misplaced postings here.

-------- AUTHOR: David P. Janes DATE: 3:09 AM ----- BODY:

Janes' Blogosphere now has a "Add your blog" interactive webpage (as opposed to having to mail me).

-------- AUTHOR: David P. Janes DATE: 5:14 AM ----- BODY:
Bug: sticky locations

Once a blog gets assigned to a particular geographical location -- "vancover", for example -- it can't be moved afterwards. This needs fixing, but there's only so many hours in a day and I'm trying to come up with a better registration system right now.

-------- AUTHOR: David P. Janes DATE: 12:57 PM ----- BODY:
Update

Not much happening today. All of CBC's regional sites are now being tracked, as well as the National Post's news page.

-------- AUTHOR: David P. Janes DATE: 12:14 PM ----- BODY:
Update: blog directory

BlogTrack now has an integrated blog directory, for selecting blogs for your Custom Blogroll.

-------- AUTHOR: David P. Janes DATE: 7:49 AM ----- BODY:
Commentary

'Blog Metadata

There are several projects to add metadata to Web logs to provide better access to them. However, everybody seems to be working in isolation. I began by adding Dublin Core, A-Core and PICS. That was OK. DC and PICS were standards, and A-Core was based on DC. Next came Blogchalk Simple and easy to apply. There was a tool to search 'blogs that had been chalked, but that seems to be no longer working.

This was followed by the Weblog MetaData Initiative (WMDI). This was standards based, an extension of Dublin Core. A tool to read and compile WMDI is available. It will read qualified DC, no need for the WMDI additions. Nice.

The latest is Janes' Blogosphere. It uses non-standard markup but has a suite of tools to create, read and interpret the metadata and 'blog entries.

All this work going in different directions. What is needed is a metadata standard for 'blogs, like WMDI that supports a suite of tools like Jane's to add the medadata to the template and interpret it. Talk to each other people and work together. A widely adopted standard metadata for Web logs could be used in RSS, OPML, OAI-MHP and other Web services. It could improve access in so many ways.

"Catalogablog", David Bigwood

A few quick comments on this:

-------- AUTHOR: David P. Janes DATE: 7:40 AM ----- BODY:
Bugs

-------- AUTHOR: David P. Janes DATE: 7:38 AM ----- BODY:
Update

The right frame of BlogTrack has now been cleaned up a little, and the geographical list of blogs is now "integrated". Next step: integrating selecting blogs by name, and maybe searching a little later.

-------- AUTHOR: David P. Janes DATE: 2:07 AM ----- BODY:
Update

I've just uploaded the first half of a substantial upgrade to the website. Navigation should be significantly easier now, as pages are laid out more hierarchically and the top of the page contains a "you are here" display. BlogTrack now include its own help page and it also makes sure the URL displayed in the browser is "sharable". The second half of the upgrade goes up tomorrow morning.

-------- AUTHOR: David P. Janes DATE: 6:04 AM ----- BODY:
Update: new look for news

This blog is now compatible with the overall look and feel. The links above may not exactly work today -- they're set up for the next version of the website, which I'll be loading tomorrow morning.

-------- AUTHOR: David P. Janes DATE: 2:22 AM ----- BODY:
Update

There's a lot more blogs being scraped now, after a marathon of code improvement and regex writing over the weekend. I've decided that the next small step will be an improvement of the UI.

-------- AUTHOR: David P. Janes DATE: 5:07 AM ----- BODY:
Update: more disk space

I just upgraded my Host Matters account and now have a pile more disk space, and $USD 40 less dollars (a year). Seems like a reasonably good deal to me. Now to add more blogs.

-------- AUTHOR: David P. Janes DATE: 4:49 AM ----- BODY:
Bugfix

I found a nasty little bug that meant that people who had created blogs through "Login" weren't actually getting scraped. Fixed, and now you are!

-------- AUTHOR: David P. Janes DATE: 7:37 AM ----- BODY:
Endorsement

Janes' Blogosphere This is an interesting idea, "a web-based microcontent aggregator for blogs (and soon newspapers) [that] displays all the current entries from the blogs listed in a reference blog's blogroll." What is interesting is that the concept is predicated on the fact that it takes too long to read blogs. Too long? By David Janes.

OLDaily

-------- AUTHOR: David P. Janes DATE: 7:22 AM ----- BODY:
Notice: I'm alive

I'm here in beautiful cold St. John's with wet feet and now-working Internet connection. I'm going to spend the next few days fixing up blogs which are downloaded, but not scraped, and adding a lot of RSS-only feeds to the system.

-------- AUTHOR: David P. Janes DATE: 5:20 AM ----- BODY:
Update: for Greymatter and Radio Userland users

The Template Rewriter knows how to correctly rewrite Greymatter and Radio Userland templates now. A few notes:

-------- AUTHOR: David P. Janes DATE: 2:08 AM ----- BODY:
Notice

I'll be flying off to sunny St. John's for my Winter Vacation later this morning, so I may not be answering e-mails until tomorrow morning, or until I figure out how my parent's Internet connection works.

-------- AUTHOR: David P. Janes DATE: 1:15 AM ----- BODY:
Feature: Custom Blogroll

Here's the super the new feature I've been hinting about for the last couple of days: Custom Blogrolls. Anyone with a Login can now create a "Custom Blogroll" on the website, to track the blogs that specifically interest them. This means you no longer have to use a "reference blog" to select which blogs you want to read using BlogTrack.

Use the "[+]" link from BlogTrack or "[Add]" or "[Add All]" from Search or any of the reports to add blogs to your Custom Blogroll.

I've also updated the colour scheme to make it a little more obvious what is a link and what is not, but it may be a little too Christmasy.

-------- AUTHOR: David P. Janes DATE: 12:33 AM ----- BODY:
Endorsement

The Blogtrack at Janes' Blogosphere

This is an excellent new resource, providing a single point where a number of entries on a number of weblogs can be reviewed and read at one time.

The LitiGator

I corrected the possessive form my name.

-------- AUTHOR: David P. Janes DATE: 1:14 PM ----- BODY:
Endorsement

Another Catalog of the Blogosphere Steven Cohen has come across yet another blogosphere aggregator - Janes' Blogosphere.

The "Janes" in the title is David Janes, a Canadian blogger (eh). Naturally, David is maintaining a blog to highlight changes, endorsements, and improvements, but here's a description straight from the site:

... Offhand, I don't see a count of how many blogs are being indexed at the moment. It's an interesting twist on an online aggregator (reloading external content in a frame), plus a mix of the various ecosystems and Technorati. My favorite feature so far, though, has to be the geographical breakdown of those blogs indexed in the database (although it looks to be a subset of those currently indexed). Could dovetail nicely with political initiatives, meet ups, and business/social networks in the big "B" Blogosphere, although the obvious barrier here is the onus on bloggers to sign up and add code to their templates.

Another great find, Steven!

The Shifted Librarian

For the record, I believe it's around 800 blogs right now be aggregated, the list mainly drawn from:

  1. my favorite blogs (roughly speaking, the warblogosphere and the cream of the nerdosphere)
  2. the Blogs4God collection -- my kind beta test group
  3. the English-language blogs from the top 500 in Blogstreet

There are a number of different strategies for putting data into the system, including a scraper than will work against 70% of blogspot sites out of the box, a regex-driven scraper, a QSM -- my markup -- scraper, and a RSS "scraper". Not ever blog in my list is parsable, but the software has an adaptive strategy to ensure it doesn't check unworking/unchanging blogs very often.

The problem with RSS blogs is that:

  1. Many RSS feeds to not provide the full entry content (though I see that most Radio blogs do)
  2. I cannot find out their "blogroll" information easily without going back to scraping the HTML feed
  3. I cannot get additional information about the blog, such as location information, from an RSS feed

I plan to solve these problems by:

  1. occasionaly (once a week) scraping the HTML blog, even if there is an XML feed
  2. working with NZ Bear and folks at the WMDI initiative to (hopefully!) define RSS 1.0, hopefully RSS 0.9x/2.0, and XHTML extensions for defining "meta" data.
  3. introducing the secret new feature tomorrow

Right now, the real driver of the system is bandwidth, diskspace, and money, and my faith in how well the software works and scales. Once things really get rolling, I hope to be doing at least a order of magnitude more blogs. That said, I'm passively waiting for people to sign up right now, especially those with legacy/non-RSS systems, just because I am still officially "in beta". If anyone is interested in being tracked, then they can contact me or create a login.

If you like the geographical breakdown stuff -- which really is just a subset of people who have signed up + blogs where I know the location of -- I'll be introducing even cooler features based on this in thew New Year. Jenny's guess -- Could dovetail nicely with political initiatives, meet ups, and business/social networks in the big "B" Blogosphere -- is exactly where I plan to go with this.

-------- AUTHOR: David P. Janes DATE: 4:46 AM ----- BODY:
Endorsement

Coming soon to this very space: an explanation of what "hidden" means (in the context of BlogTrack)

Canadian weblogger David Janes has introduced a new tool which shows you, at a glance, the latest content on the weblogs in a blogroll of your choice. That's not a very good capsule explanation. You'll have to go play with the thing itself; its purpose will be obvious. You can even start with my own blogroll.

I'm ashamed to admit that Janes showed me this long-secret project months ago and I didn't understand it. I went to look at the URL he sent me (by carrier pigeon, for maximum secrecy), and whether because the thing wasn't working or my brain wasn't, its exact function wasn't clear to me. It seems like a potentially useful thing. I don't especially like that so many of the weblogs on my roll are "hidden" in the multi-frame display: why should that be? And if they're truly "hidden" shouldn't there be a way to "reveal" them? There doesn't seem to be one. But it's still a rough beta and the ingenuity is very impressive. Especially nice is that the code seems to recognize separate entries on my crude site, which is hand-built and uses no MT or CSS or XML or anything else that sounds like a model of Ford sedan. (Unless Ford is working on a car called the Notepad.)

Colby Cosh

-------- AUTHOR: David P. Janes DATE: 3:02 AM ----- BODY:
System Up

Thanks for your patience: it's back up, thanks to the timely support of the people at Hostiing Matters. You should have probably been in bed when this all went down anyway!

-------- AUTHOR: David P. Janes DATE: 2:42 AM ----- BODY:
System Down

Hosting Matters is having problems with their MySQL database, which is required for Janes' Blogosphere to work. In the meantime, I have to add better error reporting! Hopefully, we'll be back up in a few hours.

-------- AUTHOR: David P. Janes DATE: 1:12 PM ----- BODY:
Endorsement

BLOGTRACK: Check out BlogTrack (from David Janes), a tool for quickly skimming the contents of many blogs (the link I gave shows you a display starting with our blog, but of course you can use other blogs as the root, too). Here's the summary, by Janes himself:

Looks like a great new option, and one that illustrates the power and flexibility of the Web.

Eugene Volokh

-------- AUTHOR: David P. Janes DATE: 4:53 AM ----- BODY:
Update

Check out the new geographical location aggregation service. Want to read all the blogs in Ontario? Here they are. Los Angeles blogs? Try this.

There's only a small subset of the total blogs we track in here so far, so please add your blog by visiting Template Rewriter and adding your geographical information!

-------- AUTHOR: David P. Janes DATE: 5:44 PM ----- BODY:
Update

Entries are now expired after 20 days, not 10. This should help the "reappearing entry" problem, where Janes' Blogosphere expires an entry, but it's still listed on the blog and thus gets resurrected occasionally.

-------- AUTHOR: David P. Janes DATE: 5:41 PM ----- BODY:
Update

The Template Rewriter now has a more descriptive help system. Known bugs:

-------- AUTHOR: David P. Janes DATE: 3:51 PM ----- BODY:
Endorsement

I was going to title this ALL BLOGS4GOD.COM BLOGGERS, like I occasionally do in emails to the moderators. I didn't because you all know who you are and you've already heard the buzz.

David Janes has designed an aggregator, and has given every blogger here at the portal an opportunity to use it. We started last week in a beta test. It will be useful to your moderators. It will be useful for you. Let's run it down.

... Back? That's who David Janes is and what his BlogTrack does. Mr. Janes and his aggregator are going to get a lot of attention.

Bene Diction at Blog4God

-------- AUTHOR: David P. Janes DATE: 3:40 PM ----- BODY:
Endorsement

RSS FEEDS TO NOWHERE A THING OF THE PAST? Janes' has apparently turned its gaze away from reporting on the latest battleship tonnage totals and is now chronicling the Blogosphere. Er, I'm not sure it's the same Janes', but this site has the most promising aggregator or meta-blog model I've seen yet. Here's the BlogTrack for me and everyone on my blogroll. It may not be as pretty as Radio's, but at least it's free. Although the number of blogs it tracks is fairly limited, it does do a fairly good job of catching up with posts that link back to me, a lot faster than Blogdex or Movable Type's seemingly useless TrackBack feature.

Patrick Ruffini

-------- AUTHOR: David P. Janes DATE: 3:36 PM ----- BODY:
Endorsement

Doing his part to bring order to chaos. Check out Janes' Blogosphere. Here's a sample Blog Track, for Natalie Solent.

Moira Breen

-------- AUTHOR: David P. Janes DATE: 3:31 PM ----- BODY:
Endorsement

Just lookit this. David Janes has created this amazing blog speed-reading tool. It's called Jane's Blogosphere (neat name, too.) See it at Ranting and Roaring. Here's a version tailor-made for my blog. I'm honoured. I had no idea David was working on this, and I don't know whether he's planning to give it away or sell it, but it looks simply fascinating.

ADDED LATER: He's planning to give it away. Nice chap. And the Janes-related names almost generate themselves; look for "Janes' Fighting Words".

Natalie Solent

-------- AUTHOR: David P. Janes DATE: 6:46 AM ----- BODY:
Hi Folks

This is where I'm collecting news, tips and endorsements for Janes Blogosphere.

--------