Thursday, July 03, 2008

Simple persistant databases

One of my concerns with this project was the requirement of storage information from previous searches (and the current search). A RDBMS (relational database management system) such as MySQL and PostGreSQL seem heavy especially using JDBC (Java database connection) and potentially slow. A RDBMS might be required if there a large number of users all accessing a web application/service, however to experiment with various algorithms for adaptivity then something lighter seems better.

One example I have found that fits the bill is JDBM (Java Database Manager) which is "a transactional persistence engine for Java" and can store objects (as well as BLOBs). In an object orientated environment this makes more sense than the conversion of objects to entities (rows or records) and the use of procedural code to do he transaction.

Monday, June 30, 2008

Personalisation vs. Serendipity

One of the advantages of the current Web search engines is the possibility of finding stuff that just wasn't on our radar. Of course there is also the disadvantage of wading through vast quantities of unwanted dross. The current method of ranking the results of search queries, based upon the population of user's click-throughs, is pretty good at returning the most popular results and suffices most of the time, if you can get the query term right. However, by personalising (or adapting) to the individual user's needs and wishes it would be possible to prioritise the results of the search or perhaps reduce the volume of information. However does the adaptation reduce the opportunity of finding something unexpected?

If something like the "I'm Feeling Lucky" button on returning a serendipitous list of results, rather than the one resultant Web page currently by Google, could allow the user to find a wider and weirder match to the search query. Is the chance find more important than getting to what you really really want fast and first time round? The opportunity of reducing the time take to find is more important as the user is in focused "search mode" rather than a relaxed "surf mode" where the user is more likely to follow the serendipitous links.

Although serendipity is fun, expands our knowledge and wastes our time, targeted searches can reduce the frustration of not finding the information we seek giving us time to follow those happy accidents.

Sunday, June 29, 2008

A Reason for Me to Blog

This blog has been inactive some time. I have considered why I don't blog (& why others do), and that might be the topic of a future post, but in the mean time this blog has been reactivated as a public interface to my MSc (in Computer Science) summer project. The title is

SearchMe: Towards Adaptive Searching of The World Wide Web for the Individual.

This topic is not far from the original concepts of the Personal Hyperspace so it seems appropriate to use this blog.

I would be immensely grateful for any thoughts and comments you may have on any of the posts, many of which will be questions for which I currently do not have answers.