Thursday, June 09, 2005

Still More Data to Tidy Up

Well, I finished finding the existing customers lost in our data (I managed to match up about 6,800; the rest appear to be redundant customers!?) and started putting together the statistics, when I hit a small hitch: some duplication of property references in our data. I did anticipate that it might happen, but hoped I'd find a way to strip the duplicates out of the database automatically. Because they aren't clones but different buildings at the same address, though, the key fields didn't correlate. That means I have to remove the duplicates, and since it is difficult to tell the 'real' customer from the dups, each one has to be viewed by a human. Luckily there are only 3901 of these issues, I've already written the tool I need to correct them, and I'm already down to 1900 left to process. Then those stats should flow...
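
A rough sketch of the kind of check involved, for the curious. This is not the actual tool, just a minimal Python illustration; the sample rows, the field layout and the function name `find_duplicate_refs` are all made up for the example. It groups records by property reference and keeps only the references that turn up more than once, which is the pile a human then has to look through.

```python
from collections import defaultdict

# Hypothetical sample rows: (property_ref, address, customer_name).
# The real data and field names are not shown in this post.
records = [
    ("P001", "1 High Street", "A. Smith"),
    ("P002", "2 High Street", "B. Jones"),
    ("P002", "2 High Street", "C. Brown"),  # same reference, different customer
    ("P003", "3 High Street", "D. Green"),
]


def find_duplicate_refs(rows):
    """Return {property_ref: [rows]} for each reference used by more than one row."""
    by_ref = defaultdict(list)
    for row in rows:
        by_ref[row[0]].append(row)
    # Keep only the references that appear on two or more rows.
    return {ref: group for ref, group in by_ref.items() if len(group) > 1}


# Everything in review_queue needs a human to decide which row is the real one.
review_queue = find_duplicate_refs(records)
for ref, group in review_queue.items():
    print(ref, len(group))
```

The point of returning the whole group rather than just the reference is that the reviewer needs to see the competing rows side by side to pick the 'real' customer.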

Debian 3.1 'Sarge' has finally been released. That's good, because when I upgraded the database server I installed a snapshot release of 'Sarge' from March 25th. Now I can get the finalised disk images and get security updates sometime in the next few days.

In other news, I got the latest episode of Hitch Hikers Quintessential Radio Series. I am thoroughly enjoying it; I'm just not happy there are only 2 more episodes. There was a lot I had forgotten about the story. I thought Arthur would have been hunting 'Perfectly Normal Beasts' and making sandwiches by now, though. I did notice a few odd references to Arthur making sandwiches, so I guess that is building up. Oh, and Random has yet to come into the story.

I'm starting to think about my next OU course. I'm not sure what I'll do, but I want to aim to be doing at least 90 points. So far the courses I'm on are only worth 60 points, and I know I can do more work. I've found the courses so far to be extremely good. I'm enjoying them totally, and my marks seem to reflect that.
