The Minutiae of Life

A good friend of mine has arranged access to the digitised records of the New York Emigrant Savings Bank for 1850-1883. What a wondrous treasure trove of information! These records contain the deposit details for thousands of newly arrived immigrants to New York from 1850 onward. The bank was established by the Irish Emigrant Society and served a largely Irish population. Amazingly, the Emigrant Savings Bank is still around, holding about $15 billion in assets.
These older records are an immediate resource for genealogists. In addition to transaction details, the records include a ‘test book’ which contains information on place of residence, spouse and children, occupation, and other nuggets of information [1]. This information was compiled when a depositor wished to send money back home to Ireland. I am particularly fascinated by the ledgers, which record deposits and withdrawals for large groups of people over a substantial period of time. There is a huge further digitisation project waiting here: entering data from these records into formats that allow for further study.

  1. Check out the finding aid from the NYPL referenced above for more details.
Tags: Genealogy, History

Open Source Genealogy

I have been searching for ways to improve my genealogical research. I set two specific criteria for my search:

  • A cross-platform browser/editor that uses GEDCOM files natively;
  • A means to share genealogical data in a free and open manner

Two open source products have emerged that work together to meet my needs: PHPGEDViewer (PGV) and Genesis (an open source PGV research tool), which is part of the Distributed Family Tree Project.

Tags: Genealogy, HCI, Info Architecture

The Secret Life Underground

Russos at LiveJournal posted an absolutely exquisite set of photographs (many HDR) of the deep underground in Moscow. Many relate to subway construction, repair and abandonment. Others seem to show deep subterranean natural caverns. Absolutely amazing views of things we never see. Thanks to English Russia for catching these and doing some translation so English readers can appreciate what it is we are seeing. By the way, unless you read Russian (I will admit that I do not), use the English Russia link, as it gives the full set as well. I am sure that they are available on the Russos site, but I cannot navigate the Cyrillic. There’s another set of photos at Russos for which I don’t have a translation, and I sense it might even show an abandoned station. An interesting contrast to the abandoned TTC one that is expected to draw crowds.

Tags: Architecture, Genealogy, Russia

Collaborative Record Matching

I have of late been exploring various means for the automated longitudinal matching of census manuscript records. It’s a huge challenge, and I seem to have spent as much time identifying potential problems as identifying potential solutions. This is not to say I haven’t pondered a couple of solutions, but the list of challenges remains much longer and seems to be growing much faster. All this means is a more challenging research problem, demanding some innovation in methodology. Fun!

But there is a paradigm shift happening, one that I have been participating in and certainly embrace, but am not always cognizant of. The idea of online collaboration continues to permeate more and more of our everyday tasks. It emerged from specialized research objectives such as the SETI@Home initiative, which sought to use excess personal computing capacity distributed around the world, and has grown into efforts today that take advantage not only of excess processor cycles but also of the masses themselves, engaging them to carry out specific manual tasks.

I started playing with the Google Image identification programme a few months back. If you haven’t tried it, it basically matches you with a random online user, and you spend 90 seconds typing in words to describe a picture displayed to both users. You quickly type words that come to mind until both users type in the same word, at which point the engine accepts that word as a likely relevant descriptor. The key to participation is that the exercise is fun and fast, and you can hop on at any time; given the global scope, you will quickly be paired with an online user. Moreover, you have the small satisfaction of being part of a bigger exercise of improving the descriptors attached to Google’s image search repository.

This little ‘game’ also clearly illustrates one of the downsides of Google’s repository, as these descriptors are determined through a process which renders them simple rather than specialized. As I ‘play’ I realize that I may recognize the image as a particular movie poster, but I also think that my online partner may not catch the subtleties, so I resort to simply choosing a predominant colour as a suggested word, rather than the name of the movie or, say, an actor in the movie. In other words, I choose the more obvious descriptor to encourage a faster match. The objective in the Google match is to match words for the highest number of images during the 90-second period, which may not achieve the best descriptions. However, the process does deliver some basic descriptive terms that an automated process would miss. The key is making it fun for the participants.
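To make the agreement rule concrete, here is a minimal sketch in Python of how a single image round might be scored. This is my own illustration, not Google’s code: the function name, the event format and the idea of treating the 90 seconds as a simple cutoff for one image are all assumptions.

```python
def first_agreed_label(events, time_limit=90):
    """Return the first word typed by both players, or None.

    `events` is a list of (seconds, player, word) tuples where player is
    "A" or "B". The format and the per-image time limit are assumptions
    made for this sketch, not details of the real game.
    """
    seen = {"A": set(), "B": set()}
    for seconds, player, word in sorted(events):
        if seconds > time_limit:
            break  # the clock ran out before the players agreed
        word = word.strip().lower()  # compare words case-insensitively
        other = "B" if player == "A" else "A"
        if word in seen[other]:
            return word  # both players have now typed this word
        seen[player].add(word)
    return None


round_events = [
    (2, "A", "red"),
    (3, "B", "poster"),
    (5, "B", "man"),
    (6, "A", "Poster "),
]
print(first_agreed_label(round_events))
```

Notice that nothing in this rule rewards a specialized word over an obvious one; the first overlap wins, which is exactly why the resulting descriptors tend to be simple.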

In this same vein, Kris Inwood pointed me at a census initiative, Automated Genealogy (AG). Working from the same premise of trying to ‘funify’ a process requiring mass user intervention, the site is a meeting point where genealogists sign up to manually enter manuscript census records into a database. The hope here is to engage that vast army of genealogists out there to contribute time to help their fellow genealogists and gain access to records which benefit their own research efforts. Collaboration at its best. Additionally, they have begun a similar process to match Canadian manuscript census records between the 1901 and 1911 censuses. This is the same task for which I have been ruminating over developing an automated process. At AG they are using automated means to do simple matching and then allowing users to refine the match where human discretion is required. This is a clever approach to a real-world research problem. As to progress, the published results indicate that they have transcribed 93.15% of the entire Canadian census for 1911 and 99.99% of the 1901 census, with 55.15% of the proofing carried out on the latter.
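The division of labour described above, automated simple matching with people refining the uncertain cases, might be sketched roughly as follows. To be clear, this is not how Automated Genealogy actually does it; the record fields (`name`, `age`), the thresholds and the function names are all hypothetical choices of mine for illustration.

```python
from difflib import SequenceMatcher


def name_similarity(a, b):
    """Rough string similarity between two names, from 0.0 to 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def candidates(person, later_census, years_apart=10,
               age_tolerance=2, min_name_score=0.85):
    """Later-census records that plausibly describe the same person.

    The thresholds are illustrative assumptions, not tuned values.
    """
    expected_age = person["age"] + years_apart
    return [
        other for other in later_census
        if abs(other["age"] - expected_age) <= age_tolerance
        and name_similarity(person["name"], other["name"]) >= min_name_score
    ]


def link_records(earlier_census, later_census):
    """Split links into automatic matches and cases needing a human."""
    auto_matched, needs_review = [], []
    for person in earlier_census:
        found = candidates(person, later_census)
        if len(found) == 1:
            auto_matched.append((person, found[0]))   # unambiguous
        elif len(found) > 1:
            needs_review.append((person, found))      # human discretion
        # people with no candidates are simply left unlinked here
    return auto_matched, needs_review


census_1901 = [{"name": "Mary O'Brien", "age": 12},
               {"name": "John Smith", "age": 30}]
census_1911 = [{"name": "Mary OBrien", "age": 22},
               {"name": "John Smith", "age": 40},
               {"name": "John Smyth", "age": 41}]
auto, review = link_records(census_1901, census_1911)
print(len(auto), "automatic,", len(review), "for review")
```

Even this toy version shows where the problems pile up: spelling variants, misreported ages and common names all push records out of the automatic list and into the review queue, which is where a crowd of volunteers earns its keep.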

This is a great example of the emerging trend to mobilize individual efforts en masse to assist with processes that in the past would have been carried out by a small group of specialized researchers. Both processes recognize that tasks can be divided, with different and appropriate resources applied to each stage. Mass collaboration on simple tasks made fun!

Tags: Business Idea, Census, Genealogy, Info Architecture, Technology

1891 Census Project Passes Milestone

On Tuesday, I had the pleasure of meeting with Kris Inwood, Director of the 1891 Census Project at the University of Guelph, along with his staff at a review of this exciting project.

Census project staff have been entering data since 2002 and as of last Friday have completed the data entry phase. They have compiled a database comprising 328,000 records which represents a 5% sample of the entire population of Canada in 1891. They have oversampled in certain urban areas as well as in the west of Canada to 10%. There is also a 100% capture of group quarters (households with more than 30 residents indicated in the manuscript census records). The next step in the project is to begin coding columns such as religion and occupation to allow for systematic use by researchers.

Over the life of the project, participants have also been conducting research on their own interests using census data. A number have completed very interesting papers examining topics such as the character and nature of the enumerators, the foibles of the enumeration process, the methodology involved in locating aboriginal persons in the census and a survey of contemporary newspaper coverage of the census itself.

Equally impressive, many of the participants have contributed to a series of mini-biographies of individuals and families in the census, which will hopefully be shared via the census website. These papers illuminate the human side of manuscript census records, and they also provide very useful case studies demonstrating how census manuscript data can be used in a variety of research contexts.

Kris suggests that they are very close to being able to provide researchers with the opportunity to begin to use this data outside the project and avenues are now being explored to provide systematic dissemination of the dataset.

Tags: Census, Genealogy, Info Architecture

Making Connections

Genealogy remains one of the more popular pastimes in modern culture. Embracing Web 2.0 and Ajax comes Geni.com, which is quite viral. It offers a very easy guided data entry process geared towards encouraging contact with relatives to have them fill in their own information and gradually flesh out a very comprehensive tree. It’s extremely fun to play with, which is enhanced by the immediate feedback that you get from seeing the tree evolve. It’s quite intuitive to use. I may actually share it with a couple of relatives and see how well the collaborative effort works.

Tags: Culture, Genealogy, Technology