• Campers

    Matt Geimer

    CTO of MATRIX and Ph.D. Student at Michigan State University, actively involved in engaging undergraduates in computer science with digital humanities.

    My Posts

    Data, KORA, and Ponies Session Notes

    Saturday, March 20th, 2010 | mgeimer

    Contact me (matt dot geimer at matrix dot msu dot edu or @herrgeimer on twitter ) for any questions about KORA!

    Things we talked about:

    • Data – how it affects your overall project/scheme layout
    • Stuff about KORA
      Open Source digital repository / digital archive  / content management system
      Very flexible – allows you to do almost anything w/ digital content (regarding storage/representation at least)

      Based on the idea of open access to content made easy to get to
    • Things I didn’t get to talk about but would love to facilitate
      Open source software (specifically running a project like KORA)
      can you add twitfacefliksquarespace to your project?’ and the implications of integrating into open (and closed) data sources
      Undergraduate programmers – they rock!
      How do we determine the ‘best’ way to store metadata in schemes beyond well defined ones (such as DublinCore)
      Data access, data restrictions, data vetting, pretty much anything about data
      The double-edged sword of  your (or your IT/CS person’s) favorite programming language
      Making use of your data beyond it’s original purpose
    • Ponies – there were indeed pictures of ponies 🙂

    Overall it was a more ‘traditional’ talk – mostly Matt talking about KORA, but software demos are like that.  Questions were answered about search performance, issues addressed included using other indexing applications to provide additional search possibilities.  There should be a ‘howto run KORA on a ubuntu server start to finish’ up on the KORA site soon.

    Dealing with Data, Digital Repositories, and KORA

    Thursday, February 18th, 2010 | mgeimer

    Although it may seem like a simple idea to store data, the reality that most of us are very aware of is that the opposite is the case.  Having worked at MATRIX for the last 3+ years, including the redesign and rewrite of KORA the digital repository software at MATRIX, the pains of storing and safeguarding data  are very well known to me.  I will be talking about issues surrounding building a system like KORA and how computer scientists view the data very differently from the humanists (usually).  There are many other issues related to dissemination of data that we store and ease of use of these systems.   I also plan to possibly talk about the following topics (in no particular order):

    • Open source software (specifically running a project like KORA)
    • Dealing with the never ending questions like ‘can you add twitfacefliksquarespace to your project?’ and the implications of integrating into open (and closed) data sources
    • Undergraduate programmers – they rock!
    • How do we determine the ‘best’ way to store metadata in schemes beyond well defined ones (such as DublinCore)
    • Data access, data restrictions, data vetting, pretty much anything about data
    • The double-edged sword of  your (or your IT/CS person’s) favorite programming language
    • Making use of your data beyond it’s original purpose
    • Ponies

    Right now I am still in the process of getting a final topic list put together – so please feel free to suggest anything in the comments either adding additional topics or suggesting I remove something.   I am currently engaged in research in both audio (my Masters Thesis) and images (the Quilt Index / Digging into Data) regarding similarity measures and would be happy to discuss anything related to that as well.