Storing data may seem like a simple idea, but as most of us are well aware, it is anything but. I have worked at MATRIX for the last 3+ years, including on the redesign and rewrite of KORA, the digital repository software at MATRIX, so the pains of storing and safeguarding data are very well known to me. I will be talking about the issues surrounding building a system like KORA and how computer scientists (usually) view data very differently from humanists. There are many other issues related to disseminating the data we store and to the ease of use of these systems. I may also talk about the following topics (in no particular order):
- Open source software (specifically running a project like KORA)
- Dealing with the never-ending questions like ‘can you add twitfacefliksquarespace to your project?’ and the implications of integrating with open (and closed) data sources
- Undergraduate programmers – they rock!
- How do we determine the ‘best’ way to store metadata in schemes beyond well-defined ones (such as Dublin Core)?
- Data access, data restrictions, data vetting, pretty much anything about data
- The double-edged sword of your (or your IT/CS person’s) favorite programming language
- Making use of your data beyond its original purpose
Right now I am still putting together a final topic list, so please feel free to suggest additional topics in the comments, or to suggest that I remove something. I am currently engaged in research on similarity measures for both audio (my master's thesis) and images (the Quilt Index / Digging into Data), and I would be happy to discuss anything related to that as well.