Wednesday, October 22, 2008

birds are weird

Okay, this is going to be complicated.

Scanning: working my way through the set of Australian Hand Weaver and Spinner that we had originally agreed would be my "test set," but I am thinking more and more that they are probably going to be THE set, because all this other stuff is taking so long. That's good because it's a finite collection of not-too-unreasonable size, but it's disappointing because I was hoping to get a lot more done.

OCR: still under consideration. We were supposed to have a meeting with the guild officers and put all these things before them, so that no one can later fuss at Sue for not having been privy to all the decision making, but Sue's son got married this past weekend and she had to cancel our meeting. We have not yet rescheduled, so idk if we are going to meet this coming Sunday or the 2nd as originally scheduled. She's still thinking about OCR, though, in the sense of "how usable will the end product be?" I ran a journal through Abbyy Finereader, and the results looked pretty good, but I don't really know how to use Finereader, so if they want word docs or something at the end of all this, I'll need to figure out how to do that.

Storage/server: Ugh, this is killing me. What I need is a way to store these files I am creating, these PDFs that are averaging 17MB in size. I was trying to install Fedora to use that, but a) installing Fedora was really freaking hard and I was in over my head, and b) I realized that was the wrong way to go about it because having that installed on my machine won't do them any good. So I started looking for a way to make some sort of online storage system work, and I am at a loss.

The situation is: They don't have a machine of their own, or a server, so there's nowhere for me to install a system like Fedora or Greenstone. They have webspace because of their domain (wssaustin.org), but I can't, like, install a thing to the space because it's just space, not a server. I looked into google sites, which I thought would be a great storage idea, but they have a 10MB limit on the files you can upload there. I also investigated google base, but I am pretty sure that anything you upload to the google base becomes publicly available to everyone using it, and there's too much copyright uncertainty for that. So then I started looking at online file management systems, like, I could utilize the space they already have that goes with their domain, but getting into that realm is overwhelming and I feel like real quick you get into the territory of just buying a system.

Sue has said that if I present a good argument for it, they'd probably be willing to pay a monthly fee for space, but I can't come up with something that would be appropriate--and somebody already pays for wssaustin.org.


I am at a loss! I was thinking before that I would figure out the server/storage space problem down the line, but now it is apparent that while I can keep scanning till the world looks flat, I really need to be also coming up with a plan for where this stuff's going to go, before I run out of time.


I am frustrated because my goal was to digitize and provide an organized system of storage and use, but more and more it is looking like my project is just turning into digitization. I really want to be able to build a library, or storage system, or whatever, because this is the sort of thing I'd like to do, but I am frustrated by my lack of resources and I keep coming up blank. I need advice! HELP D:

Thursday, October 16, 2008

PLACEHOLDER

I am going to write real things about my project very, very soon, but not tonight.

Tuesday, September 30, 2008

oh my god i can't believe installing fedora is so hard

Monday, September 22, 2008

Update #Next

Had meeting with SP yesterday afternoon. She wanted to know if my final products were going to be images or text; she is concerned, and thinks the committee of officers will be concerned, about the end functionality of the digitization project. She poses the question, "How usable do we want this to be," referring to whether relying on well-tagged items will be enough, or if everything should be text searchable. It occurs to me as I write this that argument #1 for sticking to the original plan of scanned images is that if you want it to be a good resource, you have to have good metadata. Obviously I am no expert on weaving terminology, so that would have to come from them (subjects, keywords, etc).

So, my current tasks are:

1. Create a "talking points" memo covering pros and cons of images vs. text; available, effective OCR software; what constitutes a "file" and a "document," and what standard file size, I guess, is; what will be necessary for the WSSA e.g. server space, a dedicated guild computer, etc. I think there is something else I need to cover in this paper but my notes are at home.

2. Figure out what sort of software to use for the collection. Fedora is another option I have been looking into; its advantage over greenstone is that imaged scans, such as the pages of a journal, are clearly associatable with their parent images in a hierarchical layout. Also, I have been looking at the documentation for both and it will not be easy but I think I have a better chance of getting fedora up and running than greenstone. Possibly.

3. Start the scanning so that at least I can get that part going, so that I have something to work with as soon as everyone comes to a conclusion about point 1.


Re: OCR research, I am going to go into the Rebecca wiki because I remember Russell posting results of his investigations into finding online OCR resources; also Liza told me there is Adobe OCR software on the lab computers.

Friday, September 19, 2008

caaaaaaaapstone report 1

I, uh, guess I'm going to reuse this old artist-formerly-known-as-blogspot that I had lying around.

Initial objectives:
-acquire equipment necessary to do this project!
-research web standards re: digital images. LoC standards? is there a universal preferred size? etc?
-try to find existing free digital library software that is suitable

Completed:
-purchased new iMac!!
-brought home first batch of journals to scan as a pilot test
-looking into using greenstone. I am not sure if this is something I can do myself, or even if I have a place to put it, but I am keeping it in mind because I think it would be really cool.



Meeting with S.Plattsmier this Sunday afternoon. Hopefully will have some conclusions re: greenstone to relate to her.

Note: SP has mentioned wanting to tie the digital images I'm creating in to the existing LibraryThing catalog, but I think that is for a future Capstone project.

Friday, December 16, 2005

slam!

pizza shooters
taco pops
steak zaps
cheese fluffs