Adam--
To your Akubra question: We've been working on an Akubra architecture utilizing
multiple filesystems multiplexed together in a binary tree developed from a
custom URI hash calculation, which we're putting into production this month.
The relevance is that we're very interested in bringing Akubra's Quiescing
Storage design into play for HSM (our IT folks currently support a Quantum
StorNext instance which we use for workflow but without strong integration with
the repository architecture), and would be interested in discussing partnership
with another institution to do just that.
An alternative would be to employ a rules-based storage architecture like
iRODS. An iRODS-Fedora integration is available now, although I believe it is
under redesign for Akubra.
I'm thinking that in any naive implementation of your first two designs,
applications that request datastreams from Fedora will have to be aware that
responses from an HTTP request to a Fedora Web service may take as long to
appear as the HSM takes to retrieve material. If your HSM is like ours, that is
often several minutes and must involve a checksum operation for surety. This
implies some sophisticated behavior on the part of requesting applications that
must present some unusual workflow to an end user. ("Click here and wait ten
minutes for download to begin"?) Of course, your HSM may be better than ours
and be able to respond more immediately with more immediately useful
information. If it is and can, those designs make more sense. (And I envy you
your HSM!)
---
A. Soroka
Online Library Environment
the University of Virginia Library
On Mar 21, 2011, at 4:56 PM, Adam Wead wrote:
> Dear all,
>
> I'm currently mulling options as to how to implement a storage solution for
> our fedora repository. For backup and disaster recovery, I'm using Tivoli
> Storage Manager, with a SAN and LTO tape environment. I also have a Tivoli
> add-on called Space Management which acts as a hierarchical storage manager
> and moves files from disk to tape while leaving a portion of the file on the
> disk. We'll be holding a lot of uncompressed video files which won't change
> (ever) and won't be accessed very often. Possible strategies include:
>
> 1. Place Fedora's objects directory atop an HSM-managed filesystem, exported
> to the fedora server via NFS. Tweak HSM policies so that large files are
> moved out to tape, but Fedora can still "see" that they're there. Smaller
> derivative files remain on disk for faster access. Advantages: Fedora just
> gets a filesystem. Everything else is pushed back to the storage layer.
> Drawbacks/unknowns: complexity, performance via NFS and HSM, unknowns, others?
>
> 2. Keep the large files away from Fedora as externally-referenced objects
> sitting on an HSM filesystem; they're pulled from tape when needed.
> Derivative and datastream data stays in the repository. Advantages: simpler
> implementation. Drawbacks: more complicated ingest putting derivative files
> in one place and preservation files in another, maintaining link between the
> object reference and its actual file.
>
> 3. Akubra? The above two examples assume the default low-level storage
> method in versions prior to 3.4. I haven't used Akubra and don't know what
> it's able to do in this regard, or if it's a better option to use with #1 or
> #2 instead of the default objects directory.
>
> From what I've seen on the wiki, HSM directly with Fedora is on the wishlist,
> but not ready. Is anyone out there using Fedora with HSM in some way,
> doesn't have to be IBM specifically, and could offer suggestions, advice,
> caveats?
>
> thanks in advance,
>
> ...adam
> ____________________________________________
> Adam Wead
> Systems and Digital Collections Librarian
> Rock and Roll Hall of Fame and Museum
> 216.515.1960 (t)
> 215.515.1964 (f)
> ------------------------------------------------------------------------------
> Enable your software for Intel(R) Active Management Technology to meet the
> growing manageability and security demands of your customers. Businesses
> are taking advantage of Intel(R) vPro (TM) technology - will your software
> be a part of the solution? Download the Intel(R) Manageability Checker
> today!
> http://p.sf.net/sfu/intel-dev2devmar_______________________________________________
> Fedora-commons-users mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/fedora-commons-users
------------------------------------------------------------------------------
Enable your software for Intel(R) Active Management Technology to meet the
growing manageability and security demands of your customers. Businesses
are taking advantage of Intel(R) vPro (TM) technology - will your software
be a part of the solution? Download the Intel(R) Manageability Checker
today! http://p.sf.net/sfu/intel-dev2devmar
_______________________________________________
Fedora-commons-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/fedora-commons-users