OSArchive

From alpha to beta

Posted on 17.1.2014 by Mikko Lampi

With the year 2014 the winter has finally came to Finland. In addition to that, we have development running at full speed. Our target is to hit the beta release at the beginning of March. The roadmap is a combination of user feedback from earlier agile sprints and specifications made during Capture project. The main focus with the beta sprint is on the user interface, search and indexing, ingest and management of the data. All the core features are in place already since alpha release, but it required some knowledge and good understanding on how things are in the early stages of software development.

I will post some previews of the user interface later. The interface will be in English and Finnish. It can be localized and customized easily to support your desired language, regional preferences and organization policies. Effort is also put into making the interface user friendly. Since the early pre-alpha versions, we have iterated the user interface development with the designated users. Stay tuned for the interface mockups or even screenshots.

Searching and indexing will be implemented mainly with Solr. It is the de facto open source engine for this kind of content. Also Finna (nice user interface for accessing Finnish archives, libraries and museums) uses Solr. Traditionally Finnish has been somewhat tricky language but now we can have a good support and we can benefit from open source work done already. We have a working integration in between our data repository (Fedora Commons) and Solr but we need to work on indexing metadata and rich text contents and building end-user features. We will build a new kind of search and browse interface based on faceting and visualizing the data instead of just an empty search box waiting for you to know what you need to input. Of course the Google-like search box will be there also if you like it better.

The third large sprint is to ready the tools for ingest and management of the archive data. We will build a workspace that will contain pre-ingest and discovery tools, manual ingest, workflows and batch ingest as well as management of the existing data. Again, we will make it highly configurable and suitable for ingesting data of any kind; Not just documents or strictly formatted metadata. For beta and pilot phases, we will introduce only reference features but they can be extended and will be well documented.

Lastly, the simple workflow engine has been now completed and will be published as a standalone project on Github later this year. The developer, Heikki Kurhinen, wrote it for the project as his thesis work. The engine will be featured in later article as soon as we get it integrated with our main software.

I will write more on the topics introduced as our team progresses.

Mikko

Posted in Data management, Software development | Tagged agile software development, beta release, digital repository, Open Source, software, software development, Solr, user friendly, user interface, Workflow, workflow engine | Leave a comment

Alpha release

Posted on 1.11.2013 by Mikko Lampi

Last week we reached a major milestone and released the first alpha version of our software. We defined the alpha as functional software which wraps up developments made during sprints last summer. The text focus is iterative improvement, improving code quality and public beta release by the end of February.

Last time I wrote about agile development and experiences from development sprints in summer. While the results were good and mostly positive, there is always room for improvement. I think better code management and profound unit testing need to be involved. Now it took a couple of weeks integrating, testing and bug fixing to get there. Sure there were also iterations based on early feedback. But this is something not to overlook.

The alpha release if primarily targeted for project partners who participate in early development. We don’t want to scare the public with our hideous development interfaces. The use of the current user interface requires a good understanding about the underlying processes and functionality. Beta release will introduce new user interface and better quality code and documentation. If you’re interested, this is the time to get involved. We will also publish a live demo.

Alpha features include:

Metadata model (known as Capture model)
Fedora Commons content models (document models, place, event, action, agent)
Metadata management (forms, validation, indexing, linked data)
Web based ingest forms
Archive file management (archive copies, attachments, previews for images and PDFs)
Search and browse user interfaces
Full-text indexing
User and access management
Ingest and data filtering workspace prototype
Feedback module with screen capture
Developer friendly user interface

And there is much more going on under the hood. Comment or send email if you’re interested in those.

Mikko

Posted in Software development | Tagged agile software development, alpha release, development, digital repository | Leave a comment

Modeling batch ingest process and developing workflow engine

Posted on 17.10.2013 by Heikki Kurhinen

I worked as a programmer for the OSA-project during the summer, however because of our fast developing pace, I never found time to write to this blog. Now, when I’m back at school for the last year of my studies I decided to do my bachelor’s thesis for the OSA-project.

The topic is going to be developing (hopefully) simple workflow engine which is going to handle the execution of micro-services. The communication with the workflow engine is going to be developed using REST that is because then user can develop their own UI or easily integrate this workflow engine into their projects. Also there needs to be possibility for timed runs and different triggers, for these I’m going to use Quartz scheduling library. Also possibility for using the workflow engine as a distributed application is something that needs to be researched.

I’m also going to create a batch ingest workflow and required micro-services as an example on how to create workflows with my engine. That project started by modeling the batch ingest workflow. Because I don’t have much archiving experience I started out by looking at other archiving systems like Archivematica and Islandora. After inspecting these systems and doing my own research I could start doing some modeling.

To do the actual model I used software called yEd which was free and easy to use. I didn’t want to use too complex modeling methods because I think it is important to be able to create models which can be understood also by people without deep technical knowledge. For example someone who has worked with archiving for long time but doesn’t understand that much about programming or process modeling can still see what is going on in the workflow and may be able to provide important knowledge about archiving.

Here is the current model:

Of course later this process will be fully configurable but these are going to be settings for now.

Posted in Software development | Tagged Open Source, OSA, Representational state transfer, Workflow | Leave a comment

Agile and user focused development

Posted on 16.9.2013 by Mikko Lampi

Agile development is not only hype. Our experiences on the approach this summer were very positive. We went from technology proof-of-concept to the alpha version very agile and fast, only in four months. Now, we are almost there. Just a small round of bug fixing and polishing. We are already online with private proto and hope to make the beta version a public release by the end of the year.

For many businesses and organizations, user focused development and user focused products are an important strategy. Likewise Mikkeli University of Applied Sciences has made it a key part in its strategy in digital archiving and services. The other two are the productization and development of digital information management and archiving.

I had a presentation about agile and user focused development a few weeks back at the Liikearkistopäivät 2013 (it’s a three-day event organized by Finnish Business Archive Association, FBAA). Below are presented the key points and some experiences from our sprint.

For those who understand Finnish language, here is a small comic relief for starters.

There are two drivers which will usually lead to weak results and defocus the user: management (bureaucracy) driven development and IT driven development. Though both are excellent tools. It is also known (for 40 years) that the waterfall model is not suitable for software projects. The development should always be driven by the need and the people who would use it. By making the development process open and transparent, you can engage people and eliminate the resistance to changes. You could even make the solution open source. But opening software or decision making is not a shortcut to success. There needs to be a clear understanding of the goals and means to achieve them.

It should not be surprising how much attitude problems can escalate. The common problems include:
“What an end user could possibly know about designing and implementing an IT system ” “We cannot show the unfinished software to the customers”
“User research is too expensive and humbug”
“Our business is a special case. We cannot use any standard solution”
“The software should adapt our legacy processes”
The list could go on and on. Both sides, the customer and the provider, have attitude issues which need to fixed.

Responsibilities should be made clear to all involved sides. It is much easier to know who is responsible for what if the whole project is managed in small pieces. Dividing colossal projects would be a good start for at least the public sector and the government. The idea is not that hard and even some Finnish ministries have added user friendliness to their strategies and recommendations.

There are tens or hundreds of various methods for user research in any point of the process. The complete life cycle of products, projects and processes can be managed. Website like All About UX contain very useful information. http://www.allaboutux.org/

Agile development is not a single methodology but more like a term that covers a wide range of totally different approaches. They should all share the idea behind the Agile Manifesto. http://agilemanifesto.org/

In our summer sprint, we made use of feedback driven development and prototyping features in a rapid pace. The test users were the actual end users of the future system. We could make changes to the implementation on a weekly basis and could add or remove features without cumbersome change processes. It is the only way to build software for businesses without being an actual professional in that said business; And we software developers are rarely.

Mikko Lampi

Posted in Software development | Tagged agile software development, software, user focused development, user friendly, waterfall model | Leave a comment

Summary of 2013 first half

Posted on 14.6.2013 by Mikko Lampi

This year has been full of development and fast progress. Unfortunately, it has meant less time to write on the blog. In this post, I’ll brief you on the current status and latest news.

Open source in content management seminar
Last week we had a very interesting seminar about open source in general and at memory organization point of view. As a keynote speaker we had Michael ‘Monty’ Widenius talking about open source and business. MySQL is an excellent example of how to make open source worth your while. The key points in his speech were community, patience and understanding the ecosystem and users. As Monty said, open source supports well human nature. In a nutshell, you need to work and pay less for getting more value. But there is also difference of being free as a beer and being free as a speech. Open source is the sustainable way of operating in our business (among others); And what Monty said was in line with our views.

Other speakers included a representative from the Ministry of Finance, Mikael Vakkari, who told us how European Union and public goverment regulates and encourages the public sector and administration to use open source. The course is right but it takes time to get there. We heard also about experiences on building open source communities from our local partner Otavan Opisto. It confirmed what Monty had said: it takes time to get people involved and usually it can’t be taken for granted. A common method is to involve a community manager to inspire and gather people together, as well as coordinate the operations.

The director of Brages Pressarkiv, Jessica Parland-von Essen, emphasized that open source is not only about technology and software but about trust and knowledge. Anything that is open and public can be inspected and reviewed. People tend to distrust the unknown and secrets. Open source on the other hand helps to make things like licensing understandable. Anyone could (at least in theory) see how open system works. This does not mean that the data should be open or the system should be accessible to the public. A good example is Linux operating system. While the system itself is open, your personal data is yours only.

Finally, the seminar concluded with a discussion panel on the future of memory organizations and the democratization of the Internet; and whether the archives should or should not adapt. It was an interesting topic and it was quite clear that some changes are happening in the near future.

Theses
A few theses have been made or being made about the topics OSA covers. They were introduced in the previously mentioned seminar. There are two theses which studied the usability, service designs and user interfaces of the light archive system. Tytti Vuorikari studied the service concepts, service blueprints and various user profiles and Outi Hilola’s work concentrated on the data visualization and the user interfaces for finding and processing information from the archives.

This kind of research and field study is a necessity when using an agile approach. The said theses helped us to understand the needs and focus the development work on areas requiring the most attention. Methods included interviewing, tailing, benchmarking and organizing workshops. The results were then analyzed and restructured and described. Benchmarking included some related projects like National Digital Library’s Finna. Available also in English at https://www.finna.fi/.

There are some technology oriented theses under work. I will write more about them in the near future. The topics include building a complete virtualized private cloud to provide either platform as a service or infrastructure as a service.

OSA roadmap
Below is presented the estimated roadmap of the project milestones.

We are currently about to release the first cycle of development builds. There will be a limited release for our project partners. Beginning from alpha or beta release we could provide access to others as well. As this is open source project, the code and other products will be available on request and eventually in Github or the like.

Currently, we have all the essential software in place. These include Fedora Commons, Solr with Fedora integration, MariaDB, LDAP and a web application to wrap them together and to provide easy access. We also have a rough user interface and initial services for ingesting and accessing objects. During the summer, we hope to advance very rapidly. The initial plan is to release a new development build for testing each week and work incrementally until we reach the alpha version at the end of August 2013 and the beta at the end of the year. At that time, we should be pilot ready.

Mikko Lampi

Posted in Archive system, Fedora Commons, Project Information, Software development, Uncategorized | Tagged Fedora Commons, memory organization, Michael Monty Widenius, Open Source, open source communities, Roadmap, Seminar, software, Thesis | Leave a comment

Seminar: Open Source in Content Management

Posted on 30.4.2013 by Osmo Palonen

Our project will organise an open seminar on 7th June 2013 in the Mikkeli University of Applied Sciences Main Campus in Mikkeli. The seminar Open Source in Content Management will open at 11.15 and the venue is Mikpoli hall. For our international readers I have to confess that the seminar will be in Finnish. We consider to record the seminar and stream the speeches with the help of digital archive services of MUAS. In that case we will inform you later.
The programme consists the keynote speech by Michael Monty Widenius, the founder of MySQL and MariaDB. Mikael Vakkari of the Ministry of Finance will explain the Open Source policy and practice in the Finnish public sector and Antti Leppä of Otavan Opisto will tell the long history and experiences using Open Source in the distance and blended learning, for example.
The second theme is called Open Source and Archives. Dr. Jessica Parland-von Essen, Chief of Archives at Brages Pressarkiv, will tell first about the expectations the archives have for Open Source solutions. The second speaker Osmo Palonen, Project Manager of MUAS, will tell about the ideas behind the Project OSA. The rest of this theme will be connected in the activities done and planned in the project: Mikko Lampi, Lead Engineer of the project will report the studies that are made in the project and in general reported in this blog and also draw an overview of the architecture and Open Source solutions to be build in this project. The students and their teachers will have also an opportunity to shortly present the bachelor’s thesis’ that are in connection with the Project OSA.
At the end of the seminar will be discussion about openness of archives and expertise. In that we like to challenge the memory organisations to find out how the world is changing when the status of archivists and museum specialists as gatekeeper of information is fading away.
The programme of the seminar in Finnish can be found on MUAS website http://www.mamk.fi/ajankohtaista/tapahtumat/101/0/avoin_lahdekoohdi_aineistonhallinnassa_-seminaari_7_6.

Posted in Archive system, Data management, Digital archiving, Linux, Project Information, Software development | Tagged communications, Database, Open Source, public | Leave a comment

Selecting an open source framework for web application

Posted on 26.3.2013 by Liisa Uosukainen

We have been choosing the framework for OSA web application. We had a few requirements: we want to use Java, it must be an open source application development framework, and framework should be scalable and easy to configure (no complex XML configuration).

There are several easy to use Java web frameworks. We took into consideration Stripes, Vaadin and Liferay. The idea of studying Vaadin and Liferay was to check whether we could get the benefits of building UI quickly. Struts and Spring-MVC were not considered, since those require a lot of configuration.

Liferay is an open source portal for building websites and web applications. It is described as a content management framework. It integrates well with many Java systems. Liferay provides portlets and gadgets to develop applications. For me, Liferay was difficult to get started with. Liferay offers two editions: free Community Edition and Enterprise Edition. EE is reported being more optimized (performs better) and having the ability to report any bugs and getting it fixed. There are also extra EE-only features available.

Vaadin is an open source Java framework for building Rich Internet Applications. When working with Vaadin framework, it is possible to use Java as the only programming language without having to write HTML and CSS. Vaadin provides a library of ready-to-use user interface components. It has nice themes and a set of components precompiled in GWT. It provides a clean framework for creating your own components as well. If we choose Vaadin, we would get nice looking UI easily. I found that there is lack of Vaadin documentation. There is a good ‘Book of Vaadin’ but that’s it.

Stripes is an open source web development framework with just a few dependencies and it is based on action based MVC pattern. I found Stripes easy to learn. A configuration is needed only in a web.xml. There is also a lot of documentation available.

We ended up starting web development with Stripes. This decision was based on the fact that we wanted to select a native web framework. We have a lot of experience of Java Stripes development in this project. By selecting Stripes, we will have a control over every functionality. We can also re-use some old code parts done in earlier projects.

Posted in Framework, Software development | Tagged framework, Liferay, Open Source, Stripes, Vaadin, web development | 3 Comments

Building a dark archive

Posted on 22.3.2013 by Mikko Lampi

One of the primary goals in our project is to build a dark archive. The motivation for this is the preservation of digital material. A dark archive is a repository which sole purpose is to preserve the data, not to provide services or general access. It is like a black box where you put stuff and know that it will be safe and sound. Of course, limited access is provided for the archive personnel but we’re not going to add any public interfaces or internet access.

We have chosen to build our dark archive with DAITSS digital preservation repository software (http://daitss.fcla.edu/). It is a production ready open source system built by the Florida Center for Library Automation (FCLA). The reasons for picking DAITSS were its technical architecture, production readiness and other design principles. As said in the DAITSS website:

Automated support for submission, ingest, archival storage, withdrawal and repository management
REST and micro-services based architecture
Enforced control for data integrity and authenticity
Preservation strategy and implementation based on format identification and characterization
Good support for text, document, image, audio and video

You can read more about DAITSS and its features here: http://daitss.fcla.edu/content/documentation

We performed initial testing with the available virtual machine during summer and fall 2012. Our experiences support these statements, and the documentation and community are top-notch. We made the decision to use DAITSS as is, and do changes by configuring it only. This way we maintain full compatibility and don’t need the technical resources required for compiling and modifying the system. We can also put more effort into other objectives like testing distributed and replaceable servers.

The system will we replicated to two geographically remote locations. We will evaluate and develop tools and methods for managing the data transport and security. It is clear that we cannot use Internet for this kind of communication. We still need to decide the levels and roles of the distribution. DAITSS offers storage pooling, there are options in Linux and the data could be replicated by our backend storage infrastructure as well.

The backend storage for archived data will be tape. The disks are used only as a temporary storage during ingest, migrations and such. The data is stored for much longer periods of time than the server hardware life cycle. Also the specifications for the hardware are quite different from your typical application server. We try to adapt by building our own servers, like Google, Facebook, Rackspace and some others do. By this we achieve a lower cost, easily expandable and replaceable environment and in total, control over the hardware. Another issue worth noticing is that not all enterprise hardware is compatible with all Linux distributions. We don’t want to limit our options by picking a corporate extension card.

We bought components for two identical servers. Both have duplicated powers, network adapters and all the basic components, but in addition have ten common hard disks attached to a disk controller. In case of breaking, you could walk in the next door PC store and buy a replacement. Expanding the farm with additional servers is not an expensive task either. We will experiment with these servers and monitor the load, reliability and such to know how we can optimize the solution. The must be a reason why this is an emerging trend.

Centos 6 will be used as the operating system. It is a robust server Linux distribution, and works well with DAITSS. If required, it could be swapped for Red Hat Enterprise Linux in case we would require a commercial support. With this kind of usage, we don’t want fast changing distribution with all the bells and whistles. DAITSS requires some relational database for system data (not the archived data). MariaDB is a good pick, because it continues the work done with MySQL but with completely open source future. Essentially, it is a drop-in replacement with better features.

At this point, tape management and access have to be done with proprietary solutions. Maybe in future this will change, but until that we will use IBM Tivoli. In Mikkeli University of Applied Sciences, we have used it for years with in production systems.

I will keep you updated on the developments and progress as we go.

Mikko Lampi

Posted in Archive system, Daitss, Dark archive, Hardware | Tagged Archive systems, Centos, DAITSS, Dark archive, Hardware, IBM Tivoli, MariaDB, Open Source, open source system | 4 Comments

Open Source – Open Development

Posted on 28.2.2013 by Osmo Palonen

Dear Readers,

My apologies being quiet so long, but there have been so many other personal developments going on. At the end of the year, I moved from city to the village and been more specialised in renovating a house project than open source. Actually, I wrote the following text at the beginning of December, but my colleague Mikko criticised this being unclear. Finally, I had time to edit the text and I post this with some updates.

I was sitting some moths ago in a seminar called OPEN (in Finnish AVOIN). This event was organised by SADe – Action Programme on eServices and eDemocracy (http://www.vm.fi/vm/en/05_projects/03_sade/index.jsp). The programme is part of the public sector ICT progress by the Ministry of Finance in Finland. The seminar was promoting the eight member projects which are using Open Source principles as well as Open Data initiative.

Most of the 8 projects are in a way regular developments within the public administration and organised traditionally, nevertheless there were some cases that waked interest in. Learner’s Service Framework will collect all certificates and diplomas as well as study plans into a Study Path of an individual during his/her years in compulsory education, vocational until the higher education as well as the continuous education, years and decades later. The Framework will also be used as a tool to create applications to the universities. When the service is ready, a citizen is able to provide the information in electronic format for the employers as well.

Even when the project itself was not a completely new idea – I tried years ago get funding for an idea of similar kind. Still, the project has something to take another look. The project uses agile software development where one of the key issues is to get the users and software developers working together. In this case, the participants are working together in the same rooms to ensure that the feedback from the users is immediately available for the developers. The project is using Scrum methodology.

For me, the most important message was: Finally someone in the public sector has understood and is able to demand a new method. The software development is not allowed to be a separated task that can be done at the best far from the end-customer and users. Instead, software and IT is just a part of service and process development; it should be open to the users and hopefully also those who will use the service – in this case the students.

Open Source Archive has to take contact and discuss the requirements with all stakeholders: the archivists know how to manage information, the users – researchers and public – know how the material can be utilized.

Posted in Software development | Tagged agile software development, software, technology | Leave a comment

Blueprinting Service Oriented Archive

Posted on 16.1.2013 by Mikko Lampi

In this post, I will present the blueprint for our service oriented archive. I will call it a light archive, because I hope the naming convention will create some contrast in between this system and a dark archive, which I’m going to talk about in later posts this spring.

A light archive is all about providing various users and groups with value and services, not just the raw archived data or records. Common services include search and discover features, data ingest, access and distribution of metadata, multimedia streams, downloadable copies and such. To achieve this the system itself must be like a modular ecosystem of evolving software, where each piece contributes to the system. A modular system can be upgraded and developed piece by piece, new features can be added and obsolete interfaces reworked, among other advantages. It’s not reasonable to develop the whole system in-house, and of course there is absolutely no point in reinventing the wheel. Reusing and integrating software, and contributing it back to the community creates an ecosystem that can produce high quality software. That is how we think our light archive should be built.

Solution overview
We’re building our solution in three layers. Sadly, not all of it can be made with open source. The main layers are storage, infrastructure and software. Below is a high-level overview of the solution.

A high-level overview of OSA solution.

Software is the most important layer. This includes the actual light archive software and all its components (like databases, application servers, subsystems). I will talk this in detail later on. The software layer is installed into Linux servers. Though you could most likely implement it on BSD or even on Windows based servers too. These servers can be physical or virtual. We’ve chosen virtual Centos and Ubuntu Linux distributions.

Infrastructure includes a hardware platform; physical or virtual. We built a virtual cluster with OpenNebula. There is more information on that in my previous posts. Briefly, the main idea is to have a scalable and reliable platform we can extend later based on the actual need. We can deploy our development and testing environment now and add more resources under the hood as we go. We can also relocate the virtual servers in more powerful environment without changes to the payload.

If you’re interested in the tech stuff, here are a few details. We run Centos 6.3 min as virtual hosts. It has a good support for both hardware and software. And there it is compatible with Red Hat if you require commercial support. Virtualization is based on KVM/QEMU with OpenNebula for cluster and cloud features, and Sunstone for management.

The storage technology is pretty much proprietary in our project, IBM and Oracle mostly. We need to use commercial software for the time being. One goal for the project is to find out if there is enough interest for making storage vendor independent and open source. Please let us know if you’re interested or have similar goals or a project.

Light archive architecture
As a base architecture, we use SOA (Service Oriented Architecture). It allows us to rapidly add and integrate applications to the system by using web services like REST and SOAP, and message brokers, XML or the like. Additional benefit is that we don’t need to modify the original applications and so maintain compatibility with the product. We are either not tied to just one technology. Though we have chosen Java as the main technology, nothing prevents us from adding, for instance, a Python based service and using a standard REST interface to communicate with it.

Here is the design this far. It is very basic but will be documented and planned as we go. This is an agile project, so we do prototyping and incremental development rather than design and write documents.

Light archive architecture.

Fedora Commons is the core software. It is an excellent mature open source project with hundreds of implementations and multiple spin-offs (e.g. Dspace). We chose Fedora because of its extendable nature and the content modeling capabilities. And the majority of the required functionality is already implemented and well tested in it. There are REST and SOAP APIs to work with in Fedora.

Data is stored in databases, in addition to OAIS packages. MariaDB is a MySQL replacement with more open source aligned future. Its upcoming release has an integration with Apache Cassandra. NoSQL is an interesting technology and we’re going to evaluate how it would work for storing archive records. The OAIS packages are going to be stored to the tape library. Previews and use copies of files are stored in a disk array. For this we need to have an abstraction layer for storage.

Search and discover features are provided by Solr, a well-known Lucene based search engine. Its features include full-text search, faceting, clustering, real-time indexing, document handling, geospatial search etc. There is a support for Solr in Fedora and most databases, including Cassandra with the Solandra project.

Analytics and reporting are a business requirement for managing the archive system. The light archive can host multiple organizations’ repositories, so the archive service provider should be able to track the usage. Of course, for the metadata, traditional big data tools could be useful for adding value to it. We’re also looking for ways to visualize the metadata, relationships and other information. Archives have huge potential for valuable and undiscovered information.

Media services, like streaming video and audio, are a must. We need to be able to let users experience the archived audiovisual data. It can be limited to identified and logged in users, paying customers, researchers or anything. But the data doesn’t provide much value if kept hidden in a dark archive. This is something that digitization companies could be interested in.

We will build an API on top of Fedora and any custom services we might develop. The API will streamline the access and add a security layer. It can also be used to publish public features online. Most likely it will be a REST API, but it’s not yet decided.

The workflow engine is an another critical component. It will handle many system processes like ingest, migration, disposal and any batch processing. Micro-services pattern is a good solution for workflows, as seen in Archivematica and DAITSS.

Finally, we need a client which consumes the web services. We will implement a web based client in this project, but it could be a desktop client or another repository as well. We have not yet decided what framework or project we’re going to use for building our client. Some Fedora projects use CMS software. Islandora for example uses Drupal as a front end. We could use our own in-house Yksa or some well-known Java platform like Liferay. We’re open to suggestions.

Next steps
Our goal is to have the working installations of all the major software during the first couple of months of this year. We need to also identify and plan the integrations.

Fedora content and service modeling is also a critical task. We’re collecting test data to use for testing and modeling work. It can also be used for testing the performance of the interfaces we’re coding. This could be a topic for a future post. As people have told me, the power and weakness of Fedora is it’s framework-like nature. It is possible and required to define what you want to do with it. While some systems dictate how to use it.

A web client is a must have for agile development. Without a user interface, we can easily get lost developing functionality without being able to test drive them. It is even more important that our partners and future users of the system can test it and tell us what they think as early as possible.

Mikko Lampi

Posted in Archive system, Fedora Commons, Software development | Tagged API, Archive, Cassandra, Database, digital repository, Fedora Commons, Light Archive, Lucene, MariaDB, Open Source, REST, software, Software architecture, software integration, Solr, Web services | Leave a comment

OSArchive

From alpha to beta

Alpha release

Modeling batch ingest process and developing workflow engine

Agile and user focused development

Summary of 2013 first half

Seminar: Open Source in Content Management

Selecting an open source framework for web application

Building a dark archive

Open Source – Open Development

Blueprinting Service Oriented Archive

Recent Posts

Archives

Categories

Meta

Follow Blog via Email

Authors