blob: 2e82b764dd7defbb533d5e5c0d6c52e1875f27bf [file] [log] [blame]
<?php require_once($_SERVER['DOCUMENT_ROOT'] . "/eclipse.org-common/system/app.class.php"); require_once($_SERVER['DOCUMENT_ROOT'] . "/eclipse.org-common/system/nav.class.php"); require_once($_SERVER['DOCUMENT_ROOT'] . "/eclipse.org-common/system/menu.class.php"); $App = new App(); $Nav = new Nav(); $Menu = new Menu(); include($App->getProjectCommon()); # All on the same line to unclutter the user's desktop'
$pageTitle = "";
$pageKeywords = "";
$pageAuthor = "";
ob_start();
?>
<div id="maincontent">
<div id="midcolumn">
<h1>The EILF Project</h1>
</p>
<?php
include_once($_SERVER['DOCUMENT_ROOT'] . "/projects/fragments/proposal-page-header.php");
generate_header("SMILA (EILF)");
?>
<h3> Please note: at the Creation Review, the project name
will be changed to SMILA (The SeMantic Information Logistics Architecture).</h3>
<h2>Introduction</h2>
<p>The EILF project is a proposed open source
project under the Eclipse Technology Project.</p>
<p>According to the <a href="http://www.eclipse.org/projects/dev_process/">Eclipse Development Process</a>, this
proposal is intended to declare the intent and scope of the EILF project as well as to gather input from the Eclipse community.
You are invited to comment on and/or join the project. Please send all feedback to the
<a href="http://www.eclipse.org/newsportal/thread.php?group=eclipse.technology.eilf">http://www.eclipse.org/newsportal/thread.php?group=eclipse.technology.eilf</a> newsgroup.</p>
<h2>Background</h2>
<p>Enterprise information systems have
mainly focused on structured information residing in databases, ERP systems,
etc. Now, however, the management of unstructured information is receiving more and more
attention in the market. The term Information Access Management (IAM) has been
coined as a name for a more general approach that bridges the two worlds.</p>
<p>Some vendors address the need for IAM tools
with proprietary approaches mainly originating in their domestic market, such
as search engine technology. In addition, some attempts have been undertaken to
address specific parts of IAM even in the open source community (cf. UIMA).</p>
<p>However, a standardized architecture
covering all parts of the IAM process on an enterprise level is still missing. This
includes the standardization of the overall architecture and frequently
used building blocks as well as a methodology for implementing IAM systems.</p>
<p>Having many years of project and product
development experience in information logistics, search and related IAM
applications, empolis GmbH and brox IT-Solutions GmbH decided to join forces
and are now seeking more interested parties, e.g. Eclipse members, to launch
the <b><i>Enterprise Information Logistics Framework</i></b> (EILF) project
addressing the above mentioned aspects.</p>
<h2>Description</h2>
<p>The EILF project will deliver the foundation
for building IAM systems ranging from medium-sized to enterprise-level
installations. For doing so, EILF will define and implement a standard
architecture making intensive use of the OSGi model to support componentization
and extensibility.</p>
<p>Mainly focusing on the server-side
architecture, EILF will provide a platform enabling all relevant steps in IAM,
including management of data sources, data access, pre-processing of
(structured and mainly unstructured) data, linguistic analysis, indexing,
search and so on. </p>
<p><img border=0
width=604
height=453
src="architecture-overview.jpg"></p>
<p><b>Figure 1</b>: Architecture Overview of EILF</p>
<p>Components from existing Eclipse projects
will be adapted and integrated in the EILF platform in order to implement specific
steps in usual IAM processes. Furthermore, EILF will deliver new components as
they are needed for the targeted IAM processes.</p>
<p>As an example, the EILF platform will allow
for the implementation of an enterprise information system by</p>
<ul>
<li>collecting documents from a variety of
(external and internal) data sources</li>
<li>pre-processing this data</li>
<li>enriching it with semantic metadata </li>
<li>indexing data and metadata</li>
<li>making all information accessible either
via an integrated search platform or via direct access through web
services</li>
</ul>
<p><img border=0
width=604
height=434
src="architecture-service-model.jpg"></p>
<p><b>Figure 2</b>: Architecture Overview - Service Model of EILF</p>
<p>To summarize, EILF will deliver a platform
as well as dedicated components for the implementation of a next generation IAM
and information logistics purposes.</p>
<p>Thus, EILF will allow enterprise
organizations to use latest information access/management technology in their
business processes.</p>
<h2>Scope</h2>
<p>The EILF project will address a set of
features and components that are required for the implementation of typical
information logistic platforms. Some of the most important aspects will be
described briefly in the following.</p>
<h3>Standardized and Scalable Architecture</h3>
<p>EILF will deliver a highly scalable
architecture for IAM processes based on well-established standards. The
objective is to serve a wide range of application scenarios from medium-sized
simple search environments to enterprise-level business information systems. Scalability
will be achieved by an appropriate design which ensures that additional
resources (e.g. hardware) can easily be added to a running system in order to
achieve a higher throughput of data etc. The underlying infrastructure will
probably rely on messaging systems and queuing for use cases aiming at high
throughput of data while other approaches need to be provided for synchronous
processes.</p>
<h3>Componentization</h3>
<p>A major focus of EILF will be on
componentization of the overall system architecture, thus ensuring that other
open source tools, products by different vendors, or even project-specific extensions
can easily be plugged into the system. Componentization will make use of OSGi
while the various OSGi services need to be orchestrated by an appropriate and
user-friendly mechanism that still needs to be identified. </p>
<p>In addition, we believe that the
configuration and orchestration of the appropriate components should not only
be possible by highly technical people actually <i>programming</i> new
“workflows” but rather this should be configurable by an experienced and
trained user. Standards, such as BPEL appear to be promising candidates for
this – in particular when appropriate tools become available.</p>
<h3>Data Source Management</h3>
<p>For any IAM platform, access to a variety
of data sources is essential. Some of these data sources can be considered
fairly standardized (e.g. when crawling Web resources) while others are highly
proprietary. While EILF cannot deliver ready-to-use solutions for all such data
sources, the objective is to make available a set of standardized connectors
(crawlers) for the most relevant data sources – but also to provide a framework
by means of which connectors to other data sources can easily be integrated.</p>
<h3>Operating and Management</h3>
<p>Operation, management and monitoring of the
IAM solution are particularly relevant for enterprise-level systems. EILF will
address that requirement by providing tools and approaches by means of which
the current state of any EILF-based system can be monitored in terms of a general
health check of the entire application as well as the single components. In
addition, this shall allow verifying utilized hardware resources in order to
foresee upcoming problems and bottlenecks. Standards, such as SNMP, will be
addressed here.</p>
<p>Additionally, enhancing the management
capabilities of IAM technology in the area of backup management is on focus.
One of the major aspects for a next generation IAM technology will be the
ability to integrate in backup and maintenance procedures of enterprise
companies. This fact will be more and more important, because the amount of
required computing will increase depending on the increased usage of semantic
technologies in the future.</p>
<h3>Authentication and Authorization</h3>
<p>Just as any other enterprise application,
IAM systems as well require Authentication and Authorization be handled in a
careful manner. This includes access rights with regards to information
available in the system, permissions to (not) execute certain operations, but
also user profiles which may potentially drive processes in the IAM system.
Also, functional aspects such as single-sign-on (SSO) are very important.</p>
<h3>Reporting</h3>
<p>Analytics and Business Intelligence
Reporting are essential parts to any IAM system as the information provided in
such reports not only allows to optimize the usage of the system but also to
identify missing information via knowledge gap analysis and similar approaches.
</p>
<p>The powerful reporting engine provided by
Eclipse BIRT will very likely be utilized with EILF even though this aspect may
be shifted to later releases of the project.</p>
<h2>Out of Scope</h2>
<p>As of today, the EILF project does not focus
on the client (user interface) part of IAM systems. This limitation includes
the integration into existing enterprise portals or other user interface
approaches. Sample clients might potentially be provided but probably mainly
for demonstration purposes.</p>
<p>Also, the implementation of specific
aspects of IAM (such as e.g. leading edge linguistic technology or ontology
management components) is out of scope although the integration of such
components in the EILF platform should be straightforward.</p>
<p>In terms of completeness, EILF will provide
basic working components while commercial vendors may offer more advanced
implementations of those components, for example relating to scalability or
feature sets.</p>
<p>All of the above are subject to change
should the need arise to more tightly integrate one or more of these aspects
with the current scope of EILF.</p>
<h2>Organization</h2>
<h3>Mentors</h3>
<ul>
<li>Wayne Beaton, Eclipse Foundation</li>
<li>Markus Knauer, Innoopract</li>
</ul>
<h3>Initial Committers</h3>
<p>The initial committers will ensure the
setup of the EILF platform as well as the initial implementation of both the
platform and selected components. The agile development process will follow
the Eclipse Development Process standards for openness and transparency. Hence, we will actively
encourage contributions to EILF.</p>
<p>The initial committers and contributors are:</p>
<ul>
<li>Georg Schmidt (brox IT-Solutions GmbH): architect,
project lead and committer</li>
<li>Igor Novakovic (empolis GmbH): team lead,
committer</li>
<li>Jürgen Schumacher (empolis GmbH):
architect, committer</li>
<li>Daniel Stucky (empolis GmbH): architect, contributor</li>
<li>Sebastian Voigt (brox IT-Solutions GmbH):
senior developer, committer</li>
<li>Ralf Rausch (brox IT-Solutions GmbH): developer,
contributor</li>
<li>Ralf Schumann (brox IT-Solutions GmbH):
developer, contributor</li>
<li>Thomas Menzel (brox IT-Solutions GmbH):
architect, committer</li>
</ul>
<h3>Code Contribution</h3>
<p>empolis GmbH as well as brox IT-Solutions
GmbH offer initial code contributions for some of the components based on their
experience and long-standing development expertise in the information logistics
area.</p>
<h3>Interested Parties</h3>
<p>Both empolis GmbH as well as brox
IT-Solutions GmbH express their explicit interest in the project as also
indicated by the list of committers and contributors above.</p>
<p>Furthermore, Arexera Information
Technologies GmbH (represented by their CTO, Bidjan Tschaitschian) as well as
project partners on behalf of Siemens and SAP expressed their interest in the
EILF project. </p>
<p>More generally, universities and research
centers are interested in using a standardized framework for search
technologies to use it as a foundation for further research. The project fits
also the interests of the Fraunhofer institutes as well as the DFKI (German Institute
for Artificial Intelligence) because of its integration in processing chains of
research projects. </p>
<p>Moreover, EILF will be highly interesting
for commercial software when it comes to integrating specific components and
building blocks into a fully operational environment without having to care
about the entire infrastructure from scratch. The cooperation between brox IT-Solutions
GmbH and empolis GmbH is already a very good example of that: while the primary
focus of brox’ technology is on accessibility and management of (unstructured)
data, the key benefit of empolis’ technology is on indexing and semantic
refinement of that data – in a productive environment, however, both aspects
need to be addressed simultaneously.</p>
<p>Last but not least one of the main focuses
of standardization departments of enterprise companies will be on
standardization of building blocks in their companies. The EILF project is the
first approach to deliver such a standardized approach for IAM technology.
Therefore, also enterprise companies have a major interest on the outcome of
the EILF project.</p>
<h3>Relationships to other Eclipse projects </h3>
<p>The following projects currently hosted by
Eclipse might potentially become relevant in the course of EILF (in
alphabetical order):</p>
<ul>
<li><b>BIRT</b> (<a
href="http://www.eclipse.org/birt/">http://www.eclipse.org/birt/</a>)
provides reporting features potentially relevant for the collection of information
about usage of EILF components.</li>
<li><b>BPEL</b> (<a
href="http://www.eclipse.org/bpel/">http://www.eclipse.org/bpel/</a>) very
likely will be used to orchestrate services as part of EILF.</li>
<li><b>Corona</b> (<a
href="http://www.eclipse.org/corona/">http://www.eclipse.org/corona/</a>)
could potentially serve as a server-side runtime environment for EILF
services.</li>
<li><b>Connectivity</b> (<a
href="http://www.eclipse.org/datatools/project_connectivity/">http://www.eclipse.org/datatools/project_connectivity/</a>)
could deliver functionality for gathering content from diverse data
sources.</li>
<li><b>EclipseLink</b> (<a href="http://www.eclipse.org/eclipselink/">http://www.eclipse.org/eclipselink/</a>)
can potentially provide components for the persistence of data inside EILF.</li>
<li><b>Equinox</b> (<a
href="http://www.eclipse.org/equinox/">http://www.eclipse.org/equinox/</a>)
definitely will be utilized in order to implement componentization based
on OSGi.</li>
<li><b>Higgins</b> (<a
href="http://www.eclipse.org/higgins/">http://www.eclipse.org/higgins/</a>)
addresses identity management which will become relevant for EILF sooner
or later.</li>
<li><b>Target Management</b> (<a href="http://www.eclipse.org/dsdp/tm/">http://www.eclipse.org/dsdp/tm/</a>)
could potentially be utilized for the administration and monitoring of an EILF
installation that is deployed to distributed servers.</li>
<li><b>Rich Ajax Platform</b> (<a href="http://www.eclipse.org/rap/">http://www.eclipse.org/rap/</a>)
addresses more the client part but nevertheless can become relevant for
the implementation of EILF clients.</li>
<li><b>SOA Tools</b>
(<a href="http://www.eclipse.org/stp/">http://www.eclipse.org/stp/</a>) as
well as <b>Swordfish</b> (<a href="http://www.eclipse.org/swordfish/">http://www.eclipse.org/swordfish/</a>)
both appear to be relevant when it comes to the service architecture of EILF.</li>
<li><b>TPTP</b> (<a href="http://www.eclipse.org/tptp/">http://www.eclipse.org/tptp/</a>) very
likely will be used for performance measurements and monitoring purposes.</li>
</ul>
<h3>User Community</h3>
<p>EILF aims at delivering a de facto standard
in the IAM area. Consequently, an active community providing solid feedback is
essential. This includes a community of developers active in different areas
(architecture, IAM components, UIs) as well as potential users of EILF who
implement specific IAM systems.</p>
<h3>Tentative Plan</h3>
<h4>2008-03 Version 0.5</h4>
<ul>
<li>Basic architecture settled and
implemented</li>
<li>Simple search application available at
least on demonstrator level</li>
</ul>
<h4>2008-06 Version 0.6</h4>
<ul>
<li>More components available e.g. supporting
the integration of linguistic preprocessing</li>
<li>More data sources accessible</li>
</ul>
<h4>2008-10 Version 1.0 – Release 1.0</h4>
<ul>
<li>Orchestration of components / services </li>
<li>Sample application</li>
<li>Documentation and related material</li>
<li>Full support of the OSGI architecture
model</li>
</ul>
<h4>2009-03 Version 1.5</h4>
<ul>
<li>Improved support of Semantic Web
features, ontologies etc.</li>
</ul>
</div>
</div>
<?php
$html = ob_get_contents();
ob_end_clean();
# Generate the web page
$App->generatePage($theme, $Menu, $Nav, $pageAuthor, $pageKeywords, $pageTitle, $html);
?>