7k
n f
lo
i s
s
O
M rt
o §
o
4-1
C
	
	« ^ Volume 26 Number 4 December 2002 ISSN 0350-55961
	Informatica
	An International Journal of Computing '
	and Informatics - J- ' Si'vi 1: ■ J . ^ 5, - - -i
Q / /@) -^-h	1 The Slovene Society Informatika, Ljubljana, Slovenia
J/
!
.'J
Informatica
An International Journal of Computing and Informatics
Archive of abstracts may be accessed at USA: http://, Europe: http://ai.ijs.si/informatica, Asia: http://www.comp.nus.edu.sg/ liuh/Informatica/index.html.
Subscription Information Informatica (ISSN 0350-5596) is published four times a year in Spring, Summer, Autumn, and Winter (4 issues per year) by the Slovene Society Informatika, Vožarski pot 12,1000 Ljubljana, Slovenia.
; ;	The subscription rate for 2002 (Volume 26) is
-	USD 80 for institutions,
I	- USD 40 for individuals, and
-	USD 20 for students
Claims for missing issues will be honored free of charge within six months after the publication date of the issue.
KT^ Tech. Support: Borut Žnidar, Kranj, Slovenia.
Lectorship: Fergus F. Smith, AMIDAS d.o.o., Cankarjevo nabrežje 11, Ljubljana, Slovenia.
£	Printed by Biro M, d.o.o., Žibertova 1, 1000 Ljubljana, Slovenia.
> - ^ - . - -
Orders for subscription may be placed by telephone or fax using any major credit card: Please call Mr. R. Mum, Jožef Stefan Institute: Tel (+386) 1 4773 900, Fax (+386) 1 219 385, or send checks or VISA or Eurocard card number or use the bank account number 900-27620-5159/4 Nova Ljubljanska Banka d.d. Slovenia (LB 50101-678-51841 for domestic subscribers only).
Informatica is published in cooperation with the following societies (and contact persons): Robotics Society of Slovenia (Jadran Lenarčič) Slovene Society for Pattern Recognition (Franjo Pemuš)
Slovenian Artificial Intelligence Society; Cognitive Science Society (Matjaž Gams) Slovenian Society of Mathematicians, Physicists and Astronomers (Bojan Mohar) Automatic Control Society of Slovenia (Borut Zupančič)
Slovenian Association of Technical and Natural Sciences / Engineering Academy of Slovenia (Igor Grabec) ACM Slovenia (Dunja Mladenič)
Informatica is surveyed by: AI and Robotic Abstracts, AI References, ACM Computing Surveys, ACM Digital Library, Applied Science & Techn. Index, COMPENDEX*PLUS, Computer ASAP, Computer Literature Index, Cur. Cont. & Comp. & Math. Sear., Current Mathematical Publications, Cybemetica Newsletter, DBLP Computer Science Bibliography, Engineering Index, INSPEC, Linguistics and Language Behaviour Abstracts, Mathematical Reviews, MathSci, Sociological Abstracts, Uncover, Zentralblatt fiir Mathematik
The issuing of the Informalica journal is financially supported by the Ministry of Education, Science and Sport, Trg OP 13,1000 Ljubljana, Slovenia. .
Post tax payed at post 1102 Ljubljana. Slovenia taxe Percue.
INFORMATICA
AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS
INVITATION, COOPERATION
Submissions and Refereeing
Please submit three copies of the manuscript with good copies of the figures and photographs to one of the editors from the Editorial Board or to the Contact Person. At least two referees outside the author's country will examine it, and they are invited to make as many remarks as possible directly on the manuscript, from typing errors to global philosophical disagreements. The chosen editor will send the author copies with remarks. If the paper is accepted, the editor will also send copies to the Contact Person. The Executive Board will inform the author that the paper has been accepted, in which case it will be published within one year of receipt of e-mails with the text in Informatica I^TgX format and figures in . eps format. The original figures can also be sent on separate sheets. Style and examples of papers can be obtained by e-mail from the Contact Person or from FTP or WWW (see the last page of Informatica).
Opinions, news, calls for conferences, calls for papers, etc. should be sent directly to the Contact Person.
QUESTIONNAIRE
Send Informatica free of charge
Yes, we subscribe
Please, complete the order form and send it to Dr. Rudi Murn, Informatica, Institut Jožef Stefan, Jamova 39, 1111 Ljubljana, Slovenia.
Since 1977, Informatica has been a major Slovenian scientific journal of computing and informatics, including telecommunications, automation and other related areas. In its 16th year (more than five years ago) it became truly international, although it still remains connected to Central Europe. The basic aim of Informatica is to impose intellectual values (science, engineering) in a distributed organisation.
Informatica is a journal primarily covering the European computer science and informatics community - scientific and educational as well as technical, commercial and industrial. Its basic aim is to enhance communications between different European structures on the basis of equal rights and international refereeing. It publishes scientific papers accepted by at least two referees outside the author's country. In addition, it contains information about conferences, opinions, critical examinations of existing publications and news. Finally, major practical achievements and innovations in the computer and information industry are presented through commercial publications as well as through independent evaluations.
Editing and refereeing are distributed. Each editor can conduct the refereeing process by appointing two new referees or referees from the Board of Referees or Editorial Board. Referees should not be from the author's country. If new referees are appointed, their names will appear in the Refereeing Board.
Informatica is free of charge for major scientific, educational and governmental institutions. Others should subscribe (see the last page of Informatica).
ORDER FORM - INFORMATICA
Name: .................................................... Office Address and Telephone (optional):
Title and Profession (optional): .....................................................................
........................................................... E-mail Address (optional): .............
Home Address and Telephone (optional): ....................
........................................................... Signature and Date: ...................
Informatica WWW:
http://ai.ijs.si/informatica/ http://orca.st.usm.edu/informatica/
Referees:
Witold Abramowicz, David Abramson, Adel Adi, Kenneth Aizawa, Suad Aiagić, Mohamad Alam, Dia Ali, Alan Aliu, Richard Amoroso, John Anderson, Hans-Jurgen Appelrath, Ivän Araujo, Vladimir Bajič, Michel Barbeau, Grzegorz Bartoszewicz, Catriel Been, Daniel Beech, Fevzi Belli, Simon Beloglavec, Sondes Bennasri, Francesco Bergadano, Istvan Berkeley, Azer Bestavros, Andraž Bežek, Balaji Bharadwaj, Ralph Bisland, Jacek Blazewicz, Laszlo Boeszoermenyi, Damjan Bojadžijev, Jeff Bone, Ivan Bratko, Pavel Brazdil, Boštjan Brumen, Jerzy Brzezinski, Marian Bubak, Davide Bugali, Troy Bull, Leslie Burkholder, Frada Burstein, Wojciech Buszkowski, Rajkumar Bvyya, Netiva Caftori, Particia Carando, Robert Cattral, Jason Ceddia, Ryszard Choras, Wojciech Cellary, Wojciech Chybowski, Andrzej Ciepielewski, Vie Ciesielski, Mei Ó Cinnéide, David Cliff, Maria Cobb, Jean-Pierre Corriveau, Travis Craig, Noel Craske, Matthevi- Crocker, Tadeusz Czachorski, Milan Češka, Honghua Dai, Bart de Decker, Deborah Dent, Andrej Dobnikar, Sait Dogru, Peter Dolog, Georg Dorfner, Ludoslaw Drelichowski, Matija Drobnič, Maciej Drozdowski, Marek Druzdzei, Marjan Družovec, Jozo Dujmović, Pavol Duriš, Amnon Eden, Johann Eder, Hesham El-Rewini, Darrell Ferguson, Warren Fergusson, David Fiater, Pierre Flener, Wojciech Fliegner, Vladimir A. Fomichov, Terrence Forgarty, Hans Fraaije, Hugo de Gans, Eugeniusz Gatnar, Grant Gayed, James Geller, Michael Georgiopoius, Michael Gertz, Jan Golinski, Janusz Gorski, Georg Gottlob, David Green, Herbert Groiss, Jozsef Gyorkos, Marten Haglind, Abdelwahab Hamou-Lhadj, Inman Harvey, Jaak Henno, Marjan Hericko, Elke Hochmueller, Jack Hodges, Doug Howe, Rod Howell, Tomžš Hruška, Don Huch, Simone Fischer-Huebner, Alexey Ippa, Hannu Jaakkola, Sushil Jajodia, Ryszard Jakubowski, Piotr Jedrzejowicz, A. Milton Jenkins, Eric Johnson, Pelina Jordanova, Djani Juričič, Marko Juvancic, Sabhash Kak, Li-Shan Kang, Ivan Kapust0k, Orlando Karam, Roland Kaschek, Jacek Kierzenka, Jan Kniat, Stavros Kokkotos, Fabio Kon, Kevin Korb, Giiad Koren, Andrej Krajne, Henryk Krawczyk, Ben Kroese, Zbyszko Krolikowski, Benjamin Kuipers, Matjaž Kukar, Aarre Laakso, Les Labuschagne, Ivan Lah, Phil Laplante, Bud Lawson, Herbert Leitold, Ulrike Leopold-Wildburger, Timothy C. Lethbridge, Joseph Y-T. Leung, Barry Levine, Xuefeng Li, Alexander Linkevich, Raymond Lister, Doug Locke, Peter Lockeman, Matija Lokar, Jason Lovi'der, Kim Teng Lua, Ann Macintosh, Bernardo Magnini, Andrzej Malachowski, Peter Marcer, Andrzej Marciniak, Witold Marciszewski, Vladimir Marik, Jacek Martinek, Tomasz Maruszewski, Florian Matthes, Daniel Memmi, Timothy Menzies, Dieter Merkl, Zbignievi' Michalewicz, Gautam Mitra, Roland Mittermeir, Madhav Moganti, Reinhard Moller, Tadeusz Morzy, Daniel Mossé, John Mueller, Jari Multisilta, Hari Narayanan, Jerzy Nawrocki, Ranee Necaise, Elzbieta Niedzielska, Marian Niedq'zwiedziriski, Jaroslav Nieplocha, Oscar Nierstrasz, Roumen Nikolov, Mark Nissen, Jerzy Nogieć, Stefano Nolfi, Franc Novak, Antoni Nowakowski, Adam Nowicki, Tadeusz Nowicki, Daniel Olejar, Hubert Osterie, Wojciech Olejniczak, Jerzy Olszewski, Cherry Owen, Mieczyslaw Owoc, Tadeusz Pankowski, Jens Penberg, William C. Perkins, Warren Persons, Mitja Peruš, Stephen Pike, Niki Pissinou, Aleksander Pivk, Ullin Place, Gabika Polöicovä, Gustav Pomberger, James Pomykalski, Dimithu Prasanna, Gary Preckshot, Dejan Rakovič, Cveta Razdevšek Pučko, Ke Qiu, Michael Quinn, Gerald Quirchmayer, Vojislav D. Radonjic, Luc de Raedt, Ewaryst Rafajlowicz, Sita Ramakrishnan, Kai Rannenberg, Wolf Rauch, Peter Rechenberg, Felix Redmill, James Edward Ries, David Robertson, Marko Robnik, Colette Rolland, Wilhelm Rossak, Ingrid Rüssel, A.S.M. Sajeev, Kimmo Salmenjoki, Pierangela Samarati, Bo Sanden, P. G. Sarang, Vivek Sarin, Iztok Savnik, Ichiro Satoh, Walter Schempp, Wolfgang Schreiner, Guenter Schmidt, Heinz Schmidt, Dennis Sewer, Zhongzhi Shi, Märia Smolärovä, Carine Souveyet, William Spears, Hartmut Stadtler, Olivero Stock, Janusz Stoklosa, Przemyslaw Stpiczynski, Andrej Stritar, Maciej Stroinski, Leon Strous, Tomasz Szmuc, Zdzislaw Szyjewski, Jure Šile, Metod Škaija, Jih Šlechta, Chew Lim Tan, Zahir Tari, Jurij Tasič, Gheorge Tecuci, Piotr Teczynski, Stephanie Teufel, Ken Tindell, A Min Tjoa, Vladimir Tosic, Wieslaw Traczyk, Roman Trobec, Marek Tudruj, Andrej Ule, Amjad Umar, Andrzej Urbanski, Marko Uršič, Tadeusz Usowicz, Romana Vajde Horvat, Elisabeth Valentine, Kanonkluk Vanapipat, Alexander P. Vazhenin, Jan Verschuren, Zygmunt Vetulani, Olivier de Vel, Valentino Vranić, Jožef Vyskoc, Eugene Wallingford, Matthew Warren, John Weckert, Michael Weiss, Tatjana Welzer, Lee White, Gerhard Widmer, Stefan Wrobel, Stanislaw Wrycza, Janusz Zalewski, Damir Zazula, Yanchun Zhang, Ales Zivkovic, Zonling Zhou, Robert Zorc, Anton P. Železnikar
An Assessment of the Organization Virtuality with Three Different Reference Models
Cene Bavec
School of Management in Koper, Slovenia Phone: +386 5 610 2000; fax; +386 5 610 2015 E-mail: cene.bavec@guest.arnes.si
Keywords: virtual organization, level of virtuality, modeling organization, colored Petri nets, virtual government Received: May 17, 2002
The main objective of the research was to test a holistic view on virtual organizations with different perceptions of virtuality. Traditional and virtual organizations could be seen as two extremes of more general model of organization. To asses a transition from the traditional to the virtual organizations we have to grade organizational virtuality. In the paper we discuss three basically different reference models used to asses this transition. Two models are well known - the Mowshowitz's switching principle, and the Model of Business Networking (MBN) as a representative of models preferred by the ICT experts. They see the virtual organizations through implementation of the ICT, particularly the Internet. To express other characteristics of virtual organizations we also presented the model based on the Colored Petri nets and fuzzy logic that we originally used to study an organized anarchy. All three models were implemented to assess the case of the Custom Administration in Slovenia. An assessment confirms that organization of the Custom services clearly demonstrates an efficient utilization of the Internet and other features of virtual organizations.
1 Introduction
Many authors argue that the theory of virtual organizations leads to a generalization of the traditional organization theory. It is not yet a prevailing organizational concept (Klüber et al, 1999) but, Internet, networked and virtual organizations have already proved to be an efficient organizational paradigm that brought to the business world a higher level of flexibility, efficiency, resource utilization and better customer services.
An intuitive perception of virtual organizations is often inadequate and misleading so we are still searching for new managerial principles and practical tools for every day management that could replace still prevailing traditional organizational principles bom in the industrial age. The theory of virtual organizations is presently very chaotic. We still haven't developed practically useful indicators to make an objective assessment of virtual organization and to distinguish them from the other forms of organization.
We have learned from practical experience that it is not realistic to classify organization in only two classes -virtual and traditional (Jansen at al, 1999). These should be seen as two extremes of a more general model of organizations. If we need to describe and to assess a transition from the traditional to the virtual organizations we have to grade their virtuality.
In this paper we discuss the indicators that could assess virtuality of specific business organization. In the case presented we studied a real business environment to underline practical issues of virtual organizations and to
raise a general issue of their virtuality and efficiency. We used three very different reference models of virtual organizations. Two models are well known - the switching principle and metamanagement (Mowshowitz, 1999) and the Model of Business Networking (Klüber at al, 1999) that represents a class of models used mainly by the IT and Internet specialists.
To learn more about the structure and the internal nature of virtual organizations we also presented a formal definition of the organization, based on the Colored Petri nets and fuzzy sets logic. The definition was implemented in a computer model of non-hierarchical organizations and the organized anarchy (Bavec, 2001). The model was not initially designed to describe virtual organizations. Nevertheless, it predicted some features of non-traditional organizations as fiizziness of organizational rules and boundaries.
2 Reference Models Used
The main objective of the research was to test a holistic view on virtual organizations with different perceptions of virtuality:
•	the Switching Principle - is mainly a managerial view with emphasis on organizationafflexibility and manageability,
•	the Model of Business Networking - it defines inter-organizational relations and predominantly an ICT view on organization with emphasis on modularity and business transparency.
• Model based on the Colored Petri nets formalism
shows an internal view on "fuzzy" organizational
structures and information flows.
We were well aware that models are not compatible and not even comparable but, we had intentionally selected such different perceptions of virtual organizations. The goal was to get some deeper understanding of potential indicators that could be used in their assessment. To overcome this methodological obstacle we used the models to separately assess seven features of virtual organizations proposed by Mertens et al. (1998):
•	boundary crossing
•	complementary core competencies
•	sharing of knowledge
•	geographical dispersion
•	changing participants
•	participant equality
•	electronic communication
2.1 Switching Principle and Metamanagement
The first reference model we used was based on the switching principle and metamanagement introduced by Mowshowitz (1999). In the simplest way we could describe it as an ability of organization to dynamically select the best performer or executor (need-fulfillment) for a particular task (need). That means that an organization treats tasks separately from their potential performers. Switching would take place when replacing one performer would bring benefits that are greater than direct and indirect costs of replacement. Another concept introduced by Mowshowitz is metamanagement. It is basically management of virtually organized tasks and managerial implementation of the switching principle.
This principle may seem trivial at the first glance but it opens an entirely new view on the organization. We should notice that in the traditional organization theory and practice it was always a sign of serious miss-planning or "bad organization" when we had to change (switch) a performer in the production phase of the task when organization was already implemented.
Possibility of switching undoubtedly adds to organization and managerial flexibility, but the question that still remains is just how realistic it could be in every day business. Basic idea of virtuality is that switching could be done relatively fast. It would be difficult for the management to implement all traditional risk analysis, so the trust becomes an important decision and even an economic factor. We have to trust the new partner (Ishaya, Macaulay, 1999) and be reasonably confident that he will integrate into our operations and perform his role according to our expectations. If the mistrust is too high it could overwhelm other benefits.
From the Mowshowitz's model we could assume that the level of virtuality is correlated with the ability to implement the switching principle or metamangement. We could use it to assess changing participants and
participant equality from the Mertens' list of virtual organization features.
2.2 ICT Oriented Models of Virtual Organization
Another, very ditferent perception of virtual organization is seen in the Model of Business Networking (Kliiber et al, 1999). It is a typical representative of models preferred by the IGT experts as they see virtual organizations through implementation of IGT, particularly the Internet. But, the model is more general and incorporates important features of virtual organizations that are highly relevant for the management.
The Model of Business Networking (MBN) has the following elements that were parts of our assessment of a real business environment:
•	Customer processes (determine the design of a value chain),
•	Integrators and Aggregators (third parties included into business relationships),
•	Business Bus (logical space where complex services among business partners are flexibly and efficiently exchanged with the support of service providers),
•	Business Ports (standardized interface to access the Business Bus).
Similar approaches are widely used, often under different names in design and implementation of information systems based on open networking like the Internet.
According to the MBN, integrators and aggregators are an essential element of networked and virtual organizations. They provide different business services: knowledge, coordination, process, information, and transaction services. They behave in the way to "soften" or even eliminate organizational boundaries between business partners. The Business Busses and the Business Ports describe inter-organizational relations and interfaces that define mainly an information structure of virtual organizations. But, more generally they describe complementary core competencies, sharing of knowledge, geographical dispersion and electronic communication.
3 Modeling Organizations with Colored Petri Nets
3.1 Rationale Behind the Model
In the year 1994 we developed a model of organization based on the extended Colored Petri nets and fuzzy logic to study organized anarchy and influence of information systems on the organization. Their superior semantic power makes possible a very rich representation of the organization and overcomes some limitations of classical representations. A general definition of the organization was based on abstract fuzzy sets with axiomatically assigned properties. The organization was defined as a
set of rules that determined the chain of authority, description of working (organizational) places, and other organizational relations. It also determined conditions under which organizational processes could change their states. The quality of organization can be measured only through its impact on processes so it must be modeled together with them. The Petri nets proved to be an efficient way to combine organizational structure and the processes in the organization.
This methodology was used for modeling properties that reflect ambiguity or deviation from the traditional hierarchical organization. The study also exposes a paradigm that could be called an informed anarchy paradigm in analogy with the organized anarchy. The informed anarchy paradigm is based on empirically founded facts that unclear technology of allocation, dissemination, and also faulty understanding of information are prevailing properties of organizations.
An object-oriented model was developed that interlinks a formal organization with decision and information processes into an integral model. It was based on three classes of objects: organization, decision processes and information processes. An important feature of the model was its ability to model conditions on micro level, that means both on the level of working (organizational) place and individual process.
Further development confirmed that the same model represent also some relevant features of virtual organizations.
3.2 Model of Organization Based on Colored Petri Nets
Petri nets are well proven tools for systems modeling that can describe dynamic and static properties of the system. Similar situations, only much more complex, can be met in business organizations, so we experimented with the possibility to model them with Colored Petri nets (CPN). We published some results of the computer simulation (Bavec, 2001) - relation between the level of organization anarchy, load of problems, formal and informal information systems, and efficiency of decisionmaking.
We defined an organization ORG as a 12-tuple or an extended Colored Petri net (Bavec, 1995):
ORG = (B,P, T,D, QR.h O.ò.n.p. c)
B = (b], b2, ... bj)	a finite set of colors
P = (pi,p2, ..■ Pn)	a finite set of places
T = (ti,t2, ... t J	a finite set of transitions
D = (dj, d:,... dj
C = (Cl, C;, ... Ct) R = (r,. r2, ... rj
a finite set of organizational places (working places, divisions, etc.)
a finite set of concepts or objects
a finite set of organizational relations r = {r\rcDxD }
sn/TirriDncn« = 0
1:	an input function that maps a set of
transitions // into places p,
O: T—^P^ an output function that maps a set of transitions into places p,
S: D^ P	a function that maps organizational
places di G D into places pi G P
rj: C —f P	a function that maps concepts or
objects Ci e C into placesPiEP
p: /?—> T	a function that maps organizational
relations r, G R into transitions ti G T
a: Z—>• P	a function that maps multi-set of tokens
z(pi) G Z into places pi&P
k^ [0,1]	a threshold, an additional condition for
firing transitions.
We are aware that business and human organizations, particularly as complex ones as the virtual organizations, can't be highly formalized (structured). Consequently, there is a question how far can we go with formal definitions. But, the model confirms that we could model some features of virtual organizations with the CPN and its derivations (Deng et al. 1990).
In the model we implemented fiizzy logic, mainly through the threshold XE [0,1] which additionally controls firing of tokens. We also proved that the introduction of concepts or objects (also Bastide, 1996) C/ G C assigned to organizational places di ED (they could be everything from working places to organizational units) provided us with the tool to model complex relations between organization as the set of rules, and processes that are running in accordance with the organizational rules.
With the controlled firing of tokens in the CPN and fuzzy logic we could describe and study features of virtual organizations like the switching principle, the ambiguity of organizational relations and particularly the boundary crossing.
4 The Case Study - Assessment of the Government Agency
4.1 Beyond the Business Partnership
Emerging experience and the theory of virtual organization is based on the present business practice with very few examples from the government administration. But, there is widely spread belief among researchers that governments and public administrations are one of the most promising grounds for an introduction of virtual organizations. The development of Internet and innovative customer and citizen oriented services in governments in developed countries more than justify this assumption. It is encouraging to notice a similar development also in the middle developed countries, such as Slovenia.
In the case presented we studied the collaboration of the Custom Administration in Slovenia (CA) with different private companies. The case reveals development steps from the traditional government agencies towards highly efficient and technologically advanced organizations that clearly articulate elements of the new organizational paradigm - the virtual organizations. The CA has developed a sophisticated and efficient Custom information system in which 98% of all custom declarations are lodged electronically up to the highest security standards.
The development of the CA information system and computer applications were outsourced to the private company ZZI from Ljubljana. The application was linked to the operation environment of users. Jointly with its partners, ZZI developed an interface to enable its software to be integrated into larger operation information systems based on the BAAN, SAP, NAVISION, etc. It is important to notice that the CA has been equally opened to other potential partners that could develop software and services to enable different users to link their systems to the Custom information system, enabling them to submit the customs declarations and other documents by the Internet.
Beside the software development ZZI and nearly 30 other providers also offer transmission services for the electronic data interchange within different environments and among different partners. The server solutions and programs used by clients facilitate the automatic data interchange within the different environments and applications. Such service for example is the transmission of the messages from the Internet environment of partners to the X.400 environment of the Custom Administration.
4.2 Implementation of the Switching Principle and Metamanagement
At the beginning of the research we were concerned with the notorious rigidity of the government organization. It seemed quite unrealistic to notice any sign of the switching principle in the government agencies. But, what we really found was a clear presence of elements of metamanagement and implementation of the switching principle.
Services such as the transmission of the messages from the Internet environment of partners to the X.400 environment of the Custom Administration and other forms of electronic data interchange services performed by private companies are without any doubt "switchable". The official policy of the CA is in favor of tighter inclusion of trustworthy participants into the Custom Information System. This just confirms this development. Transferring elements of the CA authority to partners express another important feature of virtual organizations - the trust. From the government point of view it is a major breakthrough to realize that it is more convenient and even cheaper to trust partners and to control them more "softly" in indirect and off-line mode.
Looking into the project and official documents of the CA reveals that the switching principles as well as the role of trust are introduced entirely intuitively, without any reference to virtual organizations. It shows that evolution of virtual organization could be entirely spontaneous and a natural development in the competitive and technologically advanced environments.
4.3	Implementation of the MBN
According to the Model of Business Networking (MBN) integrators and aggregators are essential elements of networked and virtual organizations. They are the third parties included into business relationship. In the case of the Custom information system they are not government agencies, nor the users of the system. Electronic data interchange services performed by the ZZI and other companies are "infomediary" (Osterie, 1999). The companies around the CA are playing the role of integrators and aggregators with standardized procedures and with standardized interfaces and can be interchanged and replaced. It gives the CA a very high flexibility, accompanying with noticeable cost reduction and better customer services.
Again, the CA implemented the most visible feature of virtual organization - a very high flexibility.
The overall architecture of the CA information system is modular with surprisingly similar topology as the MBN. Terminology used is different, but its structure could be described with the MBN features. Modular system design and application of the Business Bus (the way to exchange of business services) and the Business Ports (standard business and technological interfaces) offers a tool for optimal organizational design in the CA and their partners.
The Business Bus in this case is a virtual world of custom declaration processing that is separated from the physical world of goods, importers and custom houses. Locations of custom warehouses are the matter of convenience and agreement between the CA and an importer. It indicates the presence of vital elements of virtual organization.
4.4	"Fuzzy" Organizational Bonds
With the description of the virtual organization with the CPN we could identify and formalize some internal features like the strength of managerial and organizational bonds. We could study the mechanisms that make some positions in organization logically members of two or even more different organizations. Many positions or tasks in the ZZI are so strongly linked to the CA, and also opposite, that employees often don't know who their boss really is and to whom to report to in some cases.
It means that some organizational relations are not just ftjzzy but they could also extend out of the organization. That is a relatively simple explanation for ambiguous boundary limits and boundary crossing.
The most interesting feature that the CPN modeling offers is its ability to model fuzzy information and decision processes that are the characteristics of the organized anarchy and also of virtual organizations. We were quite intrigued by the fact that we can implement so many ideas of organized anarchy directly on the virtual organizations.
4.5 Results
The reference models used confirm that the organization of the Custom Administration clearly demonstrates features of contemporary organizations with an efficient
utilization of the Internet and even more hidden elements of virtual organization. In the absence of proven methodologies and indicators for assessment of virtual organizations we assessed the features of virtual organizations proposed by Mertens et al. (1998).
We transformed the whole problem into seven separated and independed assessments. Each feature was ranked on the scale from I to 100 and plotted on the radar chart (Figure 1). The picture reveals an uneven development of virtual organizations futures - some of them are very pronounced, others are still very close to the traditional organizations.
boundary crossing 100^
sharing of knowledge
electronic communication
complementary core competencies
geographical dispersion
participants equality
changing participants
Figure 1 : An assessment of the basic features of the virtual organizations - the case of the Custom Administration in Slovenia
An electronic communication and the geographical dispersion are very developed but, in every day business they are the easiest goals to achieve. They represent more technical than managerial challenge. Obviously, it is much more difficult to achieve managerial goals like sharing of knowledge, participation equality and particularly changing participants. They are prerequisites for introduction of the switching principle and metamanagement. Results also confirm what we could intuitively expect - boundary crossing and complementary core competencies are somewhere in the middle on the scale of managerial problems. They could be achieved in the next step, after introducing electronic communication and geographical dispersion on a broad scale.
Conclusion
The research has revealed that the CA had progressively developed towards virtual organization entirely
intuitively. At the beginning, their main goal was to outsource development of their information system and to utilize the ICT, as much as possible. It confirms a well known fact noticed in the business community that we presently face an evolution rather than revolution towards virtual organizations. This evolution is spontaneous in technologically advanced environments. But, even if we accept the fact that the emergence of virtual organizations could be spontaneous the management still needs deeper insight into challenges of the new organizational paradigm. It will soon turn out to be one of the most important expertise of contemporary managers.
The case of the Custom Administration also presents a fine example of virtuality - the virtual world of custom declaration processing is separated from the physical world of goods, importers and custom houses. It provides such a high flexibility that Slovenian accession to the European Union won't require any changes in organization of the CA - their virtual world will be
simply extended from Slovenian borders to the whole EU.
An assessment of the degree of virtuality proved to be a real challenge. We are still short of any useful methodology or a set of relevant indicators. Nevertheless, a simple case we investigated showed that we could combine different models in search for more holistic view on virtual organizations.
We were able to detect weaknesses and obstacles in managerial strategies and also to grade their goals from the easiest to the more complex. Technical issues like extensive introduction of the Internet are relatively easy to achieve and to manage. One of the most pronounced features of virtual organizations like boundary crossings is also quite common, even in the early phase of the development of virtual organization.
The ability to change participants has received the lowest grade in our research. It seems that the real managerial challenge is hidden in the switching principle and metamanagement. It could lead us to the conclusion that fully developed virtual organizations are still difficult to achieve. For that reason management needs much deeper understanding of challenges and obstacles in the transition from traditional to virtual organizations. Researchers could contribute with models and tools that would enable managers to set relevant goals and to asses their efforts.
References
[1]	Bastide R. (1996): Approaches in unifying Petri nets and the Object-Oriented Approach, Working paper, L.I.S., Université Toulouse
[2]	Bavec C. (1995) Object Oriented Modelling of Organization, Ph.D.Dissertation (in Slovenian), University of Ljubljana, Faculty of Economics
[3]	Bavec C. (2001): "Modeling of Management Decision-making Processes in Organized Anarchy", Informatica, 25 (2001) 375-379
[4]	Bavec C., Zorko Z. (2002): Evolution of Networked and Virtual Government Agencies - The Case from Slovenia, Procedings of 3rd European Conference E-Comm-Line 2002, September 26-27, 2002, Bucharest, Romania
[5]	Cohen M. D., March J. G., Olsen J. P. (1972) A Garbage Can Model of Organizational Choice. Administrative Science Quarterly, 17 1-25
[6]	Davidow W.H., Malone M.S. (1992) The Virtual Corporation, Harper Collins, New York
[7]	Deng Y., Chang S.K. (1990): A G-net Model for Knowledge Representation and Reasoning, lEE trans. On Knowledge and data Engeneering, Vol. 3, No. 3
[8]	Drucker P.F. (1999): Management Challenges" for the 21®' Century, Butterworth-Heineman
[9]	Hesselbein F., Goldsmith M., Beckhard R. (1997): The Organization of the Future, The Drucker Foundation, Jossey Bas, San Francisco
[10]	Ishaya T., Macaulay L.(1999): The Role of Trust in Virtual Teams, Proceedings of the 2nd International VoNet Workshop, September 23-24, 1999, Simowa Verlag Bern
[11]	Jansen W., Steenbakkers W., Jägers H. (1999): Electronic Commerceand Virtual Organizations, Proceedings of the 2nd International VoNet Workshop, September 23-24, 1999, Simowa Verlag Bern
[12]	Jensen K. (1992): Coloured Petri Nets, Basic Concepts, Analysis Methods and Practical Use, Vol. I, Springer-Verlag, Berlin Heidelberg
[13]	Klüber R., Alt R., Osterie ZH. (1999): Emerging Electronic Services for Virtual Organizations -Concept Framework, Proceedings of the 2nd International VoNet Workshop, September 23-24, 1999, Simowa Verlag Bern
[14]	Mertens, P., Griese J., Ehrenberg D. (1998): Virtuelle Unternehmen und Informationsver-abeiting, Springer, Berlin
[15]	Morabito J., Sack 1., Bhate A. (1999): Organization Modeling - Innovative Architectures for 21" Century, Prentice Hall
[16]	Mowshowitz A. (1997): Virtual Organization: Avision of Management in the Information Age, The Information Society, Vol. 10
[17]	Mowshowitz A. (1999): The Switching Principle in Virtual Organization, Proceedings of the 2nd International VoNet Workshop, September 23-24, 1999, Simowa Verlag Bern
[18]	Strausak N. (1998): "Résumé of VoTalk", VoNet Workshop, April 27-28,1998, Simowa Verlag Bern
Using Image Segmentation as a Basis for Categorization
Janez Brank
Department of Intelligent Systems Jožef Stefan Institute, Ljubljana, Slovenia janez.brank@ijs.si
Keywords: image categorization, segmentation, generalized kernels Received: June 26, 2002
Image categorization is the problem of classifying images into one or more of several possible categories or classes, which are defined in advance. Classifiers can be trained using machine learning algorithms, but existing machine learning algorithms cannot work with images directly. This leads to the needfor a suitable way of representing or describing images such that learning algorithms can work with them. We consider a representation based on texture segmentation and a similarity measure between segmented images which has been used successfully in the related area of image retrieval. A generalized kernel for use with the support vector machine (SVM) algorithm can be built from such a similarity measure. We compare this approach with a more straightforward representation based on autocorrelograms, and we show that these two representations can be combined to obtain classifiers with higher categorization accuracy.
1 Introduction
Besides textual and relational data, people increasingly have to deal with pictorial data, or data in the form of images. Large pictorial databases are being produced as archives digitize their collections, and additionally the World Wide Web contains a huge number of images. Apart from purely technical problems of storing and processing such large amounts of data, the emergence of large collections of images opens the problems of enabling the users to make sense of this data and find what they need. Image categorization deals with one aspect of this problem: given a set of images and a set of predefined categories or classes, we assume that each image should belong to one or possibly several of these categories. For a large collection it would be impractical to have a human observer categorize all the images, so we want to be able to classify the images automatically after a small number of images has been classified manually to be used for training the automatical classifiers.
However, this view of image categorization as a machine learning task immediately opens up a new problem: existing machine learning algorithms generally cannot work with images directly. Instead, they often assume they will be dealing with instances described by vectors or tuples. We need to be able to represent images using structures of this kind to make use of existing machine learning algorithms.
1.1 Related work in image retrieval
We can build on existing work in image retrieval, which is a related area where the problem of representation has already been encountered. In image retrieval, the user poses a query to the system and the system
should find images that are somehow relevant to the query. Thus a way of representing the query, a way of representing images, and a way of comparing a query and an image (to determine if the image is relevant with regard to this query) are needed. One approach that is both technically feasible and useful enough to be commonly used in practice (e.g. in web image search engines such as Google) is to describe each image using a few keywords, and the user's query can then request images whose description includes or excludes particular keywords. However, this approach is only feasible if textual descriptions of images can be obtained automatically (e.g. from the HTML file that linked to an image); it is usually too costly to have a human maintainer prepare such descriptions manually for a larger database. In addition, this textual approach suffers from problems of polysemy: different people would use different words to describe an image, and the same words may mean different things to different people. Therefore it is often desirable to rely solely on what can be automatically extracted from the images themselves. The user's query is then often simply a request to look for images similar to a given query image or sketch (this approach is known as "querying by contenf', or "content-based image retrieval").
There are several close parallels between image retrieval and image categorization. In categorization, if a new image is similar to training images from a particular category, it should probably itself belong to that category; in content-based image retrieval, if an image from the database is similar to the query image, it should probably be shown to the user. Thus we see that both areas need a way of representing images and assessing similarity between them. Many image representations and similarity measures have been proposed in image
retrieval, and we would like to examine some of them from the point of view of image categorization as well.
One popular class of image representations is based on simplifying the image by approximating the color of each pixel by the nearest color from a predefined and fixed color palette; this can also be seen as partitioning (or quantizing) the space of all possible colors. Some information is then recorded about the presence of each color on the image. When simply the proportion of the image covered by (the pixels of) that color is stored, the resulting description is called a histogram [11], However, this disregards all spatial information (how the color is distributed around the image): for example, a large patch of red would affect the histogram of an image in the same way as a large number of red pixels scattered all over the image, which is surely undesirable.
Several improved histogram-like representations of images have been proposed. For example, an auto-correlogram [4] records, for each color c and for a few small integers d, the probabilify that a pixel, chosen randomly at distance d from a randomly chosen pixel of color c, will itself be of the color c. This retains information about the amount of a color present on the image, but also records something about the spatial arrangement of each color. Still, all "global" representations of this type can be seen as somewhat rigid as they record a strictly fixed amount of data for each image. They cannot take into account the fact that some images are more complex than others, that an image may contain several objects, or that it may be helpful to distinguish between an (interesting) object and (uninteresting) background.
1.2 Image segmentation
Another, more sophisticated, class of image representations is based on segmentation, or dividing an image into a set of regions such that each region is roughly homogeneous in color and/or texture. Each image is then represented by a set of regions; each region is typically described by a short vector that is a by-product of the segmentation procedure (containing e.g. the average color of the region, information about texture, and so on). Additionally, the location of each region on the image (i.e. which parts of the image are covered by that region) is often recorded as well. In general, regions might overlap, and each region might itself be composed of several disjoint parts; this is not necessarily problematic as they need not be shown to the user, and image similarity measures usually permit the regions to be disconnected,'^and sometimes work with overlapping regions as well. Representations based on segmentation can adapt well to differences in complexity between images, and have been used successfully in image retrieval [NRS99, WLWOO].
Various segmentation algorithms have been proposed in the context of image retrieval [NRS99, WLWOO]. These approaches are usually based on dividing the image into a grid of small "windows" (e.g. 4x4 pixels); each window is described by a short vector (containing e.g. the average color and possibly a few coefficients from the higher-frequency bands of a
wavelet transform, in order to capture the presence of edges or texture), and these vectors are then clustered. Each of the resulting clusters contains vectors that lie close together in their vector space, and such vectors hopefully correspond to windows that are similar in appearance; therefore it makes sense to form a region from such a group of windows. The region thus obtained can be described by the centroid of the cluster, i.e. by the average of the vectors that describe the windows from which the region was formed.
To use segmentation for image retrieval, it is also necessary to introduce a measure of similarity between segmented images. Such measures usually examine pairs of individual regions (one region from each image) and combine the measures of similarity or difference between regions into a single similarity measure between entire images. For example, the integrated region matching (IRM) measure [8] defines the distance between two images as a weighted sum of distances between regions, in which the weights are chosen so as to allow larger regions to have a larger influence on the similarity between images.
To use the representations described above for image categorization, one could use global representations (e.g. autocorrelograms) in combination with any of several machine learning algorithms (such as support vector machines, SVM); or use a segmentation-based similarity measure with an algorithm that allows an arbitrary similarity measure to be plugged into it (e.g. the nearest-neighbor method). However, our earlier work [1] has shown that the nearest neighbor method, in combination with segmentation-based image similarity measures, results in rather unimpressive performance in comparison to SVM and global representations. It is therefore our goal to try using segmentation together with support vector machines. The main challenge here is that the SVM in its original formulation assumes all training and test examples to be described by vectors with the same number of components, while in the case of segmentation the description of each image has more structure than that, and the number of regions can also vary from image to image.
2 Support vector machines
Support Vector Machines (SVMs) [3] are a relatively recent family of machine learning algorithms that have been used successfully in many application domains. In the most elementary form of this method, we assume that each training example is a vector from some d-dimensional real space, and that there are exactly two classes, called positive and negative. Several extensions to multiclass problems are possible [5], usually by converting one multiclass learning problem into several two-class problems (e.g. training one classifier for each pair of classes to separate members of one class from those of the other class).
In SVM learning, we want to separate the positive vectors from the negative ones using a hyperplane such that the positive training vectors lie on one side of the plane and the negative ones lie on the other side.
Additionally, to help make the classifier more robust and more reliable for use on unseen test vectors, we want the training vectors to lie as far from the separating hyperplane as possible. Maximizing this distance (known as the margin) from the plane to the nearest training example can be cast as an optimization problem in the following way.
Let Xj be the /'th training vector, and y, its label (which equals +1 for positive examples and-1 for negative training examples. A hyperplane can be described by the equation w^x + è = 0, where w is the "normal", i.e. a vector perpendicular to the plane, and b is a threshold that determines the actual location of the plane in space, v/x denotes the dot product of the vectors w and X. Given a particular vector x, we can determine what side of the plane it lies on by examining whether w^x + è is positive or negative. However, to ensure that the training examples do not lie too close to the plane, we must also insist that w^x + 6 has a large enough absolute value. We can describe this using the following conditions:
w'xj + b>\ and y, = -I => w^xj + b<-\,
or, more concisely: yiw'xj + è) > 1 for all training instances i. If all training examples satisfy these conditions, the space between the hyperplanes w'x + b = \ and w'x + è = -I is empty; to maximize the breadth of this margin space, we need to maximize the distance between these two planes, which equals 2/||w||. Maximizing the margin is thus equivalent to minimizing subject to the above conditions.
This optimization problem is usually also extended to allow some training instances to be misclassified (or at least lie within the margin, though perhaps on the correct side of the separating plane) if this leads to a wider margin on the other training instances (the soft margin formulation of SVM).
Solving the optimization problem gives us the values of w and b, and the resulting classifier simply works according to the formula prediction(x) = sgn[w'x + 6].
Using standard techniques fi'om optimization theory, this optimization problem can be transformed into a "dual" form. It turns out that the dual form, as well as the resulting classification rule, can be expressed so that the training vectors need never be accessed directly, as long as we are able to compute the dot product of any two vectors. In particular, the normal w can be written as w = Z, aiyiXj, where the a/ coefficients are obtained by solving the dual optimization problem. The classifier can then be described asprediction{x) = sgn[b + S, aiyixi'x].
Now suppose we used some mapping cp to map our original instances X; into some other (possibly higher-dimensional) vector space F. Let K(xi, xj) := (cp(x,), (p(Xy));.- be a function that, given two instances x, and Xj, computes the dot product {■,■)[.• (in the new space F) of their images (p(x;) and (p(xy) under the mapping cp. It follows from the above that we could train a hyperplane in F without ever working with the mapped vectors (p(x;) explicitly, as long as we are able to compute A'(x„ Xy) for any two vectors x, and Xj. The function K defined in this
way is known as a kernel. The importance of kernels arises from the fact that the mapping cp need not be linear, and for a nonlinear (p a hyperplane in F could correspond to some highly nonlinear separation surface in the original space. In this way, kernels allow the SVM algorithm to induce nonlinear models while preserving the optimization framework essentially intact. The appeal of kernels stems from the fact that a wisely chosen function K can be simple to compute and yet correspond to a complex nonlinear mapping into some very high-dimensional space F.
A kernel corresponds to a dot product in some vector space and can therefore in some sense be seen as a sort of similarity measure: the dot product of two vectors (if their length is fixed) is greatest when they point in the same direction, and then decreases as the angle between them increases, eventually becoming 0 (for orthogonal vectors) and even negative, reaching the minimum if the two vectors point in exactly the opposite direction.
However, the converse is not true: that is, not every similarity measure corresponds to a scalar product in some vector space. If we used a non-kernel similarity measure as if it were an actual kernel, we would no longer have the mathematical guarantees that the SVM training algorithm would converge, and even if it converged there would be no theoretical grounds to expect the resulting classifier to have good performance.
3 Generalized kernels
Generalized SVMs have been proposed by Mangasarian [9] to allow an arbitrary similarity function to be used in a way analogous to a kernel. In the previous section we have seen that SVM can learn nonlinear models of the form
prediction{x) = sgn[b + Z, a^y, K{Xi, x)]
where A^(x„ x) = ((p(x,), (p(x))/.' for some mapping (p to some space F and some dot product(-,-)f in F.
Now if some arbitrary function K were used instead of a proper kernel function, again giving us a classifier of the form sgn[b + Y.iaiyiK{x,,x)\, this might still be a perfectly reasonable and useful classifier, but it wouldn't necessarily correspond to some hyperplane in some vector space F to which the instances X; and x might have been mapped. Thus we couldn't obtain the a, values using the criterion of maximizing the margin, because there wouldn't even be a hyperplane whose margin to maximize. Instead, [9] proposes to minimize the value aJHa (subject to the same constraints as before, i.e. that our training instances should lie on the correct side of the separation surface) for some positive definite matrix H. (This problem has a very similar structure to the dual form of the original SVM optimization problem, and is in fact equivalent to it if K really corresponds to a dot product and a suitable matrix H is chosen.)
In the simplest case of the generalized SVM, we would take H= I (the identity matrix) and thus minimize E, a^. This can be interpreted intuitively as looking for a separation surface that can be expressed in the simplest
possible way, possibly with many a, equal to 0 (i.e. without really using the training example Xi in the description of the separating surface).
It can be shown that the formulation for // = / is equivalent to mapping each instance jc into the vector {K(x, Xi), ..., K{x, x„)) of its "similarities" (as measured by K) to all the training instances Xi,... ,x„, and then using an ordinary linear support vector machine over this new representation. For the problem of image categorization, this amounts to the intuitively appealing suggestion that two images should be treated as similar if they exhibit a similar pattern of similarities to known training images.
4	Region clustering
In this section we consider another approach to using segmentation-based representations for image categorization. Each image has its own set of regions and regions belonging to different sets are in a sense quite independent of each other. This leads to the need for special similarity measures that compare two images by considering all pairs of regions and aggregating the similarities of regions into a measure of similarity between the images.
As an alternative, we propose to bring the region-based representations of images to a "common denominator" by clustering the descriptions of all the regions of all the training images. The hope here is that each cluster would correspond to a group of similar regions from several images, while regions from separate clusters would be quite different in appearance. Thus, when comparing two images, if a region of one image belongs to a different cluster than some region of the other image, there would be no need to compare these two regions in any particular way, because knowing that they belong to different clusters already indicates that they are different in appearance and cannot really contribute towards the similarity of the two images under consideration.
Therefore, an image would then be described by recording, for each cluster of regions, what proportion of the area of this image is covered by regions of this cluster. If there are d region clusters, each image would now be represented by a J-dimensional real vector (with possibly many zero-value components, as there would probably be much more clusters than an average image has regions). With all images represented in this same d-dimensional space, we can then use the ordinary linear support vector machine to train classifiers.
5	Experimental evaluation
To compare the approaches described in the previous sections, we conducted experiments on the mise database, which is publicly available (http://www-db.stanford.edu/IMAGE/) and has already been used in image retrieval literature [13, 10], as well as in our earlier work on image categorization [1]. This database contains approximately 10000 small photographic
images (of sizes around 128 by 96 pixels). It is thematically very diverse.
We selected 1172 images from the database and manually assigned each of them to one of 14 categories (butterflies, US flag, sunsets, autumn, flowers, planets, satellite images of Earth, cars, mountains, clouds, sea, surfboards, sailboats, prairie animals). The intention of this selection was to have categories of varying size and difficulty. The smallest category (flags) contains 32 images, and the largest (sunsets) contains 224 images. Some of the categories, such as sunsets or flowers, have characteristic and easily recognizable color distributions, while some categories are quite similar in this respect and would therefore be more difficult to distinguish (e.g. sea and clouds, both of which have a lot of blue and white pixels).
To train the SVM classifiers, we used the LibSvm [2] program, which has the advantage of natively supporting multiclass problems. It uses the all-pairs approach to convert a multiclass problem to several two-class problems: for each pair of classes, a classifier is trained to distinguish members of one class from members of the other class. To classify a new example, it is shown to all the classifiers, each of which then votes for either one or the other of the two classes which it has been trained to separate. The class with the greatest number of votes is then adopted as the final prediction.
We compared the following approaches to image categorization:
1.	Images are represented in the HSV (hue, saturation, value) color space, which is quantized into 256 colors (the H axis is split into 16 equal ranges and the S and V axes into 4 equal ranges). Each image is then described by an autocorrelogram in the resulting quantized color space. The autocorrelograms are 1024-dimensional vectors and are used as input for linear SVM.
2.	Images are segmented into regions using the segmentation algorithm from WALRUS [10]. The IRM similarity metric [8] is then used to construct a generalized kernel as described in Section 3 above. In other words, each image is represented by a vector of its IRM similarities to all the training images; these vectors are then used as input for linear SVM.
3.	Images are segmented as in the previous paragraph. Each region is described by a short (12-dimensional) vector, which is a by-product of the segmentation algorithm. The vectors resulting from all the regions of all the training images are then clustered (here we use the same algorithm, BIRCH [14], that is also used by WALRUS during segmentation). An image is then described by a sparse vector specifying what proportion of the area of the image is covered by regions from each region cluster. Depending on the parameters of the segmentation, the average number of regions per image might vary from less than ten to more than a hundred; then, depending on the parameters of the clustering, the number of region clusters (and hence the dimensionality of the space in which our images are now represented) is usually on the order of a few hundred.
Once images are represented in this way, linear SVM can be used to train classifiers for them.
For the sake of comparison, we also report the performance of the nearest neighbor method with the IRM similarity metric (that is, each image is predicted to belong to the same class as the most similar training image). All performance values reported here are averages (and standard errors) based on tenfold stratified cross-validation.
Method	Classifìcation accuracy
Autocorrelograms Generalized kernels Region clustering	80.2 % ± 1.3 % 79.0 %± 1.3 % 70.0 %± 1.6%
Nearest neighbors + IRM	69.1% ±1.3%
As expected, the nearest-neighbor method is in general less successful than the approaches based on SVM. However, it turns out that the two segmentation-based approaches do not outperform the representation based on autocorrelograms. The performance of the generalized kernel method is not significantly different (using a paired t-test) from that of autocorrelograms, and the generalized kernel method has the additional disadvantage of much greater computational cost.
In addition, the performance of the region clustering approach is remarkably poor. A closer examination suggests that the partitioning of regions into region clusters is problematic and unstable. For example, if the centroid of each cluster is recorded and then all regions are distributed to the cluster with the nearest centroid, most of the regions will tend to move to a different cluster than they were originally attached to. This means that two otherwise similar regions might fall into different clusters by pure chance, and the similarity between their images would thus go unnoticed. The authors of the BIRCH clustering algorithm were aware of the possibility of such problems, and proposed several redistribution passes where the regions are redistributed to the nearest centroids, but in our experiments this did not lead to a really stable partition even after five or ten such passes.
An alternative way of making use of the region clustering approach might be to include the test images in the region clustering phase. This really amounts to a form of transduction, i.e. using test data as if it was simply additional unlabeled training data. It ensures that both the training images and the test images are really being represented in a space that treats both groups of images equally. In this setting, the performance of the region clustering increases considerably, and it achieves an accuracy of 86.4 % ± I.O %. However, for the comparison with other methods to be fair, transduction should also be included in the SVM learning process. Since LibSvm does not support tranduction, we used the SvmLight program [6] for these experiments; it implements Joachims' transductive SVM algorithm [7]. With transductive SVM, region clustering achieves an average accuracy of 91.9 % ± 1.0 %, while autocorrelograms achieve an accuracy of 90.7 % ± 1.1 %.
Although this difference is not really significant from a practical point of view (a t-test shows that it is statistically significant at a confidence level of 0.945, slightly below the usual 0.95), it suggests that the region clustering approach does have at least some potential to be useful.
Finally, we also considered combining several representations. An analysis of classification errors shows that classifiers based on different representations often make mistakes on different test images; that is, a it often happens that a test image is classified correctly by one classifier but incorrectly by another. For example, consider the classifiers based on autocorrelograms and on generalized kernels (with the IRM measure). Of the 1172 images, 828 are classified correctly by both; 120 only by the former; 100 only by the latter; and 128 are mis-classified by both. (To obtain these numbers, each image was classified by a model obtained from that 90% of the dataset which does not contain the image under consideration.)
Thus it seems that some advantage could be gained by combining the features of both of these representations. Many approaches exist for combining several classifiers, but with SVM, this can be done in a particularly simple way. If we have two representations, ([>1:	and (1)2,: X-^Fi, combining their features (or
attributes) would be equivalent to a new representation (]): X-^F^xFi defined by the formula (t)(jc) = ((j)i(x), (t)2Cv)). Now if the kernels A^i(x„ xj) and Kiixi, xj) correspond to some dot product on Fi and F2, respectively, the ftinction K{Xi, Xj) := Ki{xi, Xj) + Kjixi, xj) is a dot product on F{xF2. Thus we can obtain the equivalent of a combined representation simply by computing the sum of two kernels.
In our experiments, the combination of the autocorrelogram representation and the generalized kernel using the IRM similarity measure achieved a categorization accuracy of 83.7% ± 1.4 %. A t-test shows that this performance level is significantly better than that of either of these two representations individually.
6 Conclusions and future work
Our experiments show that it is difficult to use segmentation-based image representation methods in image categorization. Relatively complex ways of using information obtained from segmentation, such as the generalized kernel approach and (to a lesser extent) the region clustering approach, have been found able to compete with a simpler and more straightforward approach such as autocorrelograms but not to significantly outperform it. In the presence of unlabeled test images, the region clustering approach performs really well (relative to other representations) if a transductive SVM learner is not available. We have shown that it is possible to use segmentation-based representation in combination with another representation to achieve a small but significant increase of categorization accuracy.
We nonetheless believe that there must be ways of using segmentation more profitably for image catego-
rization, just as it is used in image retrieval, and that this is still an interesting topic for future work. In particular, it would be interesting to further explore the influence of the clustering algorithm used in the region clustering approach, and to look for more stable clustering algorithms that would allow the region clustering approach to perform better in the inductive in additional to the trans-ductive setting.
In addition, as segmentation is a relatively complex task, and segmentation algorithms usually depend on several parameters, it would be interesting to explore the influence of these various parameters on the segmentation (and consequently on image categorization) in a more systematic way.
The region clustering approach could also be augmented by taking the similarity between different clusters into account. Currently, regions that belong to different clusters contribute to different components of the sparse vectors that describe our images, and therefore whatever similarity might exist between two regions from different clusters cannot contribute anything towards our algorithm's perceived similarity between their two images. Acknowledging that regions can be at least somewhat similar even if they belong to different clusters might lead to an improved representation, but would (if taken to the extreme case) again require us to do the equivalent of comparing every region of one image with every region of the other image, which is what the region clustering approach was designed to avoid in the first place. Perhaps one could determine (from the region clustering process itself), for each region cluster, just a few most similar clusters and then compare pairs of regions from the closely similar clusters but ignore pairs of regions from entirely unrelated clusters.
Region clustering could also be integrated with segmentation. Currently, segmentation is being performed separately on each image, by clustering the descriptions of its 4x4 pixel windows; then, the region descriptions of all the images in the training set are clustered to form region clusters. These two steps could be merged by considering the descriptions of all windows from all the images as a single large set and performing clustering on this. Each image would then be represented by a vector of values showing what proportion of the image is covered by windows belonging to a particular cluster.
Combination of several kernels could also be pursued further, particularly in the direction of combining more than two classifiers and using weighted sums of kernels.
Additionally, the methods considered here should be tested on other datasets, as (given that widely different methods achieve highly similar categorization accuracy values on the present dataset) it is perhaps simply unrealistic to expect better performance on the current dataset, as the categories have an essentially "semantic" motivation that the current image representation methods simply cannot capture.
References
[1]	J. Brank: Machine learning on images (in Slovenian). Proc. IS 2001, Ljubljana, 2001, pp. 152-55.
[2]	C.-C. Chang, C.-J. Lin: LibSVM: a library for support vector machines (version 2.3). Dept. of Comp. Sci. and Inf. Eng., Nat'l. Taiwan University, April 2001.
[3]	C. Cortes, V. Vapnik: Support-vector networks. Machine Learning, 20(3):273-297, September 1995.
[4]	J. Huang, S. R. Kumar, M. Mitra: Combining supervised learning with color correlograms for content-based image retrieval. Proc. 5th ACM Multimedia Conf., Seattle, USA, 1997, pp. 325-334.
[5]	C.-W. Hsu, C.-J. Lin: A comparison of methods for multi-class support vector machines. Dept. of Comp. Sci. and Inf. Eng., Nat'l Taiwan University, April 2001.
[6]	T. Joachims: Making large-scale SVM learning practical In: B. Schölkopf et al. (eds.), Advances in Kernel Methods. MIT Press, 1999, pp. 169-184.
[7]	T. Joachims: Transductive inference for text classification using support vector machines. Proc. 16th ICML, Bled, Slovenia, 1999, pp. 200-209.
[8]	J. Li, J. Z. Wang, G. Wiederhold: IRM: Integrated region matching for image retrieval. Proc. 8th ACM Multimedia Conf., Los Angeles, 2000, pp. 147-156.
[9]	O. L. Mangasarian: Generalized support vector machines. In: A. J. Smola et al. (eds.). Advances in Large Margin Classifiers, MIT Press, 2000, pp. 135146.
[10]	A. Natsev, R. Rastogi, K. Shim: WALRUS: a similarity retrieval algorithm for large databases. Proc. ACM SIGMOD, 1999, pp. 395-406,
[11]	M. J. Swain, D. H. Ballard: Color indexing, int. Journal of Computer Vision, 7(1): 11-32, Nov. 1991
[12]	J. Z. Wang, J. Li, G. Wiederhold: SlMPLlcity: Semantics-sensitive integrated matching for picture libraries. Advances in Visual Inf Systems, 4th Int. Conf., 2000, pp. 360-37L
[13]	J. Z. Wang, G. Wiederhold, O. Firschein, S. X. Wei: Content-based image indexing using Daubechies' wavelets. Int. Journal of Dig. Lib., l(4):311-328, December 1997.
[14]	T. Zhang, R. Ramakrishnan, M. Livny: BIRCH: An efficient data clustering method for very large databases. Proc. ACM SIGMOD, 1996. pp. 103114.
Recognition of Image Authenticity Using Significant DCT Coefficients Quantization
Chin-Chen Chang*, Jun-Chou Chuang* and Tung-Shou Chen** * Department of Computer Science and Information Engineering National Chung Cheng University Chiayi, Taiwan 62107, R.O.C. Phone:886-5-2720411 ext. 33100 FAX: 886-5-2720859 E-mail: {ccc, lzchung}@cs.ccu.edu.tw
** Department of Information Management
National Taichung Institute of Technology Taichung, Taiwan 404, R.O.C. Phone: 886-4-22211181 ext. 2213 FAX: 886-4-22233545 E-mail: tschen@ntit.edu.wt
Keywords: JPEG, DCT, image authentication, signature, watermarking
Received: May 11, 2002
Traditional image authentication methods cannot preserve the authenticity after the processing of the JPEG lossy compression. This is because the JPEG lossy compression destroys the secret embedded in the image. However, the JPEG lossy compression has been so widely used that it simply exists everywhere. Thus, this kind of modification should be taken into consideration. To improve traditional image authentication methods, we shall propose a new method that can not only prevent images from being tampered with but also allow reasonable JPEG lossy compression. Our method works by extracting some significant discrete cosine transform (DCT) coefficients and setting a compression tolerant range. The extracted DCT coefficients will be able to survive when the image is not further modified or lossily compressed. The experimental results show that our method can tolerate JPEG lossy compression while keeping the image from illegal modifications.
1 Introduction
Digital images have been widely used in the computer	original image. The extracted VQ indices are called
world. However, without due protection, they can be	features. These features are embedded into the DCT
easily modified with image-processing tools. As a result,	coefficients located in the middle-frequency part of each
image authentication has become an important research	DCT block of the original image. The embedded
issue. If someone has maliciously manipulated an image,	features can be used for image tamper detection and
the image authentication system has to have the ability to	recovery. Unfortunately, this method cannot tolerate
point out the exact places that have been modified.	JPEG lossy compression (Pennebaker & Mitchell 1993).
The image after JPEG lossy compression may still be
Image authentication methods can be classified into two	considered acceptable, categories. They are digital-signature-based methods
(Bhattacharjee & Kutter 1998, Friedman 1993, Lu &	Hence, we shall propose a new method here in this paper
Liao 2000) and watermarking-based methods (Kundur &	that can prevent images from being tampered with even
Hatzinakos 1999, Schneider & Chang 1996, Lin et al.	when they have gone through JPEG lossy compression.
2000, Hung et al. 2001). In a digital-signature-based	Our method takes advantage of two distinct properties of
method, the original image is hashed and then encrypted	the image and extracts some features from these two
via the public key encryption (Rivest et al. 1978). The	properties for further verification, encryption result is called the "signature" of the image.
On the other hand, in a watermarking-based method.	The first ^property is that the DCT coefficients located in
watermarks are first embedded into an image and later	the uppet left positions contain most of the information
extracted from it to verily the authenticity. An image is	in an image block. To make use of the property, we
said to have gone through malicious manipulations if the	extract some features irom those significant DCT
retrieved watermark^.^/'^^e not identical to the	coefficients and modify them within a pre-defined
corresponding original watermarks.	compression-tolerant range. The extracted features can
tolerate JPEG lossy compression if the significant DCT
In 2001, Hung et al. proposed an image authentication	coefficients do not go beyond the compression-tolerant
method based on the DCT coefficients. Their method	range. Conversely, if the image has been maliciously
performs the vector quantization (VQ) technique on the	manipulated, the modified DCT coefficients will surely
360 Informatica 26 (2002) 359-366
C.-C. Chang et al.
exceed the compression-tolerant range, and that is how we still detect illegal modifications.
The second property of the JPEG lossy compression is that, after the encoding and decoding processes, the high frequency part of each block will be lost. This is because most of the DCT coefficients located in the lower right positions will become zero after JPEG lossy compression. Thus the 64 pixel values of each smooth image block will move toward their mean value. Based on this property, the proposed method calculates the maximum difference between the mean value and each pixel value for each block. The maximum difference should be decreased after JPEG lossy compression. In other words, if the maximum difference gets increased, then we know that the image is tampered with.
This paper is organized as follows. In Section 2, we shall review DCT property and the image authentication method by Hung et al. After that, our proposed image authentication method will be described in Section 3. In Section 4, some experiments and security analyses will be discussed. Finally, the conclusions will be presented in Section 5.
2 Related Work
2.1 Discrete Cosine Transformation
DCT is an image transformation method that can transform each pixel in the spatial domain into the frequency domain. It is very popular in applications such as image compression, image processing, watermarking, etc. The two-dimension transformation of DCT and inverse DCT (IDCT) are defined as follows respectively.
{2y+\)j7t^ 7N	7N
x=0 y^

1=0 j=0
Here c{i), c(/)=i/V^ for i, j=0, otherwise c(i), c(j)=l. Besides, ßx, y) is the pixel value in the spatial domain and F{i, j) is the DCT coefficient in the frequency domain. Generally speaking, 2-D DCT is often used to process blocks of 8*8 pixels each. Therefore, the parameter N is set to be 4.
An important property of DCT is that after DCT transforming, for an image block, the DCT coefficients located in the upper left positions contain most of the energy. That is to say, even if we only use those DCT coefficients to reconstruct a block by IDCT transformation, the image features of the reconstructed block will still be the same as the original block. That is why we can extract some significant DCT coefficients located in the upper left positions as our significant features.
2.2 The Hung et al.'s Image Authentication 2.2.1 DCT-Based Embedding Procedure
Hung et al.' s image authentication can be divided into two stages: feature extraction and feature embedding. In the feature extraction stage, they employ the VQ compression technique to process the original image. Note that the encoding block of VQ is 4x4 pixels. Therefore, they can obtain VQ indices as features, or called watermarks. Two 16-dimensional codebooks and Qr are used to encode the original image O respectively. The encoding results are Wj and Wf. Wj is the detection feature and Wr is the recovery feature. These two distinct features are used for image tampering detection and recovery.
In feature embedding, Wd and Wr are embedded into the DCT coefficients located in the middle-frequency part of each block of the original image. Here the DCT block is 8x8 pixels. The detection features of an image block are embedded into its block itself, but the recovery features are not because the recovery features will be destroyed if its block gets tampered with. To solve this problem, they use pseudorandom permutation operation to embed the recovery features of one block in another block.
Consider an image block ß,. Assume the embedded features are Wf={wu wj,..., Wj) and the middle-frequency coefficients are Mrinu, mj,..., m^. Here s is the total bit length of the features wf and . They use the hiding
fiinction and the pseudorandom number with a seed Sk to embed the features into the DCT coefficients located in the middle frequency part. The hiding function is defined as follows.
(1) (2)
H(mj,H'j) =
L4«J
m, + 2a
L4«J
iX4a + 2a if bi = 1 x4a if «0 = 0
(3)
Here a > 1 is the adjusting magnitude. A large a value will make an image become more distorted. On the other hand, a small a value cannot endure an error. After the embedding procedure is finished, IDCT is performed upon the above DCT block to obtain the embedded image. The embedded image can be published when the verification information is embedded into the original image.
2.2.2 Tamper Detection and Recovery Procedure
Given a test image O'. They use the embedded detection features to decide if the test image O' has been tampered with or not. If the answer is yes then they use the recovery procedure to recover the modified places. First, they use DCT transformation on the test image O', and then they retrieve the detection features Wf and the recovery features Wf in each DCT block. They use the
extraction function and the pseudorandom number with a seed St to extract the features in each DCT block. The extraction function is defined as follows.
E{mj) =
0	if [(m/ + a)mod(4a)]<2cir
1	otherwise
(4)
Here rrij is the coefficients in the middle-frequency part. As for the recovery features, because the pseudorandom permutation operation was previously used to permute them, the inverse pseudorandom permutation should be performed here. They compute = (jr:), where p~'
is an inverse permutation function.
Next, they perform VQ on the test image O'. The 16-dimensional codebook Qj is used to encode the test image O'. Consider a DCT block Bi of the test image. An 8x8 DCT block Bi can be divided into four encoding VQ-blocks (b,j, bi2, bis, b^) with 4x4 pixels each. Let Dj denote {du, d^, dß, di4) and dij be the VQ index of the closest codeword of the subblock bij. A block Bi is said to have been tampered with if Z), w^' and
RMSE(£.)>/, where RMSE is the root mean square
error (RMSE) between two blocks, and t is a threshold .
Here wf is the detection feature of the block Bi, and ß'
is the reconstructed block generated following the side-match VQ method (Chang & chen 1993) as follows.
When the method detects that a DCT block Bi has been tampered with, then the recovery procedure is performed. Assume Bi={bii, ba, è«, has been illegally modified. They use the recovery features w'' and w',' by looking up the codebook Qf to recover subblocks bu and 0,^. As for subblocks and bß, they use the side-match VQ method to reconstruct them. Assume the four neighbors of subblock bij are bj, b,, bu, and bd. Then the side-match method can use these four neighbors to reconstruct subblock bij. When the side-match method is adopted, the bit length of the recovery features can be reduced. Let the reconstructed block of Bi be ß'. Finally, we can obtain a recovered image ö •
The main drawback of this method is that it cannot tolerate inevitable innocent modification such as JPEG lossy compression. Therefore, we intend to propose a new method to improve it as follows.
3.1 The Signing Procedure
Given a gray-level image O. First, we partition it into nonoverlapping blocks, and then we use DCT to transform each block into a DCT coefficient matrix C{i, j), where /,7 ^7. The DCT coefficients located in the upper left positions contain most of the information of the image block, even if the image has been compressed by lossy JPEG. This is the reason why we often call them the "significant DCT coefficients". We extract some features from the significant DCT coefficients of each block for further verification. If the block is modified, then the significant DCT coefficients will also be changed. The extracted features will not be identical with the original ones.
To withstand JPEG lossy compression, we set the compression-tolerant range for those significant DCT coefficients. The detailed procedure is described as follows. Consider an image block Bi with its DCT coefficient matrix C. The proposed method chooses ten significant DCT coefficinets C(0,0), C(0,]), C(],0), C(2,0), C(l,l), C(0,2), C(0,3), C(l,2), C(2,l), and C(3,0) in zig-zag scan order (like JPEG). We apply these DCT coefficinets to represent the features of the block. Besides, we use a scale function to quantize and adjust these DCT coefficients. The scale function is defined as below.
C\iJ) =
LC(/,;)/a>a+| ifC(/,7)>0 \C{i,j)l a\<a-^ otherwise
(5)
where 0 ^ ;', j < 7, and a is the position number. The parameter a is used to quantize the significant DCT coefficients, and the value (a / 2) is used to indicate the compression-tolerant range. A large a value will make the values of the DCT coefficients change drastically; as a result, the reconstructed image will become very distorted. On the other hand, if the value of a is too small, then the image can hardly tolerate JPEG lossy compression. We can obtain a new reconstructed DCT coefficient matrix c"{ij) by adopting the above scale fucntion. Finally, we use IDCT transformation to transform each adjusted DCT block into 8x8 pixels and thus obtain a signed image O'.
3 The Proposed Method
Our proposed method is based on two properties of the JPEG lossy compression. We shall define the compression-tolerant range for the significant DCT coefficients and calculate the maximum difference for each block. The compression-tolerant range and the maximum difference are employed respectively to withstand JPEG lossy compression and to prevent the image from being tampered with. The signing and verification procedures of our proposed method will be stated as follows.
We could record the ten adjusted DCT coefficients directly and use them as the features of the corresponding block. However, this would waste a lot of storage space. The proposed method conducts the following steps to reduce the storage of the ten coefficients. First, we create a one-dimension array A whose size is 4096. Each element of A is a bit value. The maximum values of the DCT coefficients in a DCT block will not exceed the range between +2048 and -2048 after the DCT transformation. The proposed method assigns the contents of A by means of a pseudo random number generator (PRNG) with a seed Notice that we can use different seeds in different images for security's sake.
We will use the contents of A to record the ten adjusted DCT coefFicients. If the value of an adjusted DCT coefficient is p, then the proposed method retreives the corresponding bit A[2048+p] to represent p. This bit is called the "feature bit." Hence, we collect the ten feature bits as a variable/for each block. Note that/is the final result of the proposed method according to the ten significant DCT coefficients.
How is the extraction of the features related to security? The features extracted by the above procedure do not contain any information about the DCT coefficients located in the lower right positions. What if someone maliciously manipulates the DCT coefficients located in the lower right positions, then? For example, assume that a DCT coefficient located in a lower right position is 0. If we modify this coefficient to be 1000, then the reconstructed block will be different from the original block. However, depending on features extraction only, we cannot solve this kind of problem. Because the above procedures only consider the significant DCT coefficients located in the upper left positions, we think it is necessary to add some processes to assist the features extraction to solve this kind of problem.
The lower right DCT coefficients of a block processed by the quantization table of JPEG will get close to zero or become zero. These DCT coefficients indicate the high frequency part of the block. Their zero values represent that the pixel values of the corresponding block usually have lower variance. That is, the pixel values of the block will be close to their mean value in most cases. Let m denote the mean value of a block B. The proposed method employs m to subtract each pixel value in B and picks out the maximum absolute different value as the maximum difference T of B. According to the results of our experiments, after JPEG lossy compression, for the same block B, its maximum difference T' shall not be bigger than two and a half times the original difference T.
Finally, we disclose the signed image O'. The proposed method stores a quantization value a, the seed S^ of PRNG, the maximum difference T of each block, and the feature/of each block in the certification authority (CA). The maximum difference and the features of each block will be used to verity the signed image O'.
[Signing Procedure]
Input: A gray-level image O, a seed S^ of PRNG,
and a quantization value a. Output: The signed image O', the maximun difference T of each block, and the feature / of each block.
Step 1: Partition an image O into nonoverlapping blocks and use DCT to transform each block into DCT coefficient matrix C(i,J), where 0
Step 2: Use a scale function to quantize and adjust ten signification DCT coefficients located in the upper left positions.
Step 3: Create a one-dimension array A whose size is 4096. The content of A is assigned by PRNG with a seed Sk.
Step 4: If the adjusted DCT coefficient is p, then retrieve the corresponding bit A[2048+p]. Collect the ten featue bits and generiate the feature/for each block. Step 5: Use IDCT trasnformation to transform each adjusted DCT block into 8x8 pixels and then obtain a signed image O'. Step 6: Partition the signed image O' into nonoverlapping blocks and calculate the maximum difference T for each block. Step 7: Disclose the signed image O' and store the quantization value a, a seed St of PRNG, the features / of each block, and the maximum different T of each block in the certification authority (CA).
An example of the signing procedure is illustrated in Figure 1. Figure 1(a) shows a block of 8x8 pixels. We use DCT to transform this block into DCT coefficients and then list them in Figure 1(b). After that, we use the scale function to adjust ten of the significant DCT coefficients. The quantization value a in the scale function of this example is 8. The adjusted DCT coefficients are shown in Figure 1(c). The ten adjusted significant DCT coefficients are 892, -132, 84, -12, -164, 44, -4, 84, 148, and 132. Next, we create a one-dimension array A. The content of A, listed in Figure 1(d), is assigned by PRNG with the seed S^. The extracted ten feature bits are 1, 1, 1, 0, 1, 0, 0, 0, 0, and 0 since A[2048+892]=l, A[2048+(-132)]=l, ..., A[2048+( 132)]=0. After the above processes, the proposed method uses IDCT to transform the adjusted DCT coefficients block into 8x8 pixels. The signed block is shown in Figure 1(e). The mean value and maximun difference in this example are 111 and 98 respectively. Finally, we store the quantization value a (i.e., 8), the seed Sk of the PRNG, the maximum difference (i.e., 98), and the feature bits (1, 1, 0, 1, 0, 0, 0, 0, 0, 0) in CA.
3.2 The Verification Procedure
The basic idea of the verification procedure is that we will extract the maximum differences and features of the signed image and compare them with the conesponding maximum differences and features stored in CA. The image is tampered with if they are not identical.
At the beginning, we input a signed image O'. The proposed method requests the quantization value a, the seed Sk of the PRNG, the feature / of each block, and the maximum difference T of each block from CA. Next, the signed image O' is partitioned into nonoverlapping blocks where the block size is 8x8 pixels. Consider a block ß,. We extract the maximum difference T of 5,. The maximum difference calculation method here is identical with that in the signing procedure. The block will be proven to have been tampered with if the
maximum difference T is larger than 2.5xT. Here T is the maximum differnece of B stored in CA. Suppose T < 2.5xT. We cannot say for sure that this block has not been tampered with for now. The proposed method has to do the following checking. The proposed method uses DCT to transform Bi into DCT coefficients. The feature / in ß; will be extracted. The feature extraction procedure in the verification procedure is the same as that in the signing procedure. The block ß, can be proven to have not been tampered with if the extracted feature f of the signing image is identical with the corresponding feature/stored in CA. After the checking is all done, the blocks tampered with are marked "Yes" and the blocks not tampered with are marked "No".
[Verification Procedure]
Input: The signed image O'. The quantization value a, the seed St of the PRNG, the feature/of each block,and the maximum different T of each block from CA. Output: 'Yes' or 'No'. 'Yes' means the block has been tampered with. 'No' means the block has not been tampered with. Step 1: Partition an image O' into nonoverlapping blocks.
Step 2: Extract the maximum difference T for each block.
Step 3: Compare T with T in the same block. The block is proven to have been tampered with if T is larger than 2.5^T. If so, stop verifying the block and mark "Yes" for this block. Otherwise, we cannot yet say that this block has not been tampered with for now. The proposed method has to do the following checking.
Step 4: Extract the feature/ in the remaining blocks. Step 5: Compare/with/in the same block. If they are identical, the block is marked "No". Otherwise, the block is marked "Yes".
An example of the verification procedure is illustrated in Figure 2. Figure 2(a) shows a violence block of 8x8 pixels to be verified. The mean value and maximum difference in this example are 68 and 129, respectively. The maximum differences in Figures 1 and 2 are 98 and 129, respectively, and the maximum value 129 is bigger than 98. The distortion here is acceptable in terms of the proposed method. Thus we need to verify this block again in the following. The proposed method applies DCT on this block. The DCT coefficients of this block are listed in Figure 2(b). After that, we use the same scale function as was used in the previous example to adjust the ten significant DCT coefficients. The quantization value a is 8 in the scale function. The quantized results are 548, -44, 28, 12, -284, 44, -20, 356, 20, and 220. Next, we create a one-dimension array A by PRNG with the seed S^ and list it in Figure 1(d). The extracted ten feature bits are 1, 1, 1, 0, 1, 0, 0, 1, 0, and 1 since A[2048+548]=l, A[2048+(-44)]=l,..., and A[2048+220]=l. The proposed method picks up the
original feature from the previous example and compares it with the above results. Finally, we find that they are not identical since (1, 1, 0, 1, 0, 0, 0, 0, 0, 0)=^t(l, 1, 1, 0, 1, 0, 0, 1, 0, 1). This means this block has been tamperd with.
4 Discussions and Analyses
The number of the significant DCT coefficients in our experiment is set to be 10. We select ten significant DCT coefficients to represent the feature of a block because they are enough to keep most of the information of a block according to our experiments. If we use more feature bits, then the security of the proposed method can be improved significantly, but the image qualify of the signed image will be decreased at the same time. This is because most of the significant DCT coefficients have been modified and adjusted by the scale function. In addition, we need to store more feature bits of each block in CA. On the other hand, if we use less significant DCT coefficients in the proposed method, then the image qualify of the signed image will be improved. However, the coefficients will be not enough to represent the feature of a block and thus will increase the probabilify of undetected tampering.
The scale function uses quantization value a to quantize and adjust the significant DCT coefficients. If we increase the value of a, the extracted features will be able to survive under high JPEG lossy compression. That is, the compression-tolerant range will be increased. However, the image qualify of the signed image will be decreased at the same time. This is because the increase of quantization value a will enlarge the variety of the DCT coefficients. On the other hand, if we decrease quantization value a, then the image qualify of the signed image will be increased, but the compression-tolerant range will be decreased.
The feature bits and the maximum differences in this paper are used to verify the signed image. The number of the feature bits is the same as that of the significant DCT coefficients. If we extract more feature bits, then the security of the image can be improved, but we need to store more features. Besides, the selection of the maximum difference of each block is also very important. A large difference will increase the pixel's compression-tolerant range, but a small difference can decrease the probability of undetected tampering.
Consider the following cases of malicious manipulations. First, if someone wants to use image-processing tools to slightly modify the image directly, we can consider this kind of modification acceptable like reasonable JPEG lossy compression. Otherwise, we can use the features to point out the modified areas. Second, if someone tries to remove the features, it will be very difficult. The security of our method is based on PRNG and the features stored in certificate authority (CA) for further verification. Without the seed Sk of PRNG, nobody can get the features. Third, if someone wants to maliciously
manipulate the non-significant DCT coefficients like setting a large number on those non-significant DCT coefficients, we can use the maximum difference to point this modification out.
Our experiments were executed on IBM personal computer with a Pentium 133 CPU. The image-processing tool employed in the experiments was Photoshop 5.0. Two test images 'Lena' and 'F16' were used. Their original images are shown in Figures 3 and 4. Their image size is 256x256 pixels each. Table 1 lists the PSNR values versus a under some quantization values. Table 2 shows the error block numbers of two signed images under some JPEG compression quality. In Photoshop, the parameter q represents compersssion quality, and its value is between zero to ten. A large q value means better image quality but also means low compression rate, and vice versa.
The peak signal to noise rate (PSNR) is used to evalute the image quality, and it is defined as follows:
PSNR=10xlog,o:£^ dB.	(6)
MSE
The mean square error {MSE) for an N>^N gray-level image is
j n n
MSE={- fYj E	(7)
^ i=i j=i
Here x,y denotes an original pixel value, and X y denotes the corresponding decoded pixel value. Besides, we define the compression rate (CR). The compression rate is defined as follows:
CR=^.	(8)
X
Here X means the original image size while X means the compressed image size.
Two extra experiments were conducted to illustrate the performance of our method except the above experiments. The quantization vlaue a was set to be 8 in these two extra experiments. In the first experiemt, we input the original image 'Lena' shown in Figure 3 and output the signed gray-level image 'Lena' in Figure 5. After that, we used JPEG lossy compression to compress the signed 'Lena', where the compression rate was 3 (CR=3), and, moreover, we modified the compressed 'Lena' in the eyes. The decompressed 'Lena' after compression and tampering is shown in Figure 6. The result of the verification procedure is listed in Figure 7. According to our experiment, the decompression result is "acceptable"; and the eyes tampered with can also be pointed out by our method. Such phenomenon also exists in 'Fl6'. The experimental results are shown in Figures 8-10.
5 Conclusions
In this paper, we have proposed two important properities of image authentication. Taking advantage of those two properities, our method can both prevent images from being tampered with and allow acceptable
JPEG lossy compression. First, we set the compression-tolerant range for significant DCT coefficients. The block can withstand JPEG lossy compression if the significant DCT coefficients after JPEG lossy compression do not fall off the compression tolerantrange. Second, we set the maximum difference for each block to detect malicious manipulations done to the nonsignificant DCT coefficients. According to our experimental results, our method can indeed withstand JPEG lossy compression while keeping the image from being tampered with.
6 References
[1]	Bhattacharjee S. & Kutter M. (1998) Compression Tolerant Image Authentication. IEEE International Conference on Image Processing, 1, p. 435-439.
[2]	Friedman, G. L. (1993) The Trustworthy Digital Camera: Restoring Credibility to the Photographic Image. IEEE Transactions on Consumer Electronics, 39, 4, p. 905-910.
[3]	Lu C. S. & Liao H. Y. (2000) Structural Digital Signature for Image Authentication: An Incidental Distortion Resistant Scheme. Proceedings of the Workshops on ACM multimedia, p. 115 - 118.
[4]	Kundur D. & Hatzinakos D. (1999) Digital Watermarking for Telltale Tamper Proofing and Authentication. Proceedings of the IEEE, 87, 7, p. 1167-1180.
[5]	Schneider M. & Chang S. F. (1996) A Robust Image Content Based Digital Signature for Image Authentication. International Conference on Image Processing, 3, p. 227-230.
[6]	Lin E. T., Podilchuk C. 1. & Delp, E. J. (2000) Detection of Image Alternations Using SemiFragile Watermarks, Security and Watermarking of Multimedia Contents, 3971, p. 152-163.
[7]	Hung K. L., Chang C. C. & Chen T. S. (2001) Secure Discrete Cosine Transform Based Technique for Recoverable Tamper Proofing. Optical Engineering, 40, 9, p. 1950-1958.
[8]	Rivest R., Shamir A. & Adleman L. (1978) A Method for Obtaining Digital Signatures and Public-key Cryptosystems. Communications of the ACM,1\,1, p. 120-126.
[9]	Pennebaker W. B. & Mitchell J. L. (1993) JPEG: Still Image Data Compression Standard, New York, Van Nostrand Reinhold, 1993.
[10]	Chang R. F. & Chen W. T. (1993) Side-match vector quantization for reconstruction of lost block. Journal of Visual Communications and Image Representation, 4, 2, P. 171-177.
Table 1: The image quality (PSNR) of the signed images under different quantization vlaues a=4, 6, 8, 10, and 12
	a=4	a=6	a=8	a=10	a=12
F-16 Lena	50.04dB 52.07dB	49.77dB 49.87dB	47.72dB 47.97dB	45.93dB 46.16dB	44.36dB 44.62dB
Table 2: Numbers of Error blocks of the signed images pointed out after different JPEG compression qualities q= 8, 5, and 1
Signed images			a=4	a=6	a=8 a	= 10	a=12
F-16 (^=8)			0	0	0	0	0
F-16(o=5)			942	586	230	70	0
¥-16 (cr	1)		1024	981	853	820	640
Lena (9=8)			0	0	0	0	0
Lena (q=5)			960	597	225	74	0
Lena (0=1)			1024	1018	921	873	650
	Ì57	121	130	136 122	128 147	130 "	
	85	79	69	69 54	55 57	54	
	58	45	46	46 41	67 105	67	
	52	72	61	61 56	92 197	177	
	54	96	98	98 68	130 210	208	
	98	92	116	116 136	186 198	188	
	142	128	115	115 127	136 134	154	
	156	151	144	144 139	141 144	162	
(a) A block of 8x8 pixels
895	-132	85	-12	-27	38	-14	-1
-167	41	-4	4	-2	24	-11	5
83	148	-46	31	45	-19	24	0
134	-41	-9	2	-2	2	15	-3
100	-40	32	-35	-7	12	-18	-1
29	-10	14	16	8	4	-4	11
40	-24	-18	21	-6	14	9	6
-14	15	2	-6	2	3	0	-6
(b) The DCT coefficients
892	-132	84	zP^	-27	38	-14	-1'
-164	44		4	-2	24	-11	5
84		-46	31	45	-19	24	0
13^		-9	2	-2	2	15	-3
<00	-40	32	-35	-7	12	-18	-1
29	-10	14	16	8	4	-4	11
40	-24	-18	21	-6	14	9	6
-14	15	2	-6	2	3	0	-6
(c) The adjusted DCT coefficients
Coeificient	0		1764		1884		1916		2004
PRNG	1		1		0		1		1
Coefficient	2028		2036		2044		2060		2068
PRNG	0		1		0		0		0
Coefficient	2076		2092		2096		2132		2140
PRNG	1		0		1		0		0
Coefficient	2180		2196		2212		2268		2404
PRNG	0		0		1		1		1
Coefficient	2596		2940		3000		3500		4096
PRNG	1		1		0		0		1
(d) The content of an array A
156	120	130	136	122	127	145	130
85	78	69	68	54	55	58	54
57	45	45	46	41	67	105	67
52	71	61	61	55	92	195	176
55	95	97	97	69	130	209	207
98	92	115	115	135	185	197	187
141	127	116	114	127	135	133	153
156	150	143	143	139	141	144	161
(e) The signed block Figure 1: An example of the signing procedure
157	121	130	136	122	128	147	130
0	0	0	0	0	0	0	0
0	0	0	0	0	0	0	0
0	0	0	0	0	0	0	0
54	96	98	98	68	130	210	208
98	92	116	116	136	186	198	188
142	128	115	115	127	136	134	154
156	151	144	144	139	141	144	162
(a) A violence block of 8x8 pixels
550	-42	26	10	0	8	7	0
-281	40	-16	-2	3	14	-6	2
359	20	18	2	11	6	0	0
216	-52	7	12	-7	17	5	2
8	46	-1	-5	8	6	-4	1
132	4	10	7	6	3	0	0
164	-41	0	2	-9	13	3	1
-66	41	1	-1	9	- 4	-3	0
(b) The DCT coefficients
548	-44	28		0	8	7	0
-284	44		- 2	3	14	-6	2
356		18	2	11	6	0	0
	-52	7	12	-7	17	5	2
	46	-1	-5	8	6	- 4	1
132	4	10	7	6	3	0	0
164	-41	0	2	-9	13	3	1
-66	41	1	-1	9	-4	-3	0
(c) The adjusted DCT coefficients Figure 2: An example of the verification procedure
Figure 3: The originai image 'Lena' IPHil^
Figure 4: The originai image 'F 16'
Figure 7: The verification result
Figure 8: The signed image 'F16' where the PSNR=47.72 dB
Figure 5: The signed image 'Lena' where the PSNR=47.97 dB
Figure 9: The 'Fl6' with airframe tampered with
Figure 6: The compressed 'Lena' with the eyes tampered with
Figure 10: The verification result
Protecting the Data State of Mobile Agents by Using Bitmaps and XOR Operators
Jesiis Arturo Pérez Diaz,	,
ITEMS, Campus Cuervanava
Av. Paseo de la Reforma 182-A, 62589, Temixco, Morelos, Mexico.
Jesus. arturo.perez@.itesm.mx
AND
Dario Alvarez Gutiérrez, University of Oviedo Calvo Sotelo s/n, 33007 Oviedo, Spain. darioa@.pinon.ccu.uniovi.es
Keywords: security, mobile agents, data state protection, encryption Received: June 20, 2001
Mobile agents have been considered a promising technology to develop e-commerce applications, however the security concerns about the technology have stopped their widespread use.
The identified security areas comprise protecting hosts against malicious agents, protecting the agent's transmission and protecting agents against malicious hosts. The first two security issues and the protection of the agent's code state can be solved by applying traditional security techniques. Even though there are some works that manage the privacy of execution, their implementation is almost unfeasible in terms of performance and complexity.
This paper describes a fast and easy to implement algorithm that a mobile agent can use to encrypt its data during its itinerary. The algorithm only makes use of a bitmap and XOR operations. The algorithm consist of applying XOR operations to the data to be ciphered and a random bitmap, while the map is repeatedly shifted to the right or to the left in order to compute a CRC field for validation against malicious tampering.
The method only uses basic bit operations so that its implementation is very easy to develop. Besides, since it does not use any computationally expensive cryptographic technique (i.e. digital signatures) it is very fast. In this way we manage to have a secure, simple, fast and feasible protection algorithm to protect data while mobile agents are roaming, where simplicity and performance are its better advantages.
•	Protection of the agent system against attacks from
1 Introduction	mobile agents
Mobile Agent Systems are expected to make e-commerce •	Protection of the agent against other agents.
transactions inside virtual supermarkets. In this	t. .	ìt • x-	^ • • u ^
,. ..	. .	., • Protection of mformation transmission between
agent servers against unauthorized third parties.
application area security is crucial since we can consider that any application will not be usefial without doing
secure transactions.	• Protection of the agent against malicious agent
Mobile agents consist of code state, data state, and	systems (malicious hosts), which includes protection
execution state. Mobile agent systems are platforms that	^^^^^ ^^
allow agents to migrate from one node (a mobile agent	Different security architectures for mobile agents [1] and
system) to another, keeping its three states. While agents	mobile agent systems [2] [3] have used standard
migrate there are several security aspects involved. We	cryptographic techniques like public key cryptography, or
can point out different mechanisms that must be	digital signatures to authenticate authorities and solve the
implemented by the mobile agent system to ensure the	problem of protecting the host against malicious agents,
security of the mobile agent applications. Mobile agent	Also, they have implemented secure channels for the
systems basically must provide:	transmission of the agents by using SSL or TLS.
Nevertheless, the protection of mobile agents from malicious hosts is only partially solved.
The code state of the agent can be signed since it will not be modifiable. In this way, we can protect the static part of the agent. However, to protect the data state (that changes dynamically) becomes a more difficult task to tackle.
There are some works in this area, described in the related work section. However, there has not been found any solution having a feasible implementation.
Consider that most of mobile agent e-commerce applications do not need to protect all the data state, but only some important values where agents filter information and compile their results. The algorithm presented in this paper protects all the data that the agent decides to encrypt by calling a cipher fiinction. When the agent returns, only the source server is able to decrypt the sensible information stored by the agent.
The algorithm describes an easy way to protect sensible data that must be gathered and are carried ,by mobile agents alongside their itinerary. Two principal advantages are highlighted: the algorithm is simple and feasible to implement, and computationally inexpensive.
2 Related Work
Wilhelm presented a technique for protecting the itinerary of the mobile agents by using hardware mechanisms [4], He considered that software algorithms were not enough to ensure complete security during the mobile agent's itinerary. Even though the technique managed to achieve the protection of the itinerary, its implementation in real applications becomes difficult, since special hardware is required.
One interesting approach to avoid the malicious host attacks was proposed by Fritz Hohl [5]. This approach, which is called Code Mess Up, consists of a combination of two mechanisms: the first one generates a new and far less understandable version of the agent's code. The second mechanism restricts the lifetime of the agent's code and data. In this way, when the code of the agent is messed up, the malicious server would take some more time in order to understand the code and then attack it, but since the agent's lifetime is restricted, the malicious server will not have enough time to attack the agent. In this way the agent remains untouched.
Another solution for this problem was proposed by Tomas Sander and Christian Tschudin [6] [7]. They presented techniques on how to achieve "non-interactive computing with encrypted programs" in certain cases and give a complete solution for this problem in important instances. They further show how an agent might securely perform a cryptographic primitive, digital signing, in an untrusted execution environment. Their results are based on the use of homomorphic encryption schemes and function composition techniques.
The last two solutions were designed to offer privacy on the agent's execution, but not to give privacy and integrity to the agent's data. Beside, both of them have two main problems: a quite difficult implementation and a considerable performance hit in case of implementation. Perhaps these disadvantages are the reason why these techniques have not been implemented by any mobile agent system, as far as we know, and the problem in current mobile agent systems is still unsolved.
Considering these related works, the goal of our research is to offer a simple and feasible to implement algorithm that can be used by mobile agents just for encrypting the data that they gather while they are roaming in untrusted execution environments, and without a perceivable performance hit.
3 Data Encryption Using Bitmaps and the XOR Operation
The design of this data protection technique takes into account the fact that, in most applications, it is not vital to protect the whole data state of the mobile agent but some variables holding sensible data gathered by the agent, which is the main goal of the agent's travel and need to be protected.
Typical examples of these applications are e-commerce applications in which an agent travels alongside an itinerary looking for prices or particular services. The vital information that must be protected is the price or service offered in each visited server.
This technique requires that the agent travels holding data generated by the source server that will be used by the agent to encrypt the sensible data gathered, using fast XOR operations.
3.1 Usefulness of the XOR Operator
The main encryption idea is to apply the XOR operation between data and a random number (expressed as a bitmap in a row of a matrix and known only by the source server) to encrypt information. Once the agent returns to the source server, the XOR operation is applied again to the encrypted data, using the same random number, and the information is restored.
The agent in the source server will generate two matrixes with a number of rows equal to the number of data items it expects to encrypt. Initially, both matrixes will be filled with the same random numbers (forming a random background bitmap). One of the matrixes will be stored in the source server and the agent will carry the other.
For example, let's assume a 10 (binary 1010) is generated as a random number and put it in a row of the matrix. This number goes with the agent and a copy is also stored in the source server. During the itinerary, the agent gets a 3 (binary 0011) that the agent wishes to encrypt. The XOR operation will be then applied between the random number and the datum to be protected (1010 ® 0011),
giving 1001 as a result. This datum is stored in the same row of the matrix, overwriting the initial random mask, in order to avoid the next server seeing the random number used to encrypt the datum. Also, the next server is not able to know the real datum, since it ignores the random number used to apply the XOR operation.
A given server will use the next free row available in the matrix to store new data, as the occupied rows contain data encrypted in previous servers. In this way, the current server will never be able to know the previously encrypted data since it does know neither the datum nor the random number.
The source server had stored in a duplicate matrix a copy of the random numbers, in order to retrieve when the agent returns, the data encrypted by the agent while roaming from server to server. Thus, to retrieve the datum, the encrypted number 1001 will get a XOR applied with the corresponding random number generated in the source server (1001 XOR 1010), giving 0011 as a result (3, the datum the agent had encrypted).
To assure that the information restored upon return to the source server has not been tampered with, and is the same information that the agent encrypted in each server, a CRC field is computed in order to perform an integrity test.
The complete encryption algorithm is described in the next section.
3.2 Detailed Description of the Encryption Algorithm
A matrix with several fields is defined (table 1), which is used to encrypt the agent's data and for validation of the data later. The matrix is initially filled with random numbers, creating a background bitmap used to encrypt data gathered by the agent alongside its itinerary (table 2). The matrix is duplicated. One copy travels with the agent and the other is kept in the source server. The source server's matrix is used to recover the data upon agent's return.
Every datum to be protected by the agent needs a row of the matrix, so the agent must know beforehand the approximate amount of data it is going to use.
The structure of the matrix is as follows:
Data Area!of the Mobile Age n(						
Host ID	Data to be protected				CW	CRC
128 bits	128 bits	128 bits		128 bits	128 bits	128 bits
fl	ß	ß		fn		
The first field is the identifier of the server, as we need to know, for each row, the place where data was encrypted. The second field represents the space needed to store the data to be gathered by the agent, in 128 bit blocks. The third field is the "codeword", which is a random number to be generated in the remote server. The codeword is used to rotate data before applying the encryption function. The last field is a CRC, which is computed applying a XOR operation using all the 128 bit blocks in the data area. This CRC is used upon agent return to verily that the data area has not been altered.
ij;. Data Area of tK'elMobile Agent H ' ' 1 •iimii.iip! " . — ■						
ID Host	Data to be protected				CW	CRC
128 bits	128 1 bits	128 bits	1 128 1 bits		128 bits	128 bits
0101	1101	0011	010	0101	0100	IIOI
1100	0001	1101	010	0001	0100	0100
1011	0011	0101	001	1101	1001	0100
Table 1. Fields composing the rows of the matrix
Table 2. Matrix filled with a random generated bitmap
When the agent departs from the source server, the agent carries the matrix filled with random numbers, creating a background bitmap that is used to hide information, as shown in the next table:
Alongside the itinerary, the following algorithm is applied for each datum to be encrypted:
1.	The remote server creates a record with the same fields than a row of the matrix that the agent has.
2.	The host ID, data to be encrypted in 128-bit blocks form, and a generated random codeword (CW) are put into the record.
3.	Each 128 bit blocks? is rotated to the left as many times as indicated by the 7 less-significant bits of the CW. That is fi fi« li, where li	07Fh.
4.	Before applying the third step on fi+1, the CW is rotated to the right as many times as indicated by the 7 most-significant bits of the CW. Thus, the number of times that each fi is rotated is not always the same. CW will then heCW^CW» mi where mi (CW « 7) & 07Fh. Once the CW is rotated step 3 is repeated. These tasks will continue until no more 128-bit blocks are left.
5.	The original CW is restored into the corresponding field of the register in order to retrieve the original information using the inverse algorithm in the source server.
6.	The CRC field is computed as follows. The initial value is filled with binary O's, and then it is XOR'ed in sequence (from left to right) with all the 128 bit blocks in the data area, giving the final CRC value.
7.	Lastly, the corresponding row in the matrix holding the original bitmap is XOR'ed with the generated register (data) to be protected, thus encrypting the data.
8.	The counter indicating the number of lines used in the matrix is incremented so that the next row available of the matrix can be used.
It is worth to note that, once the mobile agents arrive at a new server, the new server can not access the information stored in the previous server, as the background bitmap held in the previous row before the XOR operation was applied can not be guessed'. Only the source server, who has a copy of the original matrix, is able to apply the inverse algorithm to retrieve the encrypted data.
The CRC field is computed in order to detect any alteration made by a malicious server on the encrypted data. Without any validation action, a malicious server could modify just one bit on the encrypted data field and when the agent returns to its source server it would not be able to detect that alteration. So, the agent would recover a wrong modified value since the source server just applies and XOR operation with the stored copy of the matrix in order to get the encrypted value.
The CRC field does not prevent a malicious server from making any alteration, but it ensures that if an alteration were made it would be detected since the CRC field will be invalid.
The bit rotations made in step 3 may appear unnecessaiy. However, if the blocks are not rotated, a malicious server could alter the encrypted information in just one bit in a specific position, and the CRC may not change since each block is XORred with the next block. Then, upon return to the source server, this alteration would not be detected.
On the other hand, once the random rotations are applied, a maliciously-altered bit in one block would be detected, as this bit affects many positions in the inverse decryption algorithm (since the position of that bit will change after rotations are applied), rendering always an invalid CRC that will detect the alteration.
To retrieve the information encrypted by the agent alongside the itinerary, the source server just applies the XOR operation to each row of the matrix that was used by the agent, with the corresponding row of the copy of the original matrix holding the initial background random bitmap.
Then, using the random CW, inverse rotations are applied to retrieve the real data that was encrypted by the agent in a given intermediate server.
The main advantage of this technique, encrypting data using bitmaps and XOR operations is that is very easy to implement, compared with other methods, which use very complex mathematical algorithms [8]
Besides, it is computationally inexpensive, as only very fast bit operations are used, avoiding effectively the performance impact of other techniques such as digital signatures, keys, or any other means that hurt performance.
4 Feasibility of Implementation and Incorporation to Current Mobile Agent Systems
A great advantage of our protection scheme is the feasibility of implementation. Besides, it could be very easily incorporated to the current mobile agent systems' security mechanisms.
The majority of Java-based mobile agent systems define an abstract class called Agent. All the agents programmed by the user inherit from this class the required functionality, so that the agent can migrates from one host to another or can create more agents.
This abstract class usually follows a pattern like this:
public abstract class Agent implements java.io.Serializable{
public void run()
public final java.lang.Object clone()
public final void createAgent(.....)
public final void dispatch(java.net.destinationURL) public final void revert()
}
We just need to add an addsecure() method to the agent abstract class in order to allow the agents to securely store sensible information in the data structure(the matrix of bitmaps) that is carried with them, so we could define:
public final void addsecure(Object data)
The implementation of this method will encrypt the information using the algorithm described in the previous section and will store it in the next row available of the matrix that is carried with the agent.
The matrix can be easily defined in Java using a Java array (i.e. an instance of the class vector) that will hold the background bit map originated in the source server.
In this way, each time that an agent is created by a user (i.e. commerceAgent), it will inherit the addsecure() method allowing it store information in a secure way.
public class commerceAgent extends Agent {
' Only the next available rows, not used yet, can be seen. This can be used for an attack that is described later.
}
When the user creates an instance of commerceAgent, the instance will be able to protect the information it is gathering, just by invoking the addsecureQ method. For example:
commerceAgent findFlyAgent;
defines an agent of type commerceAgent. The run method of the findFlyAgent would contain the instructions to query the price of the fly it is looking for, once it gets the price it records for later analysis at the home server, so the last line of the run method would be:
findFlyAgent.addsecure(FlyPrice);
in order to protect the sensible datum it has gotten. The current server will execute the encrypting process and will store the FlyPrice safely in the matrix. The next server visited will not be able to find out what was the price in the previous one and it just will be able to encrypt the information that the agent gathers in that server.
5 Limitations of the Method
The algorithm allows to protect the information the agent decides during its itinerary, and to verity that it has not been altered when the agent returns. The algorithm does not prevent the possible alteration of data from malicious hosts, but detects any modification that has been made. In this way, if any alteration is detected (which means a CRC field is invalid) the agent will reject the information since it would be considered invalid. In this way our technique offers integrity.
The current server will never be able to access the previously encrypted data since it ignores the data and the random number used to apply the XOR operation. However, it can see and copy the still available rows with random numbers that will be used to encrypt the next data not only in the current server but in the next server as well.
The first, and most evident deriving of this, is that a visited server cannot retrieve the data that was encrypted before, but could easily make a copy of the rest of the background bitmap. This means that a server could potentially retrieve the data encrypted in the future by an agent, assuming that the agent visits again the same server. Thus, an agent should not visit the same server twice if it wants to be completely secured.
Another possible attack (although less probable) is that two cooperating malicious servers teamed to retrieve the information carried by the agent. The first server would send to the second one a copy of the unused part of the background bitmap already known by the first server (the available rows of the matrix). If the agent arrived later to the second malicious server, it would be able to retrieve the data encrypted since the agent left the first malicious server and then modify the values.
The last limitation is that there is a fixed maximum number of data that can be protected, which is given by the length of the matrix (the length must be set in advance). However, in practice, a reasonable length could be set, according with the expected task to be carried by the agent.
Finally, this technique does only protect the part of the data state of the agent that the agent wishes to encrypt.
The rest of the data, such as local variables, etc. are not protected.
6	Future Work
We will continue working on this technique in order to implement an improved algorithm that avoids the current limitations. Nevertheless we intent to use only the operations included in this paper (bit rotations and XOR operation), or equally fast or simple ones, so that we can keep its simplicity and fast speed which are the objectives and philosophy of this work.
7	Conclusions
One of the problems that a mobile agent system must solve is the protection of agents from malicious hosts, which includes the protection of the data state of an agent. This is very important in order agent technology be adopted in e-commerce applications, for example in applications where agents collect information (such as flight prices) for later analysis at the source server.
Protecting this data gathered by the agent (and not the whole data state, which is not vital) is the objective of the research described in this paper. For example, malicious servers should not be able to see or modify the information gathered in order to change previous low prices to make its price appear as the best.
Other techniques such as [5] and [6] try to solve the problem by privacy of execution applying very complex techniques, which are very difficult to implement, and, more importantly, are very expensive computationally, as key cryptography is used. This is a hurdle very difficult to overcome in practical systems.
We propose a new technique to protect the part of the data state of an agent (the data gathered the agent wishes to protect) that dose not suffer from these limitations, as it is fast and easy to implement.
A matrix is generated at the source server, and filled initially with random bit numbers. Each row is used to protect one datum, and is divided into 128 bit blocks. A copy of the matrix is stored at the source server, and the other copy travels with the agent.
To protect one data item, the agent uses a row, and applies the XOR operation to the data item with the random number held previously, encrypting it and overwriting the initial random number with the result. A CRC is computed using XOR operations also, in order to detect alterations when returning to the source server. Some bit alterations would not change the CRC, so the random bitmaps are bit-rotated n-times, as indicated by a random codeword that is also held in the matrix (and is also rotated).
The original server is the only one able to decrypt the information, since the inverse algorithm (basically undoing the rotations and applying the XOR operation with the original random bit map) requires the knowledge
of the original random bit map and codewords, which is only know (the bitmaps) by the original server.
The only limitation is that an agent should not visit the same server twice, or a server co-allied with a malicious server, as a copy of the matrix could be made and subsequent encrypted data items could be retrieved. A minor limitation is that the agent should estimate the maximum number of data items to protect, as the matrix must be generated beforehand.
The technique we have presented removes the complexity and computational limitations of other techniques, which hinder the acceptance of agent technology in real applications. Agent's data state protection is made feasible in practical applications, as no performance hit is introduced because no expensive key cryptography is used. Furthermore, the algorithm could be a lot of times faster than any other that uses traditional key cryptographic techniques since only bits operations are used.
This algorithm can be easily integrated in current mobile agent systems in order to create basic e-commerce applications that compile information securely.
Institute (ICSI) Technical Report, 97(049):1-14. November, 1997.
[8]	Joan Feigenbaum, Peter Lee. Trust Management and Proof carriying code in Secure Mobile-Code Applications. Accepted paper to the DARPA Workshop on Foundations for Secure Mobile Code Workshop, 26 - 28 March 1997.
[9]	Sobrado Igor. Evaluation of two security schemes form mobile agents. Proc. ACM SIGCOMM -LatinAmerica and Caribbean 4/01, San Jose, Costa Rica, April 3-5 2001.
References
[1]	Pérez Diaz Jesus Arturo, Alvarez Gutiérrez Dario. Sahara: a comprehensive security architecture for mobile agents systems. Simposio Espanol de Informàtica Distribuida. ISBN: 84-8158-163-1. Orense, Spain.
[2]	Pérez Diaz Jesus Arturo, Alvarez Gutiérrez Darlo, Cueva Lovelle Juan Manuel. An implementation of a secure Java2-based mobile agent system. The Fifth International Conference on The Practical Application of Intelligent Agents and Multi-Agent Technology. PAAM 2000. ISBN: 1 902426 07 X. Manchester, U.K.
[3]	Fraunhofer IGD. Project http://www.infonnatik.uni-stuttgart.de/ipvr/vs/proiekte/mole/mal/preview/SeMo A-CSecure-Mobile-Agentsl.9656.txt.html
[4]	Wilhelm Uwe G., Staamann Sebastian. Protecting the Itinerary of Mobile Agents. Anales del ECOOP Workshop on Distributed Object Security and 4"' Workshop on Mobile Object Systems. Julio 21-22 1998. Belgium.
[5]	Hohl Fritz. An approach to solve the problem of malicious hosts in mobile agent systems. Institute of parallel and distributed systems. University of Stuttgart, Germany. 1997.
[6]	Sander Thomas, Tshudin Christian. Protecting Mobile Agents against Malicious Host. Lecture Notes in Computer Science (LNCS), SpringerVerlag, New York, USA, 1419, June 1998.
[7]	Sander Thomas, Tshudin Christian. Towards mobile cryptography. International Computer Science
Evaluation of Technologies for Business Process Automation
Maja Pušnik, Matjaž B, Jurič and Ivan Rozman
University of Maribor, Faculty of Electrical Engineering, Computer and Information Science, Institute of Informatics, Smetanova 17, 2000 Maribor E-mail: maja.pusnik@uni-mb.si
Keywords: business process automation technologies, decision models, ebXML, XLANG, RosettaNet Received: June 2, 2002
The importance ofprocess automation for B2B (business to business) collaboration is rising. The efforts are directed towards automating business processes and forming a global electronic market. In this paper we present and evaluate the three most important technologies for business process automation: ebXML (Electronic Business XML- extensible Markup Language), RosettaNet and XLANG. They differ in terms of features, quality and serviceability. We analyze, compare and evaluate those technologies from the perspective of SME (small and medium enterprises). Based on the comparison we define a multi-criteria decision model with twenty parameters and the corresponding weights, we evaluate the alternatives and define a utility function, which helps us to select the most suitable technology. The contributions of this paper are the in-depth evaluation of technologies and the definition of a multi-criteria decision model.
1 Introduction
The well-known fact is that business must be altered to survive the upcoming changes and progress. To make the idea of a global marketplace and B2B work, proper technologies, which will assure safety and efficiency, must be created. They have to be appropriate for all kinds of enterprises, small and large, for those with great financial recourses and responsibilities and for those with limited budgets. Only with such universal technologies, a global market and complete serviceability will be realized.
In the paper we will review and compare the three most important technologies for business process automation: ebXML, XLANG and RosettaNet. We will define criteria for their evaluation and build a decision model with twenty criteria. We will evaluate the results and choose the most suitable technology from the perspective of a SME. All three technologies are based on XML and build on the functionality of web services, where they reuse existing web service technologies, such as SOAP (Simple Object Access Protocol), UDDI (Universal Description, Discovery and Integration) and WSDL (Web Service Definition Language). We will see that they differ in some features while in others they are complementary. Because they are based on open standards, they are reachable in aspects of price and complexity, not only to large enterprises, but also to small and medium enterprises.
The review of related research has shown that there are not many similar analyses. The comparison made in [6] only compares ebXML and RosettaNet in an informal
way and does not define a decision model. The author in [20] compares B2B standards, which include RosettaNet, ebXML, OAGIS (Open Applications Group Integration Specification) and Simple Web Services. The same author explains in a different article [21] how RosettaNet, ebXML, OAGIS and EDI (Electronic Data Interchange) fit together. However the author does not define a formal decision model. In [22] the author again compares RosettaNet, ebXML, OAGIS, Web Services, xCBL (UBL) - XML Common Business Library (Universal Business Language) and cXML (commerce XML) and creates a comparison framework.
Our paper is organized in the following order: the needs of the market are evaluated in the second chapter. Third chapter makes a comparison of the ebXML, RosettaNet and XLANG. Fourth chapter defines a multi-criteria decision model and evaluates them. The last, fifth chapter, gives a conclusion of the results.
2 Needs of the Market
The way enterprises work, understand their existence and survive must be retained. But the way they do business and communicate with each other must be improved. So business processes must still work on and through the net, just as they have manually. Technologies must describe business processes in a consistent and safe way and more. They must enable changes, upgrades and adaptations, since business is a living process and therefore must be flexible and manageable.
But this is only the first step. There is still the question of automation. A business process consists of many steps and includes many people, some of them completely unnecessary, which only enlarges the possibility of making a mistake. One of the goals in creating a global electronic market is to automate everything that can be automated, including routine work or explicitly defined processes with long-term rules and foreseen conditions.
Some solutions have already been created in the past, more or less successfully, but by far not sufficient enough for goals and ambitions of the millennium. The web services have only created an initiation of what is yet to come. They enabled process describing, but not automation. The ultimate goal of those technologies is making business as safe and as accessible as possible for all businesses all over the world [19].
3 Comparison of Technologies
There are several technologies for coordination and automation of business processes. Some of them have been present on the market for quite a long time, for example EDI. But the problem with vintage ones is inaccessibility for smaller enterprises and an obvious inflexibility, since most of them require a large initial investment and expensive support. The up to date technologies build upon legacy technologies, which have used older proprietary standards.
Contracts / agreements				ebXMLCPA Collaborafion Protocol Agreement	
Private process	XLANG	WSFL Web Service Flow Language	BPML Business Process Markup Language		
Public collaborative process			WSCL Conversation Language	(	sbXML BPSS
End point description		WSEL WS, ; Endpòint Language		uusmess Process Specification Schema ebXML CPP Collaboration Protocot Profile	
Service description and transport binding	WSDL Web Service Description language				
Figure 1: Process Coordination Framework [1]
The need for different kind of technologies has increased. Modem technologies are mutually connected and complemented. Figure 1 presents their relationships, horizontally divided by the level of provided services and vertically by the initiative organizatioti, by which they were sponsored and created [1]. All of the technologies are an upgrade of web services and they are all based on the XML language. Their design priorities and fields of concentration however differ.
Service description and transport binding was assured in WSDL and in ebXML CPP (Collaboration Protocol Profile) as the EDI follower. With time, new
technologies emerged from them in different directions. Their relations are seen from Figure 1: WSEL (Web Service Endpoint Language), ebXML BPSS (Business Process Specification Schema), WSCL (Web Service Conversation Language), WSFL (Web Service Flow Language), XLANG, BMPL (Business Management Markup Language), RosettaNet PIPs (Partner Interface Process) and ebXML CPA (Collaboration Protocol Agreement) []].
WSDL is meant for describing network services as a set of endpoints, operating on messages. WSEL is meant for non-operational features of web services like security. WSCL allows that we define abstract interfaces for web services for business process conversation. WSFL allows the description of business processes or interaction patterns, based on the web services operations.
ebXML provides a set of technologies for describing various stages of business collaboration. CPPs enable companies to specify their profiles in which they define the terms for collaboration. CPAs are the computer equivalents of trading partner agreements. They can be defined manually or automatically generated from two or more CPPs. The actual flow of a business process is specified using BPSS. Shared public and private business processes for collaboration between two or more partners are specified using BPML. The focus of XLANG and RosettaNet PIPs is similar to BPML and will be further discussed later in this article.
XML, concentrated on the contents, enables remote systems to interchange and interpret the documents without the human intervention. XML document is basically an ordinary text file with markup [1], The combination of structure, flexibility and verification makes XML useful not only for electronic publishing, but also for designing business messages, exchanged between enterprises [1]. While building larger processes, all business partners must agree upon the vocabulary, interfaces and the type of method invocation, before they send individual messages.
XML vocabularies can define all kinds of business documents or even whole frameworks, which provides interoperability and functionality.
3.1 ebXML
ebXML is a family of specifications that enable companies of all sizes to collaborate with each other, independently of the location [2], through the exchange of XML-based messages [8]. Development of the ebXML specifications is an on-going effort sponsored by OASIS (Organization for the Advancement of Structured Information) and UN/CEFACT (United Nations Center For Trade Facilitation & Electronic Business) [8].
The need for ebXML lies in the experience from the past. EDI, the anterior technology for data interchange among enterprises, was unreachable for most SMEs, since the
costs were too high and the implementation too complex. ebXML is based on XML, web services and open standards and is publicly available. It overcomes this barrier and enables the creation of software for building applications, based on mutual structure and syntax, which will lower the costs of business data interchange. ebXML mission is to provide an open XML-based infrastructure, enabling the global use of electronic business information in an interoperable, secure and consistent manner by all parties [8],
ebXML architecture was primarily designed for B2B interaction. UDDI and SOAP offer services with similar functionality on the low level. EbXML uses and builds upon these standards. It provides safe and reliable messaging and adds a set of higher level specifications for expressing the semantics of B2B collaborations. For these purposes it provides CPPs, CPAs, BPSS, core components, registry/repository and BPML [11].
ebXML provides an effective platform for long-term business transactions and enables us to express the following:
quality of service, timeouts, conformations, multi-language support, authentication, authorization, privacy, integrity and non-repudiation.
Example of ebXML usage, shown in Figure 2:
^xHO'
[SoftWa*

Request ebXML specifications
COMPANY X
ebXML specifications detail Register company business profile.
cuiiipmiy uuaiiic^a piuiiit,
f^^ scenarios and implementation details uild local i-r /stem implementation
system	Confirm profile and scenarios accepted
COMPANY Y
Figure 2: ebXML in practice
By using ebXML, companies have a standard method to exchange business messages, conduct trading relationships, communicate through data in common terms, define and register business processes [8]. it enables all parties to complement and extend current EC/EDI (electronic commerce/EDI) investment and it expands electronic business to new and existing partners.
It also facilitates convergence of current and emerging XML efforts [8].
ebXML delivers the value by [8]:
using the strengths of OASIS and UN/CEFACT to ensure a global, open process, developing technical specifications for the open ebXML infrastructure,
creating the technical specifications with the world's best experts,
collaborating with other initiatives and standard development organizations,
building on the experience and strength of existing EDI knowledge,
enlisting industry leaders to participate and adopt ebXML infrastructure and
realizing the commitment by ebXML participants to implement the ebXML technical specifications.
3.2 XLANG
XLANG is a notation for the specification of message exchange behavior among participating web services, supporting especially the automation of business processes [9], It is expected to serve as the basis for automated protocol engines that can track the state of process instances and help to enforce protocol correctness in message flows.
XLANG is based on XML and is used for describing business processes in the BizTalk inifiative. It offers a model for orchestration of services and contract collaboration between partners [3]. XLANG is fully focused on public processes. It supports long-term operations and nesting. It enables: exception handling, restoring operations,, behavior, actions, control flow, correlations, contents of transaction, service management, time-outs,
custom correlation of messages, modular behavior description and contracts with multiple roles [3].
However, it does not define authentication or the quality of service nor the non-repudiation [4]. The goal of XLANG is to make it possible to formally specify business processes as state-full long-running interactions [9].
Main features of XLANG include [1]:
•	behavior; container for the description of the service's behavioral aspects, including support for looping, concurrency and exception handling,
•	actions; atoms of behavior, referencing WSDL operations on available ports.
376 Informatica 26 (2002) 373-380
M. Pušnik et al.
•	control flow; iequence in which the service performs actions,
•	correlations: structure, the service uses to route messages to correct workflow instances,
•	context; context for long-running transactions,
•	service management; features of service instance management and
•	port mapping; method for plugging in the service user and the service provider.
XLANG is an extension of WSDL and dynamics in processes are supported with different flows [3]:
1.	Message flow, where actions are the basic constituents of an XLANG process definition that specifies the behavior of the service. The actions are request/response, solicit response, one way, notification, timeouts and exceptions.
2.	Data flow, the base of XLANG is fed by the message flow and supports the control flow decisions.
3.	Control flow, which provides support for looping, besides the regular elements. It also enables exception handling and transactional behavior.
XLANG also supports business process contracts, however they are merely mappings between two port types, which interact together. A contract can only map ports that are unidirectional [3].
The unit of action, offered by a service is an operation. An operation can be a single asynchronous message, or a request/response pair of messages with optional fault messages. The operation can be either incoming or outgoing. But WSDL does not say what is the operation semantics. There are three possibilities [17]:
1.	In the first case the operation is a stateless service that has no memory of previous operations, such as a stock quote service.
2.	The second possibility is an operation on an object, in the usual sense of object-oriented programming systems, in which case the object will have the ability to use its state variables to keep a record of the consequences of previous operations. In the latter case, we usually think of the object as being subservient to the caller, since the caller controls the entire life cycle of the object. The object itself has low influence regarding the order in which its operations are invoked and no independent behavior.
3.	The third possibility is autonomous agents with fitll state representation of the service. In this case the service supports long-term interactions with full state, in which every interaction has a beginning, defined protocol for operation call and the ending. The supplier has to provide a service, which starts an interaction by receiving an order through the entering message, then returns the acknowledgement to the buyer, if the order can be accomplished.
Enterprise workflow systems today support the definition, execution and monitoring of long-running
processes that coordinate the activities of multiple business applications. But they do not separate internal implementation from external protocol description [9],
U

Template
RFQ
Loss notice
e -market
Template __
RFQ
Order
Quota
Order change
Signature number
Order number
Signature
Bill

Figure 3: XLANG connecting two parties [9]
The Figure 3 represents the dynamics between two participants inside an electronic market, where XLANG is the translating key between a buyer and supplier that cooperate on the net using the advantages of the electronic market.
3.3 RosettaNet
RosettaNet is a non-profit consortium of more than 400 of the world's leading Information Technology, Electronic Components, Semiconductor Manufacturing and Solution Provider companies, working to create, implement and promote open electronic business process standards [7],
RosettaNet was created as a compromise between EDI and SOAP. Its main goals are reaching dynamic, flexible trading networks, operational efficiency and new business opportunities [10], It enables: real time complex transitions, checking, confirmation, non-repudiation, multiple languages, additional standards in industry, SSL (Secure Socket Layer) authentication, digital signature and data encoding.
Its biggest advantage is the well defined although inflexible PIP [5], The purpose of every PIP is to offer general business data models and documents, which enable interface implementation by system developers. Every interface includes [14]:
•	XML document, which is based on the DTD (Document Type Definition) and specifies PIP services, transactions and management, which include dictionary properties,
•	class and sequence diagrams in UML,
•	validation tool and
•	implementation guide.
PIP interface offers mechanism for sending messages and reporting failures. It demonstrates the Integration of web services and its safety features, demanded at RosettaNet [19]:
•	two - way SSL authentication,
•	digital signature,
•	data encryption and
•	non-repudiation.
RosettaNet PIP defines an automated business process among trading partners for demanding and offering product prices and availability information [16]. Different business processes are covered with:
1.	RosettaNet executive plan, which offers a general guidance, priorities of addresses and integration through tables.
2.	Individual plan of supply chain, which address of the supply chain - specific theme, prioritization, sources, implementation and adaptation.
3.	RosettaNet partners, which enable voting about standards, participants in workshops and implementation.
RosettaNet standards are managed on a global level. Locally they are focused on implementation and support. So partners can choose between global or local membership [13].
RosettaNet is very rich in its supporting tools: the RosettaNet implementation tool including the current PIP template, a Partner Agreement Wizard for quick importation, development and testing of customized PIP and more. It also contains RosettaNet dictionary and RosettaNet implementation framework. The template enables the development of new PIPs. The Partner Agreement Wizard enables importing of trading partners and a fast development of new processes. Embedded PIP enables implementation of only that certain PIP the partner needs. It includes support for all published RosettaNet PIPs as well as for CIDX (Chemical Industry Data Exchange) and PIDX (Petroleum Industry Data Interchange). PIP can also be tested before actually applied and used.
RosettaNet is also focused on the industry support; the adapter for industry development enables integration with new and existing applications and ways of business [15].
Figure 4: RosettaNet communication
RosettaNet plans to integrate support for the ebXML Messaging Services Specification in future releases of RosettaNet's Implementation Framework (RNIF). While RosettaNet remains committed to developing business process standards, required to support the complex needs of the high-technology industry, it also wants to ensure interoperability across all supply chains. Figure 4 represents the communication between two trading partners with help of RosettaNet PIP - which enables connection of business processes [12].
4 Evaluation Model 4,1 Criteria
To be able to evaluate the technologies for describing business processes for their suitability and quality, we have defined a multi-criteria decision model. We have identified the following criteria [18]:
Defining and describing processes'. Evaluates the architectural support, syntax and semantics for describing all the features of the process and the support for the transition from classical to electronic business from aspects of flexibility, simplicity, user friendliness and compliance to standards.
Collaboration description: Evaluates the support for business interactions and defining relationships between partners, from aspects of flexibility, safety and complexity.
Role model: Evaluates the support with modeling tools for describing roles and collaboration between them.
Small/big/medium enterprises support: Evaluates the appropriateness and flexibility of the technology for different company sizes with different characteristics, needs and preferences.
Complexity and learning effort: Evaluates the amount of effort and change, needed to learn and understand the technology and all its features.
Efficiency: Evaluates how efficient is the technology at describing and specifying the business processes.
Maturity: Evaluates the maturity, based on the number of years the technology exists.
Tools support: Evaluates the support within tools and integrated development environments, which ease the development and assure quality.
Synchronous communication support: Evaluates support for synchronous, short-term transactions, which require immediate answer.
Asynchronous communication support: Evaluates support for asynchronous, long-term transactions.
Independency of communication protocols: Describes the relationship between communication protocol and the technology.
Quality of service: Evaluates the possibilities for specifying service quality of certain flows, which can be done either by raising the priority of a flow or limiting the priority of another flow.
Authentication: Evaluates the level of verification of the senders identity - whether the business message sender is or is not who he claims to be [6].
Authorization: Evaluates the level of verification, whether the sender of a message is permitted to send the subject message to the receiving partner [6].
Integrity: Evaluates, whether the unaltered during transportation [6].
messages remains
Encryption: Evaluates the coding and the level of security of messages against unauthorized readers [6],
Non-repudiation: Evaluates the mechanism for verifying whether an originating trading partner can or cannot deny having originated and sent a message and that a receiving trading partner can or cannot deny having received a message, sent by its partner [6].
Exceptions handling: Evaluates the business preparation for every sort of failures, duplications and losses of data.
Claim detection: Evaluates the preparation and support for events of claim loss.
Data transformation: Evaluates the possibilities, tools and technologies for data transformation between collaborating enterprises.
	Criteria definition	Scale defining
cl	Defining and describing processes	2 - flexible, simple, compliant with standards 1 - simple and user friendly 0 - basic features only
c2	Collaboration description	2-iTiultiple language, flexibility, safety 1 - safety and basic features 0 - basic features
c3	Role model	1 - yes 0-no
c4	Small/big/ medium	2 - big/medium/small 1 - big/medium
	enterprises	0-big
	support	
c5	Complexity and learning effort	2 - simple 1 - moderate effort 0 - great effort
c6	Efficiency	2 - very efficient 1 - averagely efficient 0 - low efficiency
c7	Maturity	Actual number in years
c8	Tools support	2 - many 1 - medium 0-low
c9	Synchronous	1 - yes
	communication	0-no
	support	
clO	Asynchronous	1 - yes
	communication	0-no
	support	
cll	Independency from communication protocols	1 - yes 0-no
cl2	Quality of service	1 - yes 0-no
cl3	Authentication	1 - yes 0-no
cl4	Authorization	1 - yes 0-no
cl5	Integrity	1 - yes 0-no
cl6	Encryption	1 - yes 0-no
cI7	Non-repudiation	1 - yes 0-no
cl8	Exceptions handling	2 - handling message loss, resolution, system recovery 1 - two above 0 - one above
cl9	Claim detection	2 - good I - average 0 - poor
c20	Data	2 - good
	transformation	1 - average 0 - poor
Table I: Criteria and scale
4.2 Utility Function
We have defined the utility function, which organizes the results, for them to be comparable (on scale between 0 and 1). In the case, where input value is an actual number, the utility function transforms it to the closed interval from 0 to 1.
Equation 1: Utility function
' t'MOO 2 ^ Moo

Equation 2: Maximum utility
N
U =max{Uj)
Meaning of the symbols:
•	U - maximum utility,
•	Uj - utility of alternative j,
•	Ci - criterion i (Table 1),
•	Aj -alternative j (ebXML, XLANG, RosettaNet),
•	Wi - weight of criterion i,
•	N - total number of alternatives.
4.3 Results
For the purposes of the evaluation of the technologies in this article we have selected the weights based on the preferences of a SME, where security (authentication, authorization, integrity, encryption and non-repudiation), defining and describing processes, collaboration support, complexity and learning effort, maturity, tools support, data transformation, exception handling and quality of service are particularly important. The selection of the weights is based on the survey, done in [18], The weights can however be. altered according to the needs and priorities of each distinctive business. The Table 11 shows the evaluation of ebXML, RosettaNet and XLANG. It is divided in 5 columns. The first column presents criteria. The second column shows the weights, which we assigned to each criterion. The rest of the columns show evaluations for each technology, using the scale, explained in the third column of Table I. In the last row we show the results calculated using the utility function.
As seen in Table II ebXML has achieved the highest result. It turns out that ebXML is the best technology for most of the businesses.
XLANG is second best, although it lacks the quality of service, authentication and non-repudiation. However, it is integrated within the BizTalk Server Initiative, which is very promising. We believe that it will get improved over time.
RosettaNet is the least appropriate for general SMEs. Its main preference lies in technical features and level of development. Since it is the oldest technology of the three, it is the most mature one. Its main disadvantage is in the fact that it is suitable mainly for very large companies, since its framework PIP is very inflexible, and once created, very difficult to alter thus inappropriate for smaller businesses.
c	w	ebXML	XLANG	RosettaNet
cl	9	2	2	0
c2	8	2	1	
c3	2	1	0	
c4	10	2	2	
c5	9	1	1	
c6	7	2	1	
c7	8	2	2	
c8	2	1	1	
c9	2	1	1	
clO	2	1	1	
cll	2	1	0	
cl2	6	1	0	
cl3	3	1	1	
cl4	3	1	1	
cl5	5	1	1	
cl6	3	1	1	
cl7	4	1	1	
cl8	6	2	2	
cl9	2	2	1	
c20	7	2	2	
		0,735	0,606	0,498
Table II: Evaluation matrix and results
5 Conclusions
The need to do business on the net and to automate business processes is increasing, as is the need for supporting technologies. Such technologies must satisfy certain standards, they must be flexible and available to all organizations, large but particularly to small and medium enterprises. Describing business processes must be relatively simple, so that even non-programmers can use it, since the business process experts usually do not have the necessary knowledge, needed to work with complex languages.
In the article we have identified, compared and evaluated the features of the three most important technologies and upon our findings defined a multi-criteria decision model for their quantitative evaluation. The defined decision model is usable for all kinds of enterprises. They can express their priorities through criteria weights. For the purposes of this article we have also defined a common set of weights for small and medium enterprises and done the evaluation of the technologies. From this perspective
we have determined that ebXML technology is the most suitable with the widest range of possibilities, followed by XLANG and RosettaNet.
References
[1]	Selim Aissi, Fallavi Malu, Krishnamurthy Srinivasan (May 2002), E-Business Process Modeling: The Next Big Step, IEEE Computer, pp. 55-62.
[2]	Alan Kotok, David R.R. Webber (2002), The new global standard for doing business over the Internet ebXML, New Riders Publishing.
[3]	EbPML.org (2002), XLANG, http://www.ebpml.org/xlang.htm.
[4]	David O'Riordan (April lO"' 2002), Business Process Standards for Web Services, The candidates. Services Business Strategies and Architectures, http://www.webservicesarchitect.com/content/ai'ticle s/oriordanOl.asp
[5]	Joe McKee (May/June 2002), RosettaNet at the Dance-an e-business standard does its own choreograph^', Oracle technology network, pp. 5152.
[6]	Pekka Rantola, Janne J. Korhonen (15.5.2002), RosettaNet vs. ebXML-Security Solutions and exception handling, Helsinki University of Technology, http://www.soberit.hut.fi/T-86/T-86.161/2002/RosettaNet
[7]	Vitria (February 26* 2002), RosettaNet E-Business Process Standards for the High-Tech Industry, Vitria Technology, Inc.,
http://www.vitria.com/news/press_releases/pr 2002-02-26.html.
[8]	EbXML, Technical specifications, http://www.ebxml.org/.
[9] Cover Pages hosted by Oasis (June 6 XLANG, Technology Reports, http://xml.coverpages.org/xlanp.html.
2001),
[10] RosettaNet Overview, Background Information, http://www.rosettanet.org/RosettaNet/Rooms/Displa vPages/Layoutlnitial.
[1 l]Madhu Siddalingaiah (August 17"' 2001), Overview of ebXML,, Technical overviews, SUN Microsystems,
http://dcb.sun.com/practices/webservices/overviews/ overviewebxml.isp
[12]	Arsin Corporation (2002), Solution Integration Services, Arsin RosettaNet PIP Solution, http://www.arsin.com/docs/RNTFactSheet_final.pdf
[13]	Andy Moir (April 2002), Introduction to RosettaNet, XML.gov, http://xml.gov/presentations/rosettanet.
[14]	Cover Pages hosted by Oasis (November 2002), RosettaNet, Technology report, http://xml.coverpages.org/rosettaNet.htm.
[15]	Microsoft BizTalk Server (2002), BizTalk Accelerator for RosettaNet Features, Microsoft, http://www.microsoft.com/biztalk/evaluation/feature s/rosettanet.asp.
[16]Bea	Web-Logic Integration (2002), RosettaNet 2.0 Security Sample, Bea,
http://edocs.bea.eom/wli/docs70/b2bsampl/rn2sec.ht m .
[17]	XLANG (2001), Web Services for Business Process Design, 200J Microsoft Corporation, http://www.gotdotnet.com/team/xml_wsspecs/xlang-c/default.htm.
[18]	Object Technology Center (2002), Preferences of small and medium businesses, Technical Report, FERI
[19]	Matjaž B. Juric, S. Jeelani Basha, Rick Leander, Ramesh Nagappan (December 2001), Professional J2EE EAI, Wrox Press Ltd.
[20]	Arij it Sengupta (2002), Oracle's support for open eBusiness standards, Oracle corporation, http://www.idealliance.org/papers/xmle02/slides/Sen gupta/sengupta.ppt.
[21]	Arij it Sengupta (2002), Data integration, process integration and trading partner agreements, Oracle corporation,
http://www.edifice.org/ERUG/Sengupta_ERUG.ppt.
[22]Paavo	Kotinurmi (Oktober 22"'' 2002), Comparing XML Based B2B Integration Frameworks, Helsinki University of Technology SoberIT, http://www.soberit.hut.fi/lCTEC/lectures/20021022_ Kotinurmi.pdf.
Visual Secret Sharing Watermarking for Digital Image
Shen-Chuan Tai, Chuen-Ching Wang*, and Chong-Shou Yu Institute of Electrical Engineering, National Cheng Kung University Tainan, Taiwan, R.O.C
Address: Institute of Electrical Engineering (computer / group 92533) National Cheng Kung University, Tainan, 701, Taiwan Email: wcj@rose.ee.ncku.edu.tw
Keywords: Visual Secret Sharing Watermarking (VSSW), Discrete Cosine Transform (DCT), Digital Watermarking, Copyright Protection.
Received: February 23, 2001
A visual secret sharing watermarking (VSSfV) technique is proposed as a way of solving copyright protection problents for digital images. The proposed watermarking technique employs a visual secret sharing (VSS) scheme and separates the watermark into two parts, a public watermark and a secret watermark. For watermarking security, only the public watermark is inserted into the original image, while the owner holds the secret watermark. Without the secret watermark, it is almost impossible to extract the watermark even if the embedding algorithm is published. To meet requirements of robustness and imperceptibility, we modify DCT coefficients belonging to the middle frequency band to embed the public watermark. Importantly, the watermark can be retrieved from the watermarked image without resorting to the original image. Various experiments using the proposed watermarking method are presented to demonstrate robustness to tampering and a to variety of common image processing operations and geometric manipulations.
1 Introduction
Protection of intellectual property is an increasingly important concern as widespread use of the Internet is making multimedia data increasingly easily copied and distributed. Fortunately, digital watermarking techniques allow us to embed copyrights into digital contents and later extract the watermark to detect copyright infringement and confirm legal ownership.
Many watermarking techniques [1 - 12] have been published in the literature. These published techniques utilize either transform domain or spatial domain. In [2], Cox et al. describe a method to embed into the host image a watermark composed of a randomly generated sequence. This scheme applies the full-frame Discrete Cosine Transform (DCT) to the original image, producing a set of coefficients. Then, a subset of these coefficients is chosen according to a rule that depends on the most perceptually significant coefficients of the DCT transform domain. To embed the watermark, they use a scaling modulator to alter the values of these coefficients according to the values of the watermark. Cox's watermarked version is robust to some attacks involving common signal and geometric processing operations. However, there are some drawbacks to this technique. First, extracting the watermark requires the original image for watermark detection. This limitation affects its application on the Internet. Second, the authors use a threshold of similarity measure to determine whether the host image is watermarked or not. In practice, selection of too small or large a threshold will lead to watermark
detection error. Third, modulating the most significant coefficients (excluding the DC term) with a random sequence degrades image quality, and is thus an unreasonable requirement for a general watermarking scheme.
Hsu and Wu [4] proposed a frequency-domain watermarking technique that used fixed block-base DCT transformation. The method first breaks up the host image into 8X8 blocks and then performs the DCT on each block. In the embedding algorithm, they select 16 middle-band coefficients from each block and then modify these coefficients according to the residual mark to reverse the corresponding polarity. After this procedure, the watermark is embedded into the host image to form a watermarked image. Unlike [2], this method makes use of a binary image as the watermark, thereby allowing identification of the extracted watermark by direct use of the unaided human eye. Both human visual recognition and a similarity measurement were used experimentally to verify the efficiency of their watermark extraction method. However, there are some drawbacks to this technique. First, this method cannot overcome certain attacks, for example image rotation and image resampling. Second, for extraction, this scheme also requires the original image to extract the watermark information. For watermarking systems applicable to the Internet, security is a very important concern. The watermarking systems of [2] and [4]
require the originai image for watermark retrieval, making verification complicated, necessitating the original image be shared in a public place or network for ownership verification and thus making these systems unsuitable for Internet application.
In this paper, a new watermarking technique based on VSS scheme for enhancing the watermarking security is presented. The proposed method operates in a fiill-frame DCT domain, which allows a reasonable tradeoff between quality and robustness. More importantly for Internet application, watermark extraction does not require the original image and the original watermark, thus simplifying the watermarking system and allowing the original image and original watermark to be kept secret. Further, because it is dangerous to trust only a single person or organization to manage very important information, the proposed algorithm includes a visual secret sharing scheme (VSS) which shares a secret among a limited number of members.
This paper is organized as follows. Section 2 introduces the basic concept of the VSS as applied in the proposed watermarking system. The watermarking technique itself is described in section 3. Experimental results are shown in section 4. Finally, section 5 presents conclusions.
2 The Basis of Visual Secret Sharing Scheme
VSS is a well-known cipher technique for digital images. Decoding can be performed by the naked eye, with no instrumentation or complex computation. The concept of VSS is derived from [13]. In [14], Naor and Shamir extended this idea to {k, «)-VSS, which is designed to break a shared image into n different shadows. Each single shadow look like random data. The shared image can be recovered easily from k{k<n) shadows or more. That is, k person's permission is required to decode the shared image. More detailed description can be found [13-15].
For simplicity, the (2,2)-VSS scheme is used in the proposed watermarking method. The shared image is divided into 2 shadows that consist of random dots. The mapping relationships of the (2,2)-VSS scheme based on [14] are shown in Table 1. For each pixel in the shared image, two blocks of 2X2 pixels are generated in the corresponding location of the shadow images, one for shadow 1 and the other for shadow 2. If a pixel P, in the shared image is black, then any one of the first six rows in Table 1 for the two 2X2 blocks of shadow images can be selected. If P, is white, any one of the last six rows in Table 1 for the two 2X2 blocks of the shadow images can be selected. The two 2x2 blocks are copied to the corresponding position in shadows 1 and 2, respectively. According to the human visual system, both shadow 1 and shadow 2 now contain 50% white sub-pixels and 50% black sub-pixels and appear like random noise. The human eye cannot read the
secret message in the independent shadow and, thus, the shared secret can be concealed in shadow 1 and shadow 2.
Table 1 The mapping relationship function of (2,2)-VSS scheme based on [14].
Pixel	blockl	block2	blockl vjsua/ ö;?b!ock2
■	PB	»	■
■	e	U	■
■	E	a	■
■	a	E	■
■	B	ffl	■
■	n	U	■
□	E	E	E
□	a	a	a
□	Ey	EB	s
□	e	«	e
□	n	B	n
□	B	U	B
When both shadows are superimposed on each other, the shared secret is clearly visible to the human visual system. The decoder is thus the unaided human eye, which is very similar to an ordinary OR function performed on two sub-pixels. Therefore, if pixel P, in the shared image is black, then the corresponding 2X2 block created by superimposing shadows 1 and 2 will contain four black sub-pixels. On the contrary, if Pj is white, then the corresponding 2X2 block by superimposing shadows 1 and 2 will contain two black sub-pixels and two white sub-pixels. From the viewpoint of the human visual system, the block with two black sub-pixels and two white sub-pixels will be recognized as a white pixel, while the block with four black sub-pixels will be recognized as a black pixel. Consequently, the secret information in the shared image can be easily detected when these shadows are superimposed together by way of a pixel-by-pixel visual OR operation.
3 Watermarking Implementation Process
The proposed watermarking process involves watermark generation, embedding and extracting. During watermark generation, the VSS technique is used to enhance the security of the embedded watermark. In our proposed scheme, only the public watermark is embedded in the image. The copyright owner reserves the secret watermark for reconstructing the watermark. In order to embed the watermark, we first transform the host image as DCT domain and process the coefficients in the middle
band into coefficient pairs. Next, each pixel of the public watermark is inserted into the middle band by modifying the location of the two coefficients in each coefficient pair. Then, inverse DCT transformation is performed and thus a watermarked image is obtained. Finally, the reconstructed public watermark extracted from the watermarked image may be superimposed on the secret watermark to recover the watermark.
Consider a grayscale image H of size N X N pixels. Also, let W be a. binary image of size k X I pixels that we will use as a watermark. We want to embed an invisible watermark into H to form a watermarked image Hw Host image, watermark and watermarked image can be denoted by:
H =	< p,o < N}h(p,o)e {0,1,...255} (1)
W = j\0 <i<k,Q<j< /}>v(/, j)s {0,255} (2) <Af}/i'(yO,ö)e{0,l,...,255} (3)
3.1 Watermark Generation
As described in section 2, the VSS technique can partition a shared image into two or more shadows. For convenience, we set the watermark size at 50 by 50. Watermark W is broken down into two sub-images, the public watermark Wp and the secret watermark Ws-Each sub-image with size of lOOx 100 sub-pixels is just a shadow of the (2,2)-VSS scheme. Each pixel in the original watermark is now represented as the ORed result of two 2x2 blocks, each from the corresponding position of Wi> and Ws, respectively. In the proposed method, the owner randomly assigns the secret watermark. After defining the secret watermark, the public watermark can be generated by using the relationships based on (2,2)-VSS scheme and listed in Table 1. For each pixel Pfin W and each block ßs-in IVs, the block Bi> in Wp can be derived as:
^ if P,= black	^^^

Bs if P, = white
where Bg stands for inverting each sub-pixel in the block Bs. For example, if P^ is a black pixel in W and
ri 0"
the corresponding block Bs in (^v is ^ ^ corresponding block Bp in Wp is the complement of
In accordance with the
then the
"1 0"		"0 r
	, that is	
0 1		_1 0
ongoing example, the public watermark is finally obtained by collecting each block Bp. It can be expressed as:
Wp = {vp{a,ß)^ {0,255}0<a < 2kfi<ß < 2l\ (5)
3.2 Watermark Embedding
3.2.1 DCT Transformation of the Image
In this approach, each pixel of the public watermark is embedded into the middle-frequency area of the host image. The block-diagram of watermark embedding is shown in Fig. 1. To spread the energy of the host image, we use full-frame DCT instead of block DCT transform, after which the watermark information can be embedded into the transform domain. When the watermark information is dispersed over the entire spatial image, then the watermark can easily survive common image processing. The transformation is:
h(u,v) = C{u)C{v)Y, X/!(P,^T)CO:
p=0
{lp+\)u7t'
IN
co;
\2(y+\)vn'
IN
(6)
C{u) =

u=0
otherwise
where H = {h{u,v)e R, 0 < u,v < N} and R is a real number.
Frequ
ncy Domain
Extract Mid-frequcncy Public Coefficient Watermark
Embedding
Secret Watermar
Watermarked Imaee
Figure 1: Block-diagram of embedding procedure.
/	1	H'	. 1 5 1 6 1 7 1 8 1 - 1 71 72	
/\ 7J	74	■J5 j 76	TJ 1 71 1 79 go I —	"i" /
/H'"	,.1	1« 1 1«	ISO 1 I5B 1 is: — 1 213	.4/
A
x+y = nil Middle Band : m, < x+y < m^ x+y = m,
y
Figure 2: Middle-band determined by two lines.
384 Informatica 26 (2002) 381-388
S.-C. Tai et al.
3.2.2	Selection of the Middle-band Coefficients
In fact, the robustness and invisibility are conflict each other. To be a reasonable tradeoff, we select middle band components for embedding the watermark. As seen in Fig. 2, the two lines
x + y = m^ and x + y = m2 set the boundaries for the region selected for hiding the watermark. That is, the middle band is selected by:
trij < X + y < m2	(7)
Here, x, y is the position of each coefficient in the host image, and m t and m2 represent intercepts that are located on the x and y axis respectively. These parameters, i.e. m/ and can be adaptively selected to control the size of the middle band for embedding watermark.
3.2.3	Processing the Rank of the DCT Coefficient Pair
As mentioned above, the watermark information can be embedded in middle band, the watermark information, fVp, is added into the middle band as follows.
Step 1. Process the coefficients of the middle band to form a one-dimensional sequence by row major scanning. That is, C = { C],C2—,C2^2kx2i )■ Step 2. Process the Wp as a one-dimensional sequence by row major scanning. Eq. (5) will be written as:
y/ = {ör(y ) s {0,255 }0 < )• < 4W } (8) where ©()•) = Wp(a,ß) with y =ax2l + ß
Step 3. For i = l,2...,2kx2l, pack the coefficient,^,.,, and the neighboring coefficient, €21, into an /-th coefficient pair, CPi= {€21.1, C^,). Step 4. Select a random number as a secret key, S, to generate the 2kx2l different random sequence over the interval [I, 2kx 21], That is,
where ä, denotes the i-th random
number.
Step 5. Define RANK:

1 ^/(C2,_,>C2,) 0 //(C2,_,<C2,)
(9)
Step 6. For i =1,2...,2k x 21, embed the public watermark into the middle band by modifying all coefficient -pairs according to the following rule. { w{Ri)®RANK(C2,,„C2i) = \ } Then { swap the coefficients of C^,.; and €2, } Else { no operation } where w (.) represents a gray level pixel of watermark ; here we suppose that gray level 255 and 0 denote logic 0 and logic 1, respectively.
3.2.4 Inverse DCT Transformation
As mentioned in the previous section, some coefficients in the middle-band will change their position. Indeed, this process is equivalent to the watermark embedding operation. To obtain a watermarked image, the Inverse Discrete Cosine Transform is used to transfer the frequency domain. The watermarked image is:

(2/0 + \)u7t
IN
(2(7 + l)v;z-
2N
(10)
3.3 Extracting Watermark Procedures
In the extraction procedure, reconstructing watermark is really quite simple. The proposed technique does not use the original image to detect the watermark. The extraction procedure is shown in Fig. 3. First, we use the DCT transform on the watermarked image and, thus, H^ = DCT( H^ ) is obtained. Next, the coefficients of the middle band are selected as paragraph 3.2.2. Then, the coefficients are expanded and packed into coefficients by the same method as the watermark embedding process. After that, the secret key, S, is applied to generate the predefined random number and, thus, each pixel of the public watermark, according to this predefined random order, can be retrieved from the watermarked image. The procedure for extracting the reconstructed watermark Wp is briefly described as follows: Step I. By row major scanning, we form the coefficients of the middle band in as a one-dimension form, C' = {C[,C'2.:,C'2yak>ai }• Step 2. For / = 1,2...,2k x 21, pack the
coefficient, Cj,-)
and the neighboring
coefficient, Cj, , into an /'-th coefficient pair, CP, =
Step 3. Use the secret key, S, to generate the predefined random number as a set, that is R =
where Ä, denotes the ;-th
random number.
Step 4. According to R, extract the ;-th pixel of the watermark by judging the /?, coefficient pair as the following equation:

'/^21 >C2i+\
(11)
where Wp(/) stands for the /-th pixel in the Wp and
"1" and "0" represent gray-level 0 and gray-level 255, respectively.
Step 5. Assemble all the pixels to obtain a reconstructed public watermark Wp .
Step 6. Superimpose the reconstructed public watermark ( iVj, ) on the secret watermark ( Wg ) to obtain the reconstructed watermark {W ).
Secret W aterm ark
Frequency Domain
Extract M id-frcqucncy Coefficient
visual OR
Reconstructed public watermark
Reconstructed Watermark
Figure 3: Block-diagram of extraction procedure.
3.4 Reduction Process
As mentioned section 3.3, the reconstructed watermark can be directly used for identifying the ownership protection. To improve the clarity of W , we use a post-process called the "reduction process" to reduce the redundancy of data caused by VSS scheme. Indeed, this process is a quite simple lookup table (Fig. 4) which performs the reduction process by direct mapping. That is, a block data with four subpixels located in each group will be transferred into a corresponding pixel. This means that each block data in group A will be mapped into a black pixel (gray-level 0), and a block data with four white sub-pixels in group C will be mapped into a white pixel (gray-level 255). Especially, each block data which contains one black sub-pixel and three white sub-pixels located in group B will be assigned to a gray pixel (gray-level here is 125). Suppose that the four inputs, a,, 02 > «3, «4. in the block of each group represent either white or black gray-level, and the 3 outputs, 00, 10, 11, represent black, gray and white pixels respectively. Then, these 16 possible states in Fig.4 can be further mapped into 3 possible states. Referring to Fig. 4, let /,/2 be the output bits, which is controlled by a,, Oj , 03, «4 . Then /^/2 may be expressed as :
f,(ai,a2,a},a4)=a,a2a3+a2aia4+aia3a4+aja2a4 (12) f2(a,,a2,a3,a4)=a,a2a3a4	(13)
Via the reduction process, the reconstructed watermark is reduced to the same size as the original watermark. Also, the reduced version of reconstructed watermark is more visible to human vision. Comparison of with-reduction and without-reduction is shown in Fig. 5. Since the reduction process mitigates noise effects caused by common image operations, the with-reduction result shown in Fig. 5(b) yields superior quality relative to the without-
group A
acB NEsa SBB
group
group C
fl-bU	ffl
	
gray-level 0
B
gray-level 125
□
gray-level 255
Black Pixel
Gray Pixel
White Pixel
reduction result of Fig. 5(a). Figure 4: Reduction process lookup table for 4:1 data reduction rate.
(b)
Figure 5: Performance test for reduction process: (a) original reconstructed watermark; (b) reduced version of (a).
4 Simulations Results
To demonstrate the performance of the proposed scheme, watermarking is performed on the "Lena," "Baboon" and "Airplane" standard images, and the watermarked images are subjected to robustness and quality testing. The employed images are of size 512X512 pixels. The original watermark is of size 50 X 50 pixels. In the robustness test, comparison between extracted watermark and original watermark is made by unaided human vision. Further, we define Detection Rate {DE) as a quantitative measurement for evaluating extraction fidelity. If nxl is the size of the original watermark,
S.-C. Tai et al.
then DR can be expressed as

«X/
(14)
where

I
(15)
if >»'Uy) = w('>y)
'2= '/	= gray pixel
0, //
Here, w{i,j) represents each pixel of the original watermark and M>'{i,j) represents each pixel of the reconstructed watermark. Gray pixels are given a 50% hit ratio when computing the Detection Rate. The quality of a watermarked image is estimated by using peak-to-peak signal-to-noise ratio (PSNR), expressed by:
255^
7SE iKP,<y)-h,,{p,cj)f (16)
' p = 0<T=0
where h(p,o ), h„{p,o) represent each pixel of the host image and the watermarked image respectively.
4.1	Watermarked Image Quality
"Lena", "Baboon" and "Airplane" images, each of size 512X512 pixels with 256 gray levels, were used for testing. Figs. 6(a), 6(c) and 6(e) show the original images. Figs. 6(b), 6(d) and 6(f) show the watermarked images embedded with the public watermark. The PSNR for Figs. 6(b), 6(d) and 6(f) are 38.66 dB, 29.46 dB and 37.49 dB respectively, showing that these watermarked images retain reasonable quality.
4.2	Attack Testing
To test the robustness of the proposed watermarking scheme, we apply typical attacks such as common image processing and geometric manipulations. These attacks are performed using the commercial image-processing tool, Photoshop 5.0.
4.2.1	Lossy Compression with JPEG
JPEG lossy compression is a standard for still images. We applied it to the watermarked image to simulate an attack. Table 2 shows some experimental results including extracted watermarks, image quality and Detection Rate at different compression ratios. It can be seen that the proposed watermark can withstand JPEG attack at approximately CR=10, i.e. the reconstructed watermark can still visually identify ownership rights.
4.2.2	Filtering Operation with Blurring and Sharpening
In a poor transmission system, many image operations such as blurring and sharpening operations are used to enhance subjective quality. The coefficients used for the
blurring filter and the sharpen filter are — x
1 1 1 1 1 1 1 1 1
and —X 6
0 -1 0
-1 0 10 -1 -1 0
, respectively. Figure 7(a) shows
a blurred version of the watermarked image, and Fig. 7(b) shows the result from a version of the watermarked image blurred with a low-pass filter. Similarly, Fig. 8 shows the result obtained from a version of watermarked image sharpened with high-pass filter.
(e)	(f)
Figure 6: Invisibility test for VSSW: (a), (c) and
(e) are original images; (b), (d) and (f) are
watermarked images; quality of (b), (d) and (f)
are 38.66 dB, 29.46 dB and 37.49 dB
respectively.
Table 2 Watermarked image Fig. 6(b) tested by JPEG attack.
Extracted image
m

Ratio of JPEG	CR=1	CR=6.04	CR=8.02	CR=10.24	CR=12.00	CR=14.09
Watermarked quality(dB)	38.65	34.65	33.85	33.37	32.77	32.60
Detection Rate	100%	88.72 %	82.59 %	80.69 %	76.97 %	71.37%
(b)
Figure 7: Robustness test against blurring filtering for VSSW: (a) blurred image;(b) extracted watermark.
(b)
Figure 8: Robustness test against sharpening filtering for VSSW: (a) sharpened image; (b) extracted watermark.
4.2.3 Cropping Attack
Fig. 9(a) shows a cropped version of the watermarked image of Lena. The watermarked image is clipped into a quarter of the original image size. Under these conditions, however, we can still clearly extract the watermark from the watermarked image. The result is shown in Fig. 9(b). Fig, 10(a) shows the interesting case where part of watermarked image is cut off by an image editor. However, the watermark shown in Fig. 10(b) can still be detected from the cropped version of the watermarked image.

(b)
Figure 9: Robustness test against image cropping for VSSW: (a) cropped image; (b) extracted watermark.
(a)
(b)
Figure 10: Robustness test against image cropping for VSSW: (a) cropped image; (b) extracted watermark.
4.2.4 Rotation Attack
Fig. 11 (a) shows a rotated version of the watermarked image. The watermarked image is first rotated two degrees by a rotation operation. Fig. 11(b) shows that despite the rotation, the extracted watermark is still acceptable to the human eye.
(b)
Figure 11: Robustness test against image rotation for VSSW: (a) rotated image; (b) extracted watermark.
4.2.5 Rescaling Operation Attack
First, the watermarked image is reduced by a 2:1 resizing operation. Next, the image is enlarged to restore the image size. The result shown in Fig. 12(b) is extracted from the restored version of Fig. 12(a). Also, a 1:1.1 resizing operation is applied to the watermarked image, and the result is shown in Fig. 13. Again, the extracted watermark is acceptable.
(a)	(b)
Figure 12: Robustness test against image rescaling (2:1) for VSSW: (a) rescaled image; (b) extracted watermark.
(a)	(b)
Figure 13: Robustness test against image rescaling (1:1.1) for VSSW: (a) rescaled image; (b) extracted watermaric.
5 Conclusions
A secure and etficient watermarking system based on a VSS scheme has been proposed. The proposed system can embed watermark information into an image while maintaining good image quality. Without a valid secret watermark, it is virtually impossible to retrieve the watermark. Under this watermarking scheme, checking image ownership is only possible for someone who has possession of the correct secret watermark. This matches the cryptographic standard that a cryptosystem should be secure if someone knows the cryptographic algorithm used but does not have the appropriate key. The proposed technique thus achieves highly security. Furthermore, it is achieved by modification of only the coefficients in the middle band on full-frame DCT domain. Unlike some watermarking techniques, the proposed approach does not require an original image to extract the watermark. Thus, the proposed watermarking scheme may be applied easily in networks such as the Internet. Also, the proposed method can withstand various signal processing attacks, including lossy compression, sharpen filtering, blur filtering and image cropping. Especially, it achieves robustness with respect to the image rotation and image rescaling. For future work, we intend to extend the proposed process to compressed images by an advanced algorithm that will allow concurrent image compression and image watermarking.
References
[1]G.	W. Braudaway, K. A. Magerlein & F. Mintzer,
(1996),	"Protecting publicly-available images with a visible image watermark," Proc. of SPIE, vol. 2659, pp. 126-133.
[2]I.	J. Cox, J. Kilian, F. T. Leighton & T. Shmoon,
(1997),	"Secure spread spectrum watermarking for multimedia," IEEE Trans, on Image Processing, vol. 6(12), pp. 1673-1687.
[3]C.	I. Podilchunk & W. Zeng, (May 1998), "Image-adaptive watermarking using visual models," IEEE J. Sel. Areas in Comm., vol. 16(4), pp. 525-539.
[4]C.	T Hsu & J. L. Wu, (Sep. 1996), "Hidden signatures in images," Proc. of ICIP'96, vol. 3, pp.743-746.
[5]F	Härtung and M. Kutter, (Jul. 1999), "Multimedia watermarking techniques," Proc. of the IEEE, vol. 87(7), pp. 1079-1107.
[6]D.	Kundur & D. Hatzinakos, (1998), "Improved robust watermarking through attack characterization," OPTICS EXPRESS, vol. 3(12), pp.485-490.
[7]K.	Hara, T. Shimomura & T. Hasegawa, ( 1988), "An improved method of embedding data into pictures by modulo masking," IEEE Trans, on Comm., vol. 36(3), pp. 315-331.
[8]C.	T Hsu & J. L. Wu, (Aug. 1998), "Multiresolution watermarking for digital images," IEEE Trans, on Circuits and Syst.-II Analog and Digital Signal Processing, vol. 45(8), pp.1097-1101.
[9]R.	Ohbuchi, H. Masuda & M. Aono, (1998), "Watermarking three-dimensional polygonal models through geometric and topological modifications, " IEEE J. on Sel. Areas in Comm., vol. 16(4), pp.551-560.
[10]	M. Maes, T. Kalkerr, J. Haitsma and G. Depovere, (1999), "Exploiting shift invariance to obtain a high payload in digital image watermarking," IEEE Int. Conf. on Multimedia compu. andSyst.,vo\. 1, pp.7-12.
[11]	J. Meng & S. F. Chang, (1998), "Embedding visible video watermarks in the compressed àomdàn," The Proc. ofICIP,vo\. 1, pp. 474-477.
[12]	D. Anand & U.C. Niranjan, (1998), "Watermarking medical images with patient information," Proc. of 20th Annual Int. Conf. of the IEEE Engineering in Medicine and Biology Society, vol. 20, pp. 703-706.
[13]	A. Shamir, (1979), "How to share a secret," Communications of the ACM, vol. 22, pp. 612613.
[14]	N. Naor & A. Shamir, (1995), "Visual cryptography," Advances in Crypt ology Eurocrypt'94, Springer-Verlag, Berlin, pp. 1-12.
[15]	B. Schneider, (1996), "Applied cryptography," New York, Wiley.
Artificial Neural Networks Based Systems for Recognition of Genomic Signals and Regions: A Review
Vladimir B. Bajic''^, Suisheng Tang', Hao Han' and Vladimir Brusic',
1/ BioDiscovery Group, Laboratories for Information Technology, 21 Heng Mui Keng Terrace, Singapore 119613, Tel: +65-6874-8800; fax: +65-6774-8056, e-mail: bajicv@lit.a-star.edu.sg
2/ South African National Bioinformatics Institute (SANBI), University of the Western Cape, Private Bag XI7,
Bellville 7535, South Africa
AND
Artemis G. Hatzigeorgiou
Department of Genetics, Department of Engineering and Applied Science, Center for Bioinformatics, University of Pennsylvania, Philadelphia, USA.
Keywords: Artificial Neural Networks, Bioinformatics, Genomics Received: April 18, 2002
In this review we present a number of important applications in computational genomics of a class of intelligent systems, namely artificial neural networks (ANNs). We present the current state-of-the-art solutions used in recognition of different genomic signals and regions. All systems to be commented are based fully or in part on the ANNs. We included systems that recognize different aspects of a/ transcriptional control information related to promoters, TATA-box regulatory region, and poly A signal, b/ those that relate to translation process comprising recognition of the translation initiation site, coding cDNA/EST fragments, reading frame-shift errors and their correction, and c/ splice-sites recognition. The review includes some of the most efficient systems for the indicated recognition problems in bioinformatics and aims to be an initial guide for those interested in these challenging problems.
1 Introduction
Bioinformatics is a complex and relatively new field of research which deals with the application of computational methods to the analysis of biological data (Attwood & Parry-Smith 1999, Baxevanis & Ouellette 2001, Mount 2001). A lot of biologically relevant information has been deposited in public databases and is available for scientific community (Attwood & Parry-Smith 1999, Baxevanis & Ouellette 2001, Mount 2001). However, generally speaking, there are no standards adopted. While all basic sequence data is obtained from experiments, the annotation of that mass of data is not always supported by experimental evidence and a lot is based on the computational analysis (Benson et al 2000). This, in addition to the incomplete understanding of the most of the biological processes (Hartwell et al 2000), complicates the application of computer methods for different analyses tasks.
Bioinformatics role is predominantly in extracting relevant information from the large quantity of biological sequence data, but also in producing hopeftilly accurate predictions of different important biological signals and regions contained in biological sequences, which can help in the sequence annotation and in reducing the quantity of target sequences for wet-lab analyses. Every signal and region recognition task in
bioinformatics can be considered as a pattern recognition problem and handled by a variety of intelligent systems techniques. The first efficient use of artificial neural networks (ANNs) in genomics was the application of a perceptron for the recognition of start codons in E.coli (Stormo et al 1982), but applications exist that cater for the whole gene structure predictions (see, for example. Cai & Bork 1998, Snyder & Stormo 1993, 1995, Uberbacher & Mural 1991, Uberbacher et al 1996, Xu et al 1996, etc). In this review we will focus on several common computational genomics problems which are solved as pattern recognition tasks by ANNs. One of the challenges is the recognition of transcriptional control signals (Weinzierl 1999), such as transcription start sites (TSS), TATA-box, and transcription termination signals such as polyA signal (Zhao et al 1999). See also reviews in Fickett & Hatzigeorgiou (1997), Pedersen et al (1999). Another group of challenging problems are systems that deal with the translation process (Kozak 1999), including recognition of the start còdon (translation initiation site -TIS) (Hatzigeorgiou 2002), coding measures required for recognition of coding exons, reading frame-shift errors and their correction (Hatzigeorgiou et al 2001). In addition we will present a system for recognition of splice-sites which separate exons and introns in eukaryotes. Our presentation includes up-to-date solutions used in this field and can help as a guide to
computer science community for further exploration of this fascinating field.
2 Genomic Signals and Regions Included
Biological processes in cell of every living organism are orchestrated by complex regulatory mechanisms at different hierarchical levels (Hartwell et al 2000). DNA molecule contains inherited information that comprises signals required to control biochemical processes in the cell. DNA contains units of inheritance, genes, which determine to the greatest extent how the organism will develop and how it would respond under specific internal and external conditions (Hartwell et al 2000). To achieve its biological function, a gene has to pass several stages of biochemical processing. These are broadly characterized as transcription and translation (Hartwell et al 2000). In eukaryotes, initially, the gene segment of DNA is copied into the so-called pre-mRNA sequence in the process named transcription Weinzierl (1999). After the primary transcript is formed, it is further processed in the so-called RNA processing, and eventually, it is translated into the final gene product in a process named translation (Hartwell et al 2000). We will consider here the recognition of several transcription control signals. Initiation of transcription is mainly controlled by a region called promoter (Weinzierl 1999). This region contains the so-called transcription start site (TSS) where the transcription starts. It also contains numerous transcription factor binding sites (TFBSs), short stretches of DNA with certain characteristic composition. Proteins called transcription factors (TPs) bind to promoter region to the binding sites in order to provide favorable environment for an RNA polymerase to initiate the transcription. One of the challenges is that there are no unique characteristics that can describe eukaryotic promoters, since TFBSs appear in different combinations at different mutual distances and in different orientation (Klingenhoff et al 1999, Zupicich et al 2001, Helhl & Wingender 2001, Kel et al 2001). Also, TP can bind to different TPBSs. At this moment there are more than 10,000 categorized TFBSs in TRANSFAC database (Wingender et al 2001), but it is reasonable to assume that there are much more that are not discovered yet. Some of the better known TFBS are TATA-box, GC-box, CCAAT-box, initiator (Inr), etc. (see Fickett & Hatzigeorgiou 1997, Pedersen et al 1999). Since eukaryotic promoters are short of common characteristics, they are very difficult to discover accurately by computational methods (Pedersen et al 1999, Fickett & Hatzigeorgiou 1997). We will present several systems based on ANNs that search for TSSs, TATA-box, and polyA signal in anonymous (non-annotated) DNA sequences.
When the transcript is formed from the template DNA, it is further subjected to RNA processing and translation. RNA processing will transform the primary transcript by splicing out sections called introns (which are internally
bounded by the splice-sites) and will perform some additional alterations at the ends of the transcript, forming the mRNA. This mature RNA is required to generate the final gene products through the translation process. The final gene products are peptides/proteins required by the biochemistry of the cell. Translation is a complex process which essentially uses information contained in the so-called open reading frame (ORF) which is a section of the mRNA that starts with the so-called start codon, and terminates with the stop codon. Codons are groups of three consecutive nucleotides that are converted into one amino acid during the translation process. The length of the ORF is divisible by three, since it contains consecutive codons. The first nucleotide of the start codon is called translafion initiation site (TIS) (Hartwell et al 2000). The ORF part of the mRNA is biased and can be detected by assessing the so-called coding measures (see Hatzigeorgiou et al 2001, Hatzigeorgiou 2002) of this region. If the number of nucleotide deletion or insertion in ORF has remainder after divided by three, the translation template will be changed and will cause frame-shift. These would result in the synthesis of wrong final product of the gene, which may be fatal for the organism. Since in the biological databases there is a limited amount of data that have proper experimental verification, it is of interest to be able to detect computationally the frame-shift errors and when possible to correct them.
We will present several systems that use ANNs in the detection of the TIS, frame-shift errors and their correction, ORPs in the mRNA or cDNA sequences, and splice-sites.
3 Systems which Recognize Control Signals for Transcription Process
Roughly we can classify transcription control signals as those related to initiation of this process and termination of it. The transcription initiation signals are associated with promoters. Recognition of the promoter region and its signals is one of the most difficult problems in computational genomics. There have been several systems developed to attack this problem with different degrees of success. For the reviews and comparisons see Fickett & Hatzigeorgiou (1997), Reese et al (2000), Bajic et al (2002b), Bajic (2000), Scherf et al (2000). We will present here eight systems which use ANNs as a part of the solution. In what follows we use three measures to describe the system performance: sensitivity Se = TP/(TP+FN), specificity Sp = TN/(TN+FP), and positive predictive value ppv = TP/(TP+FP), where TP, TN, FP, and FN denote the number of true positive, true negative, false positive, and false negative predictions, respectively. In addition, we also consider the frequency of prediction in DNA sequence.
3.1 Grail's Promoter Recognition Module
Grail program (Xu et al 1996, Matis et al 1996) is a gene recognition system. It has a module that can recognize some types of promoters. Grail recognizes mainly TATA-box containing promoters and it has sensitivity of 0.66 for such promoters making one prediction per approximately 23000 nt ('nt' stands for nucleotide). This system uses a feedforward ANN and input signals from five sensors: TATA-box, GC-box, CCAAT-box, Inr, TIS, as well as distance information between these signals. It also employs a set of refined rules associated with the distances of important signals to reduce the level of false positive predictions. The schematic diagram is depicted in Fig. 1.
DNA
string
CCAAT
TIS
LOGIC-BLOCK
SELECTION RULES
FINAL PREDICTION
relative distances
Figure 1: The structure of the Grail's promoter prediction module
3.2 McPromoter ver.3
McPromoter is a system that uses integration of certain physical properties of DNA, promoter region segmentation, and an ANN to produce prediction of promoters (Ohler et al 2001). Promoter region is divided into six sub-regions, two of which corresponding to the core promoter. These six regions correspond to the two upstream promoter segments, TATA-box, spacer between the TATA-box and Inr, Inr element, and downstream promoter element (Ohler & Niemann 2001, Ohler et al 2001). These segments represent the states in the stochastic segmentation model of the promoter region and each of the states is modeled by an interpolated Markov model (Ohler & Niemann 2001, Ohler et al 2001). Segment probabilities are calculated as joint probabilities of sequences and physical profiles. Physical profiles are derived for a number of features including GC content, DNA bendability, DNA conformation, etc. As background models, coding and non-coding sequences were used. Segment probabilities and the likelihoods of promoter and background sequences are used as inputs to the feedforward ANN to generate the final predictions. The system is developed for
invertebrates and makes approximately one prediction per 3000 nt, at the sensitivity of 0.36.
3.3 NNPP System
The NNPP program (see Reese et al 1996, Reese 2001) is based on the recognition of two specific signals within the core promoter region: the TATA-box and Inr, as well as their mutual distance. This system uses three time-delay ANNs (see Fig. 2). One ANN recognizes TATA-box, the other recognizes Inr. The third time-delay ANN combines the outputs of the previous two ANNs with the spatial distance between the TATA-box and Inr signals. The system produces one prediction per 550 nt at the sensitivity of about 0.75.
Another system based on a similar architecture was developed by Mache et al (1996) and achieved sensitivity of 0.5 with the average frequency of one prediction per 3100 nt. It has been shown in another report (Hatzigeorgiou et al 1994) that better performance of promoter recognition can be obtained with the feedforward networks than with the time-delay neural networks.
] Coordination TDNN
Level 1
receptive fields for coordination TDNN
.. , receptive field receptive field forTDNN2 forTDNNI
sliding windov/for TDNN system
input signals
Figure 2: The structure of the NNPP system
392 Informatica 26 (2002) 389-400
V. B. Bajic et al.
3.4 Promoter 2.0
forj-th Cluster
Y1
window of 6
^NA string -,,,,,,,,--
Figure 3: The structure of one of the layers in the Promoter2 system
Promoter 2.0 (Knudsen 1999) uses four ANNs to recognize promoter. The ANNs are employed to model TATA-box, GC-box, CCAAT-box and Inr, as well as distances between these elements. The ANNs are connected in a special hierarchical manner, so that the output of one ANN serves as an additional input for the higher level ANN (see Fig. 3 for one layer structure). Due to the unusual structure, the conventional backpropagation and similar training algorithms cannot be used directly. The system is trained with a simplified version of a genetic algorithm. It uses a window of 6 nt in the scanning process along the DNA sequence.
3.5 SPANN2 System
DNA sequence
data window
				
B1		....	B10	
				

CLUSTER 1 ^_1 CLUSTER 10
LOGIC
BLOCK
Figure 4a: Structure of the SPANN2 system
i ^
Figure 4b: Information processing in Bj block which corresponds to the j-th data cluster
SPANN2 system (Bajic & Bajic 2000) uses domain transformations to convert the primary DNA information into more convenient form for ANN processing. The system uses 11 ANNs: one SOM ANN which partitions input data into 10 clusters, and one generalized regression ANN for each of the clusters to make the final prediction. On the test-set of Fickett & Hatzigeorgiou (1997) the system exhibits a sensitivity of 0.33 and 16 false positive predictions which is a favorable performance on this test set. The structure of the system and information processing is depicted in Figs. 4a-c.
Figure 4c: Information processing in block A
3.6 LVQ Networks for TATA-box Recognition
The system of Wang (2001) (Fig. 5) uses two LVQ ANNs in the process of recognition of TATA-box. The system considers 8 derived features from the TATA-box and its neighborhood. These features are used in the preliminary filtering process which reduces the number of false positive predictions. This filtering is based on the analysis of a number of statistical properties. For the sequences that pass the filter, data compression is applied that reduces the dimension of the transformed feature space to 6. This serve as input data for the LVQ system that uses two LVQ networks and a set of rules to further reduce the false positives. The system is trained by a genetic algorithm due to combination of rules and two LVQ networks. The system achieves sensitivity of 0.33 and 47 FP on the dataset from Fickett & Hatzigeorgiou (1997), which can be considered favorable on the basis of comparison measures used in Bajic (2000).
DATA WINDOW FROM DNA SCAN ING
FEATURE VECTORS OF SIZE 8
STATISTICAL FILTERING AND FEATURE SPACE REDUCTION
TRANSFORMED FEATURE VECTORS OF SIZE 6
IF LVQ ANNI CLASSIFIES INPUT AS
CORRESPONDING TO THE TATA MOTIF, THE LOGIC BLOCK PASSES THAT DATA TO THE INPUT OF LVQ ANN2
FINAL DECISION: TATA OR NON-TATA
Figure 5: Structure of the Wang's system for TATA-box recognition
3.7	Hamming Networks and Recognition of TATA-box and PolyA-signal
The systems developed by Milanesi et al (1996) use Hamming clustering method to partition the input training space to clusters, so that for each cluster a specific representative prototype is selected. The input signals for the TATA-box motif and polyA-signal motifs are given in a special binary form, and Hamming distance is used in the clustering process. After the initial clustering of the input data, the feedforward ANN is trained with the representative cluster prototypes and with negative data to produce ANN that will recognize the required type of input signal.
3.8	Dragon Promoter Finder ver.1.3
Dragon Promoter Finder (DPF) is an enhanced integrated promoter recognition system (see Bajic et al 2002a,b,c). It uses a composite-hierarchical approach, artificial intelligence, statistics, and signal processing techniques. It separates promoter sequences to those that are G+C-rich or G+C-poor. The system structure is represented in Figs. 6a-b. There are separate models for different expected sensitivity levels (Fig. 6a). Each model is optimized for a particular sensitivity level to minimize the number of FP predictions. Also, models are derived separately for G+C-rich and G+C-poor sequences. Within a particular model three sensors for promoter, intron and coding exon sequences compete and an ANN determines if the input sequence corresponds to the promoter or not. DPF allows scanning of complete vertebrate genomes for promoters with significantly reduced number of false positive predictions (see Bajic et al 2002a,b,c). It can be used in combination with the gene finding programs for more accurate prediction of the 5'end of genes. The system was evaluated on a large and diverse human sequence-set and exhibited several fold less false positive predictions at the same level of sensitivity than several publicly available TSS-finding programs. Results obtained using human chromosome 22 data showed even greater specificity than the evaluation set results. The system has been implemented in the Dragon Promoter Finder package, which can be accessed at http://sdmc.krdl.org.sg:8080/promoter/. The system is capable of successftilly recognizing broad classes of promoters, regardless of whether they are CpG-island related or not, or whether they are in the G+C-rich and G+C-poor regions. This makes it quite universal as opposed, for example, to solutions in Hannenhalli & Levy (2001), loshikhes & Zhang (2000) which are specialized in recognizing CpG-island related promoters, or TATA-box containing promoters in G+C-rich regions (Down & Hubbard 2002). Based on the results on the mouse genome
analysis, the average frequencies of predictions based on several sensitivity settings for DPF are summarized in Table 1. The predictions that were within 1000 nt apart were combined into one prediction.
SUPPLIED BY USER I-1
DNA sequence
STRUCTURE OF DRAGON PROMOTER FINDER SYSTEM
SELECTED BY	n|
USER
I-1	I-
ACCURACY RANGE
SLIDING DATAWINDOW
MODEL SELECTOR
DATA-	
WINDOW	N
SIZE	-1/
SELECTOR	
MODEL_n
Figure 6a: Dragon Promoter Finder system structure
BASIC PREDICTOR MODEL OF DRAGON PROMOTER FINDER
DNA SEQUENCE SUPPLIED BY USER I-1
PREDICTOR OUTPUT
content of a sliding
	Promoter Sensor
	
	Cvnn Qoncnr
	cxon oensor
	
_.	Intron Sensor
	
pre-processing and ANN
Figure 6b: Structure of one of the models in DPF
Testing results of the DPF ver. 1.3 system on the annotated known genes on human chromosome 22 are summarized in Table 2. The annotation data (Rel.2.3) for human chromosome 22 were produced by the Chromosome 22 Gene Annotation Group at the Sanger Centre and were obtained from the World Wide Web (Dunham et al.). It should be noted that human chromosome 22 is very G+C rich and consequently atypical for human genome, and thus the performance of prediction programs on this chromosome cannot serve as a basis for assessment of the general prediction ability. However, it is good to use it as a relative reference source. We considered the predicted TSS positions correct if they fall within the interval of length equal to the average length of the promoter region predicted by Promoterlnspector (555bp), while the other conditions were the same as in Scherf et al (2001). If the predictions were within 1000 nt apart they were combined in one prediction. For example, Table 2 implies that for each TP at Se = 80%, an average of 3.34 FP predictions were made; at Se = 0.58%, 1.4 FP predictions were made; and at Se = 0.49% on average 1.1 FP predictions were made.
TABLE 1 Average predictions of DPF ver. 1.3 on the non-masked version of the Mouse Genome Assembly (v
Expected Se	Frequency of predictions	Total number of predictions
50%	40957 nt	135377
65%	16433 nt	337399
80%	4973 nt	1114806
TABLE 2 The results of promoter prediction by DPF ver. 1.3 on human chromosome 22.
Se	ppv	Cost of making one TP prediction, i.e. FP/TP
49%	48%	1.1
58%	42%	1.4
64%	33%	2.0
74%	30%	2.34
80%	23%	3.42
4 Recognition of Translation Related Signals and Regions
The problem of TIS recognition appears in different contexts, from the recognition of TIS in anonymous DNA (Derst et al. 2000), to its recognition in mRNA, cDNA or EST fragments (Pedersen & Nielsen 1997, Salzberg 1997, Bürge & Karlin 1997, Agarwal & Bafna 1998a,b, Salamov et al 1998, Zien et al 2000, Nishikawa et al 2000, Hatzigeorgiou 2002, Hatzigeorgiou et al 2001). Also, most of the gene recognition programs have modules that aim at recognizing TIS. In a vast majority of cases (Kozak 1999) TIS confomis the well-known 'ATG' consensus in DNA ('AUG' in mRNA). On average, this consensus appears once in 64 nt on a completely random DNA.
The recognition of TIS can be efficiently set up if the ribosome scanning model (selection of the most 5' AUG in the proper context in mRNA as the initiation codon, Kozak 1999), is used in the recognition process such as in Hatzigeorgiou (2002). Unfortunately, there are many mRNA/cDNA sequences that have long 5'UTR which can contain other upstream ATGs (upstream of the main TIS), upstream stop codons, or even complete upstream ORFs (Kozak 1999). This makes recognition of the functional TIS far more difficult. The recognition problem is even more complicated by having partial cDNA or mRNA sequences, those that contain incomplete main ORF. The ability of computational prediction of TlSs is hampered in such cases. When the length of the 5'UTR is greater, the possibilities increase that such sequences could contain AUG triplets,
stop codons or the 'so-called' mini ORFs. Tliis opens ways for violating the ribosome scanning rule (Kozak
1999)	and generally allows several scenarios for a downstream AUG to be used instead of the first AUG. These phenomena are known as leaky scanning, reinitiation and direct internal initiation. Without entering into details of these phenomena (Kozak 1999) we point out that such possibilities are considerably reduced in the case of short 5'UTRs. The computational problem is significant when 5'UTRs are longer and this motivated Salamov et al (1998) to develop a method (ATGpr) for TIS recognition that can cope with such a situation. Several techniques have been employed in attempts to develop efficient method for computer-based recognition of TIS. These include support vector machines (Zien et al
2000),	linear discriminant analysis (Salamov et al 1998), different ANNs (Hatzigeorgiou 2002, Hatzigeorgiou et al 2001, Derst et al 2000, Pedersen & Nielsen 1997), generalized higher-order profiles (Agarwal & Bafna 1998a,b), positional conditional probability matrices (Salzberg 1997) and other statistical techniques (Bürge & Karlin 1997, Nishikawa et al 2000). Different sequence features were employed in these recognition methods, and also different lengths of TIS neighborhood are used to extract these features. The best reported results on TIS recognition in cDNA (Hatzigeorgiou 2002) indicate sensitivity and positive predictive value of 0.94. Unfortunately, that method cannot be applied to partial cDNA since it uses a variant of the ribosome scanning model which essentially requires a complete ORF, Other methods, such as ATGpr (Salamov et al 1998), WWW_Evaluator (Rogozin et al 1996) and NetStart (Pedersen & Nielsen 1997) do not rely on the ribosome scanning model and do not require a full length ORF in cDNA fragments.
The most direct way to characterize the coding regions of genomes and provide reliable information for structural annotation of genes in genomic sequences still remains the analysis of sequences from cDNA libraries. As a consequence of their contribution to rapid gene discovery, full and partial cDNA sequences have been generated in very large numbers, both in public and private sectors. ESTs make up currently more than 60% of all the database entries and EST sequencing projects have already started to have a major impact on biomedical research, by accelerating the identification of new genes of interest as potential targets for drug discovery, and by providing target sequences for genome wide expression profiling.
Unlike high quality finished genome sequences, which are double-strand and multiple-pass, cDNA and EST sequences are mostly single-strand, single-pass sequences which frequently contain sequencing errors. Errors may result in nucleotide substitutions, insertions or deletions, leading to frame-shifts which cause problems in the analysis of these sequences. The analysis of ESTs is further complicated by the fact that they are usually short, 300-600 nucleotides in length, they originate from
different parts of the cDNA, and may include only sequence of non-translated regions.
4.1 Translation Initiation Site
There are several systems used for recognition of the TIS in cDNA based on the use of ANNs.
4.1.1 Diana-TIS
The system Diana-TIS (Hatzigeorgiou 2002) uses two ANNs. The first one, the consensus ANN, assesses the TIS and its immediate surrounding (see Fig. 7a). The second one, the coding ANN, assesses the coding potential of the regions upstream and downstream of TIS. Several approaches have been investigated to find a good method for coding region prediction. The best performance is obtained by an ANN that uses preprocessed data as input. The most efficient preprocessing method was the counting of codons in a window using a step of three nucleotides and starting with the window's first codon (see Fig. 7b). Negative examples are extracted randomly from the non-coding regions and also from those windows of the coding region that start with the second and third nucleotide of a codon (i.e., those that are out of frame). The finally assembled system uses these two networks and several additional rules to predict the TIS. The consensus ANN is with short-cut connections, has two hidden neurons, and it is trained by cascade-correlation algorithm. The coding ANN is a feedforward network trained by resilient backpropagation.
local TIS-score
output layer
hidden layer
input layer
^ DNA-Sequence'
a a a c c
MHgIaIcItI tI

Figure 7a: shows the architecture of the implemented TIS recognition module. A sliding window of 12 nucleotides is presented to the trained ANN. A high score at the output of the ANN indicates a possible TIS.
Abilities of individual ANNs and the integrated method to correctly recognize TIS are summarized in Table 3. Both Se and ppv are obtained from the results on the positive and negative test sets.
TABLE 3. Performances of three ANNs used
	(Se+ppv)/2
Coding ANN	0.764
Consensus ANN	0.825
Integrated method	0.94
V. B. Bajic et al.
The system is http://diana.pcbi.upenn.edu
available
under:
coding potential
4.1.2 Netstart
The NetStart system (Pedersen & Nielsen 1997) predicts TIS by an ANN. The input to the ANN is the nucleotide sequence translated through the binary code. Data window is of 203 nucleotides in length and a feedforward three layer ANN is used with 30 hidden neurons. The system achieves Se = 0.78 and Sp = 0.87 and can be accessed at http://www.cbs.dtu.dk/services/NetStart/
4.2 Coding Regions and Frame-shift
DIANA-EST system (Hatzigeorgiou et al 2001) is aimed at recognition of the coding or non-coding EST/cDNA sequences, reading frame-shift errors and corrections of some of these. It uses three ANNs. DIANA-EST is based on a combination of ANNs and statistics for the characterization of coding regions within ESTs. Two major problems in the analysis of ESTs are their short length and frequent sequencing errors. Their short length makes it difficult to use non-frame-specific coding measures and, conversely, frame-specific methods suffer from the frequent frame shifts introduced by sequencing errors. To overcome these problems DIANA-EST incorporates two separate modules: a coding/non-coding sensitive module and a frame/non-frame sensitive module. Both modules are based on ftame-specific codon usage statistics combined with an ANN (Hatzigeorgiou, 2001).
The selection of positive and negative data was made similarly as in the case of the DIANA-TIS system. The performance of this particular ANN on human sequence has an accuracy characterized by (Se+ppv)/2 = 84% (Hatzigeorgiou et al 2001).
For the prediction of the coding regions in EST's all three modules are integrated into one system. In the first step the frame-ANN is applied on a sliding window along the sequence. If the sequence is derived from a coding region without sequencing errors the output will be a sequence of numerical values with a high score in every third position. This is the position of the first nucleotide of a codon. If a deletion or insertion occurs, this periodicity will get disturbed. In the ideal case the response of the ANN in scanning of a coding nucleotide sequence will be made of a chain of alternations of 1 0 0, starting with 1
(for example: 1 0 0 1 0 0 1 0 0.....1 0 0 1 0 0). In the
second step of the algorithm the ideal chain of the sequence gets aligned with the real score values in order to maximize the overall coding score potential (which, in this case, is calculated by
output layer
hidden layer
Input layer
codon usage vector ( j | 3 | 0 | ° °	j
D

64
DNA-Sequence a | a| a | c | c |
c tact g t
54
Figure 7b: shows coding region recognition module. A window of 54 nucleotides slides along the sequence.
After preprocessing, the data of each window are presented to the trained ANN. A high ANN output indicates a window of a coding region starting on the first nucleotide of a codon.
The selection of positive and negative data was made similarly as in the case of the DIANA-TIS system. The performance of this particular ANN on human sequence has an accuracy characterized by (Se+ppv)/2 = 84% (Hatzigeorgiou et al 2001).
For the prediction of the coding regions in EST's all three modules are integrated into one system. In the first step the frame-ANN is applied on a sliding window along the sequence. If the sequence is derived from a coding region without sequencing errors the output will be a sequence of numerical values with a high score in every third position. This is the position of the first nucleotide of a codon. If a deletion or insertion occurs, this periodicity will get disturbed. In the ideal case the response of the ANN in scanning of a coding nucleotide sequence will be made of a chain of alternations of 1 0 0, starting with 1
(for example: 1 0 0 1 0 0 1 0 0.....1 0 0 1 0 0). In the
second step of the algorithm the ideal chain of the sequence gets aligned with the real score values in order to maximize the overall coding score potential (which, in this case, is calculated by multiplying the values of the two sequences). This alignment is made by a dynamic programming approach. Finally, retracing back the path of the best alignment (the alignment with the best score) it is possible to locate the frame changes. This method is similar with the approach described in (Xu et al 1995). In order to avoid a frequent frame-switching it is necessary to introduce frame-change penalty. For the recognition of the coding start the consensus-ANN is used to calculate a score for every ATG (putative TIS). In Addition the non-coding/coding potential around every ATG is calculated by building the difference between all coding scores (calculated by the coding ANN) of in-frame 60 positions before and after the ATG. If the product of the consensus score and the non-coding/coding difference is above a certain threshold
(here 0.2) and the ATG is on the leading frame, the ATG is characterized as TIS.
Stop codons are permitted on the predicted coding region. Possible ends of the coding sequence are determined by the presence of stop codons in a local coding frame. Local coding frame means that the frame of the stop-codon has a high score for 60 nucleotides before the stop codon.
The integrated method of detection of coding/non-coding sequences has Se = 0.865 and ppv = 0.796. Correction ability of the integrated system is estimated to be characterized by (Se+ppv)/2 = 0.897. The system	is	available under:
http://diana.pcbi.upenn.edu
ensemble of coding/non-coding ANNs is formed from 6 ANNs. One ANN uses data window of 101 nt, 4 ANNs use data window of 201 nt, and one ANN a data window of 251 nt. These different window sizes are used to cater for short and long exons. This ensemble achieves Se = 0.91, Sp = 0.895. One may consider the splice site ensemble as dealing with the local information, while the coding/non-coding ensemble deals with the global information surrounding the sites. These two ANN ensembles are combined into the final system and additional rules are used for post-prediction filtering. The final system achieves the following results on A.thaliana data set:
5 Splice Sites Recognition
Splice sites represent boundaries between the introns and exons in eukaryotic organisms. They are located in introns and the boundary between the exon and the adjacent intron is characterized by a canonical nucleotide pair GT and called 'donor' site, while the boundary between the intron and the adjacent exon is characterized by AG nucleotide pair and it is called 'acceptor' site. Consequently introns are bounded by GT and AG splice sites. Since many of the internal exons are also coding exons, a nucleotide bias is present in such coding exons and this frequently can help in predictions of the splice sites.
5.1 NetPlantGene
One of the systems for splice sites prediction based on ANNs is NetPlantGene (Hebsgaard et al 1996). This system uses ANNs and additional rules to infer the predictions of splice sites. It combines local and global sequence information. ANNs used in this system are all feedforward, three layer ANNs, trained by backpropagation algorithm. The input sequence is obtained from appropriate nucleotide data window, where nucleotides are coded by a sparse binary code A = (1000), C = (0100), G = (0010), T = (0001). Each ANN has only one output node which produces real number response in the range [0,1]. Two types of ANNs are designed for this system: 1/ ANNs for detection of coding nucleotides as opposed to non-coding ones, and 2/ ANNs used for detection of splice sites. The coding/non-coding ANNs are trained to achieve maximal correlation coefficient (CC). The splice sites ANNs are trained to achieve Se = 0.95 while the number of FPs is minimized. Data window used for splice site ANNs is 23 nt, and 10 hidden layer neurons were used. An ensemble of 10 ANNs which were randomly initialized is formed and the average of the outputs of ensemble ANNs is used for predictions.
On A.thaliana dataset this ensemble achieves Se = 0.61, ppv = 0.69 for donor sites. For acceptor sites the performance is similar, but due to complexity of acceptor sites a window of 61 nt is used and all ANNs in the ensemble of 10 ANNs had 15 hidden layer neurons. An
5.2 Other Splice Site Systems Based on ANNs
Several other systems for splice site recognition based on different types of ANNs are developed. Generally, their technical solutions exploit the coding/non-coding region contrast and immediate surrounding of the splice sites. The best known of these systems are NetGene2 (http://www.cbs.dtu.dk/services/NetGene2/) which evolved from NetGene (Brunak et al 1991) and NetPlantGene,	NNSplice
(http://www.fruitflv.org/seq_tools/splice.html'). BRAIN (Rampone 1998) and EXON-ENet (Fu 1999). A comparison of the many splice site recognition systems including some of those based on ANNs are given in the study of Thanaraj (2000).
DNA sequence
Figure 8: NetPlantGene system structure
TABLE 4. Performances of NetPlantGene ANN ensembles
	Se	FP
Integrated method (donor sites)	0.80	0.011%
Integrated methods (donor sites)	0.95	0.097%
Integrated method (acceptor sites)	0.80	0.034%
Integrated methods (acceptor sites)	0.95	0.26%
6 Conclusion
We presented an overview of a number of ANN based systems used for different problems associated with transcriptional and translational signals and regions, as well as splice-sites. These systems show overall improved performances as compared to other non-ANN based systems and thus represent very promising tools for specific pattern recognition in computational genomics. The indicated performances of these systems can be used only as a guideline since the training and test sets for these systems were, generally, different. Interested reader can find details of the solutions in the respective literature.
References
[1]	p. Agarwal, V. Bafna (1998a) Detecting nonjoining correlations within signals DNA. In Proceedings of the 2'"' Annual International Conference on Computational Molecular Biology, RECOMB98. ACM Press, pp. 1-7.
[2]	P. Agarwal, V. Bafna (1998b) Translation initiation: implications for gene prediction and full-length cDNA. In Proceedings of the S"' International Conference on Intelligent Systems for Molecular Biology, ISMB98. AAI Press, pp.2-7.
[3]	T. K. Attwood, D. J. Parry-Smith (1999) Introduction to Bioinformatics, Addison Wesley Longman Ltd., Essex, UK
[4]	V. B. Bajic (2000) Comparing the success of different prediction software in sequence analysis: A review. Briefings in Bioinformatics, Vol. l,No. 3, pp. 214-228.
[5]	V. B. Bajic and 1. V. Bajic (2000) Neural network system for promoter recognition. Chapter 14 in Future Directions for Intelligent Systems and Information Science (Nik Kasabov, Ed.), pp. 288-305, Physica-Verlag, New York.
[6]	V. B. Bajic, S. H. Seah, A. Chong, G. Zhang, J. L. Y. Koh, V. Brusic (2002a) Dragon Promoter
Finder: recognition of vertebrate RNA polymerase II promoters, Bioinformatics, 18(1):198-199.
[7]	V. B. Bajic, A. Chong, S. H. Seah, V, Brusic (2002b) Intelligent System for Vertebrate Promoter Recognition, IEEE Intelligent Systems Magazine, July/August, 17 (4): 64-70.
[8]	V. B. Bajic, S. H. Seah, A. Chong, S. P. T. Krishnan, J. L. Y. Koh, V. Brusic (2002c) Computer model for recognition of functional transcription start sites in polymerase 11 promoters of vertebrates, Journal of Molecidar Graphics & Modeling, in print.
[9]	D. Baxevanis & B. F. F. Ouellette (2001) Bioinformatics, A Practical Guide to the Analysis of Genes and Proteins, Wiley -Interscience, New York.
[10]D.	A. Benson, I. Karsch-Mizrachi, D. J. Lipman, J. Ostell, B. A; Rapp, D. L. Wheeler (2000) GenBank, Nucleic Acids Research 28: 15-18.
[11]S.	Brunak, J. Engelbrecht, S. Knudsen (1991) Prediction of Human mRNA Donor and Acceptor Sites from the DNA Sequence, Journal of Molecular Biology, 220, 49-65.
[12]Burge,	S. Karlin (1997) Prediction of complete gene structures in human genomic DNA. J. Mol. Biol. 268, 78-94.
[13]	Y. Cai, P. Bork (1998) Homology-Based Gene Prediction Using Neural Nets, Analytical Biochemistry 265:269-274
[14]Derst,	M. Reczko, A. Hatzigeorgiou (2000) Prediction of human translational initiation sites using a multiple neural network approach. International Journal of Computers, Systems, and Signals, 1, 169-179.
[15]T.	A. Down, T. J. P. Hubbard (2002) Computational detection and location of transcription start sites in mammalian genomic DNA, Genome Research, 12:458-461.
[16]	Dunham et al.. Unpublished data, http://www.sanger.ac.uk/HGP/Chr22/
[17]	J. W. Fickett, A. G. Hatzigeorgiou (1997) Eukaryotic promoter recognition. Genome Research, 1 861-878.
[18]L.	M. Fu (1999) An Expert Network for DNA Sequence Analysis, IEEE Intelligent Systems, Jan./Feb. 65-71.
[19]Hannenhalli,	S. Levy (2001) Promoter prediction in the human genome, Bioinformatics, 17 Suppl. 1, S90-S96.
[20]	L. H. Hartwell, L. Hood, M. L. Goldberg, A. E. Raynolds, L. M. Silver, R. C. Veres (2000) Genetics: From Genes to Genomes, McGraw-Hill Higher Education, Boston.
[21]	G. Hatzigeorgiou (2002) Translation initiation start prediction in human cDNAs with high accuracy, Bioinformatics, 18, 343-350.
[22]	G. Hatzigeorgiou, P. Fiziev, M. Reczko (2001) DIANA-EST: a statistical analysis, Bioinformatics, 17, 913-919.
[23]	G. Hatzigeorgiou, N. Mache, J. Wieland, M. Reczko, A. Zell (1994) Recognition of promoters and coding regions on eukaryotic sequences with neural networks, in Bioinformatik	94,	pp.70-74, gopher://gopher.imb-iena.de/l l/ftp/bioinf94.
[24]	R. Hehl, E. Wingender (2001) Database-assisted promoter analysis. Trends in Plant Science, 6(6) 251-255.
[25]	S. M. Hebsgaard, P. G. Korning, N. Tolstrup, J. Engelbrecht, P. Rouze, S. Brunak (1996) Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence information. Nucleic Acids Res. Sep l;24(17):3439-52.
[26]?.	loshikhes, M. Q. Zhang (2000) Large-scale human promoter mapping using CpG islands. Nature Genetics, 26, 61-63.
[27]	E. Kel, O. V. Kel-Margoulis, P. J. Farnham, S. M. Bartley, E. Wingender, M. Q. Zhang (2001) Computer-assisted identification of cell cycle-related genes: new targets for E2F transcription factors. J. Mol. Biol. 309, 99-120.
[28]	K. Klingenhoff, K. Frech, T. Quandt, T. Werner (1999) Functional promoter modules can be detected by formal models independent of overall nucleotide sequence similarity. Bioinformatics, 15(3) 180-186.
[29]	S. Knudsen (1999) Promoter2.0; for the recognition of Pol II promoter sequences, Bioinformatics, 15,356-361.
[30]	M. Kozak (1999) Review: Initiation of translation in prokaryotes and eukaryotes. Gene, 234, 187-208
[31]N.	Mache, M. Reczko, A. Hatzigeorgiou, Multistate time-delay neural networks for the recognition of POL II promoter sequences, ISMB96. St. Louis, http://www.informatik.uni-stuttgart.de/ipvr/bv/personen/mache 1996
[32]	S. Matis, Y. Xu, M. Shah, X. Guan, J. R. Einstein, R. Mural, E. Uberbacher (1996)
Detection of RNA Polymerase II Promoters and PolyAdenylation sites in human DNA sequence. Computers & Chemistry, 20, 135-140.
[33]	L. Milanesi, M. Muselli, P. Arrigo (1996) Hamming-Clustering method for signals prediction in 5' and 3' regions of eukaryotic genes. Comput Appi Biosci. Oct; 12(5):399-404.
[34]	W. Mount (2001) Bioinformatics : Sequence and Genome Analysis. Cold Spring Harbor Laboratory Pr., NY, USA
[35]	T. Nishikawa, T. Ota, T. Isogai (2000) Prediction whether a human cDNA sequences contains initiation codon by combining statistical information and similarity with protein sequences, Bioinformatics, 16, 960-967.
[36]	U. Ohler, H. Niemann (2001) Identification and analysis of eukaryotic promoters: recent computational approaches, Trends Genet. 17:5660.
[37]	U. Ohler, H. Niemman, G-c. Liao, G. M. Rubin (2001) Joint modeling of DNA sequence and physical properties to improve eukaryotic promoter recognition, Bioinformatics, 17 Suppl. 1, S199-S206.
[38]G.	Pedersen, P. Baldi, Y. Chauvin, S. Brunak (1999) The biology of eukaryotic promoter prediction - a review. Computers & Chemistry, 23, 191-207.
[39]G.	Pedersen, H. Nielsen (1997) Neural network prediction of translation initiation sites in eukaryotes: perspectives for EST and genome analysis. Proceedings of the 5"" International Conference on Intelligent System for Molecular Biology, ISMB97, AAA! Press, pp.226-233.
[40]	S. Rampone (1998) Recognition of Splice-Junctions on DNA Sequences by BRAIN learning algorithm. Bioinformatics, 14(8), 676684.
[41]M.	G. Reese (2001) Application of a time-delay neural network to promoter annotation in the Drosophila melanogaster genome. Computers & Chemistry, Dec; 26(l):51-56.
[42]	M. G. Reese, N. L. Harris, F. H. Eeckman (1996) Large scale sequencing specific neural networks for promoter and splice site recognition. Biocomputing: Proceedings of the 1996 Pacific Symposium (L. Hunter and T. E. Klein, Eds.), January 2-7, World Scientific Publishing	Co.,	Singapore, http://www.fruitfly.org/seq_tools/promoter.html
[43]	M. G. Reese, G. Hartzell, N. L. Harris, U. Ohler, J. F. Abril, S. E. Lewis (2000) Genome
annotation assessment in Drosophila melanogaster, Genome Research, \Q, A?>3-5Q)\.
[44]	B. Rogozin, A. V. Kochetov, F. A. Kondrashov, E. V. Koonin, L. Milanesi (2001) Presence of ATG triplets in 5' untranslated regions of eukaryotic cDNAs correlates with a "weak" context of the start codon. Bioinformatics, 17(10):890-900.
[45]	Salamov, T. Nishikawa, M. B. Swindells (1998) Assessing protein coding region integrity in cDNA sequencing projects, Bioinformatics, Jun; 14(5):384-390.
[46]	S. L. Salzberg (1997) A method for identifying splice sites and translational start sites in eukaryotic mRNA, Comput. Appi. Biosci, 13, 365-376.
[47]	M. Scherf, A. Klingenhoff, T. Werner (2000) Highly specific localisation of promoter regions in large genomic sequences by Promoterlnspector: A novel context analysis approach, J. Mol. Biol., 297, 599-606
[48]	M. Scherf, A. Klingenhoff, K. Frech, K. Quandt, R. Schneider, K. Grote, M. Frisch, V. Gailus-Durner, A. Seidel, R. Brack-Werner, T. Werner (2001) First pass annotation of promoters on human chromosome 22, Genome Research, 11:333-340
[49]	E. Snyder, G. D. Stormo (1995) Identification of coding regions in genomic DNA, J. Mol. Biol., 248, 1-18
[50]	E. Snyder, G. D. Stormo (1993) Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks, Nucl. Acids Res., 21(3):607-613.
[51]	D. Stormo, T. D. Schneider, L. Gold, A. Ehrenfeucht (1982) Use of the 'Perceptron' algorithm to distinguish translational initiation sites in E.coli, Nucl. Acids Res. 10:2997-3012
[52]	T. A. Thanaraj (2000) Positional characterization of false positives from computational prediction of human splice sites, Nucl. Acids. Res., 28(3):744-754.
[53]	E. C. Uberbacher, R. J. Mural (1991) Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach, Proc. Natl. Acad. Sci. USA, 88:11261-11265, Dec.
[54]	E. C. Uberbacher, Y. Xu, R. J. Mural (1996) Discovering and understanding genes in human DNA sequence using GRAIL, Methods Enzymol. 266:259-281.
[55]	Wang (2001) Statistical pattern recognition based on LVQ artificial neural networks:
Application to TATA-box motif, (M.Tech.degree), Technikon Natal, South Africa
[56]	R. O. J. Weinzierl, Mechanism of Gene Expression, Imperial College Press, London, 1999.
[57]	E. Wingender, X. Chen, E. Fricke, R. Geffers, R. Hehl, 1. Liebich, M. Kruli, V. Matys, H. Michael, R. Ohnhäuser, M. Prüß, F. Schacherer, S. Thiele, and S. Urbach (2001) The TRANSFAC system on gene expression regulation, Nucl. Acids. Res. 29: 281-283.
[58]	Y. Xu, R. J. Mural, E. C. Uberbacher (1995) Correcting sequencing errors in DNA coding regions using a dynamic programming approach.Co/M/>u/ Appi Biosci. Apr;l](2):l 1724.
[59]	Y. Xu, R.J. Mural, J.R. Einstein, M.B. Shah, E.C. Uberbacher (1996) GRAIL: A Multi-Agent Neural Network System for Gene Identification, Proceedings of IEEE, 84(10) 1544-1552.
[60]	Zhao, L. Hyman, C. Moore (1999) Formation of mRNA 3' ends in eukaryotes: mechanism, regulation, and interrelationships with other steps in mRNA synthesis. Microbiol Mol Biol Rev, Jun; 63(2):405-45
[61]Zien,	G. Raetsch, S. Mika, B. Schoelkopf, C. Lemmen, A. Smola, T. Lengauer, K. -R. Mueller (2000) Engineering support vector machine kernels that recognize translation initiation sites, Bioinformatics, 16, 799-807.
[62]Zupicich,	S. E. Brenner, W. C. Skarnes (2001) Computational prediction of membrane-tethered transcription factors. Genome Biology 2(12) 1-6.
Adapting Proactive Mobile Agents to Dynamically Reconfigurable Networks
Christian Erfurth and Wilhelm Rossak
Friedrich-Schiller-University Jena
{cen,rossak}@informatik.uni-jena.de, http://swt.informatik.uni-jena.de
Keywords: mobile agents, intelligent navigation, itinerary problem
Received: June 15, 2002
This paper describes our current results and research activities in the fìeld of intelligent distributed computing in dynamic computer networks. In this area, we focus on mobile agents and mobile agent systems, especially on the itinerary problem of mobile agents. We describe a framework to improve an agent's migration process and to increase its autonomy and adaptivity in the planing and travel phase. As a basis we utilize the Tracy mobile agent system, developed at FSU during the last few years. For our purposes we expand this system by adding inteUigent and adaptable system components, as well as a basic information infrastructure.
1 Introduction
and results in the area of itinerary planning.
Mobile agents are autonomous and proactive software entities which act on behalf of an owner, communicate and cooperate with each other, and have the ability to migrate through a heterogenous network of computers [21, 3]. Within the last years an increasing amount of research has been devoted to this area. Topics like communication [2], cooperation [15], migration [20], applications [16, 17], security issues [19], etc. are investigated at several universities and companies.
Mobile agents can be used in distributed real-time applications to avoid network latency by migrating to the critical domains in the network, help to reduce network traffic on the WWW by moving the code to the data sources, build and manage intelligent on-line communities (for humans and/or agents), etc. There are lots of other possible application areas that have been widely discussed. True is that mobile agents are not limited to a single killer application, or even need one. We at FSU look at (mobile) agents mostly from the viewpoint of distributed computing systems, i. e. as a new paradigm for open peer-to-peer computing.
In a typical scenario, a mobile agent visits more than one host in a network of so-called agencies to fulfill its task. Thus, it needs a "travel plan" when it starts its journey, a usually fixed plan that is provided by its programmer/owner. However, due to the size and possibly dynamical behavior of modern (ad-hoc) networks that change much faster than the agent-owner's perception of the net, mobile agents must by now develop the ability to construct and pro-actively adapt their own travel plans, which we call an itinerary.
We at FSU, in the TRACY project [1], focus as one hotspot on the itinerary problem of mobile agents by improving the basic technical efficiency of their migration and by increasing their autonomy and adaptivity in the planning and travel phase. This paper outlines our current research
2 A Sample Scenario
Before we start to explain our framework we want to introduce a general scenario for the application. It should help to describe our idea and our goals. In the scenario, the mobile agents visits a set of agencies while migrating through the network to fulfill its task.
A user (the owner) hands over a task to an agent. Normally, such a task should not contain information on HOW to fulfill. Hence, the agent has to organize the journey through the network by itself. Therefore, the agent searches for suitable services at a map provided by the local agency. This map contains information on services within the net and some network characteristics. The search result is a set of agencies within the network that should be visited. Now the agent may trigger a route planer to use the available map's information on connection topology and qualities to identify a possible trip through the network. The result is a first travel plan - the itinerary. Before the agent begins the trip, it might use a migration planer to optimize the trip from an efficiency perspective.
Now the agent "executes the itinerary" and starts the migration. During the trip the agent visits service points and communicates and cooperates with other agents. At any point in time, but at least when migrating to further away agencies (map's information is more blurred for further away agencies), the agent may fine-tune and re-adapt its itinerary. This is achieved by taking advantage of the more detailed information now available locally. Finally, after its trip, the agent hands over the results to its principal. This might include a description of the visited agencies, a kind of travel report.
As indicated, we want to provide an infrastructure that enables agents to be more autonomous. A user should concentrate on WHAT the agent has to do and not on HOW.
Ch. Erfurth et al.
3 The Framework
To improve the autonomy of mobile agents means mostly to improve the pro-activity of an agent. The agent has to initialize its itinerary on its own by locating services within the network which are suitable to fulfill the owner-given task. While visiting these service points (so-called agencies) within the network, the agent may in addition adapt to the changing environment (broken links, new services and platforms, etc.) and modify its itinerary dynamically.
We strive to provide an infrastructure that enables mobile agents to plan and modify their itinerary in an autonomous fashion. We utilize the existing systems Tracy - The Mobile Agent System and The Tracy Domain Service. Tracy is a general purpose mobile agent system implemented in Java 2 at FSU Jena [6, 4]. The Tracy Domain Service is a mechanism to network multiple sub-networks of agencies [5]. Thereby, the full set of available agencies is split into disjunct domains (local sets of agencies) to achieve better scalability and an improved information handling potential within a possibly dynamical network environment (see also sec. 4.1). To implement this idea, we expand the existing agent system middle-ware by adding intelligent and adaptable system components and a basic information infrastructure to each agency (see sec. 4.2 to sec. 4.4).
Our main idea is to build and provide Dynamic Domain Maps [8, 18]. Such a map contains information on available Domain Nodes (agencies within the local domain) and their services, as well as on the quality of the available connections in-between these nodes (e. g., bandwidth, reliability, etc.). The data on the map is kept up-to-date at regular intervals. The amount of data to be included can be handled quite easily due to the limited number of nodes within a domain. These domain maps form the basic information infrastructure that enables our mobile agents to act and plan in an intelligent fashion.
Between domains, compressed (summarized) domain maps can be exchanged. Thus, each domain has at least a limited view of (neighboring) remote domains and mobile agents are, therefore, able to target services also in other, non-local domains.
Fig. 1 shows the system components available on each agency the agent may visit - the Map Module, the Route Planer, and the Migration Planer. The Map Module is designed to support the agent in choosing and locating task-relevant services within the Dynamic Domain Maps. The Route Planer can be used by the agent to plan an efficient path through the (local) domain or to modify the itinerary during the journey. The Migration Planer is a Tracy-specific module to minimize network load and migration time. It links to the capability of Tracy to optimize the actual technical migration process once an itinerary has been developed.
These tasks can be triggered by any agent on demand while residing in an agency, without having to program and replicate the respective algorithms into each single agent instance (mobile agents want to be slim and fast). How-
O
Agent
O
Map
Services?
Map

.....°
Itinerary
Map/Requirements
M i gr. S trat.
Itinerary
i
.■■■' A
Migration Planer
Route Planer Algorith.
Figure 1 : Major Infrastructure Components
ever, if and when an agent actually wants to use this type of infrastructure service remains within its own power of decision. Thus, it could also ignore the offered capabilities of this infrastructure completely and apply its private algorithms and rules.
4 Module Description
This section describes the major infrastructure components shown in fig. 1 in more detail, as well as the Tracy Domain Service which is an important basis for the framework.
4.1 Tracy Domain Service
An agent system network consists of an unstructured set of agencies. Generally such an agency is independent and not networked with other agencies. In Tracy the whole set of agencies is structured into domains, i. e. every agency belongs to exactly one domain and is no longer unknown to other local agencies. Within every domain there is a Domain Manager, which is a special agency responsible for administration tasks within the local domain. Every Domain Node (agency within the domain) needs to register with the Domain Manager which can, therefore, provide a list of known Domain Nodes and the services they offer.
On the left hand side of figure 2 you can see a local domain and its Domain Manager. Remote Domains are illustrated on the right hand side. Which of the many available domains is "local" depends for a specific agent simply on the fact on which agency it currently resides.
Multiple domains are networked via a Domain Master which acts as a Domain Manager for Domains (represented by the Domain Manager). Thus, the Tracy Domain Service is a two-stage concept: networked Domain Nodes within a local domain and networked domains within the web.
Domain Master
a map of remote domains (compressed domain maps).
In fig. 3 the data structure of a domain map is shown in more detail. Local and remote maps are structured identically, the difference is in the granularity of the data. We distinguish between Node Information and Line Information. Node Information contains a technical description for each node (e. g. supported protocols) and information on provided services. If we talk about the local domain map, the term "node" means a single agency. Otherwise, if we talk about remote domains, a node represents a complete domain, represented by its Domain Manager. Line Information contains data on important line characteristics (e. g. bandwidth).
Node Iiifornialion ..■.
....... V,
Figure 2: Tracy Domain Concept
4.2 Map Module
The starting point to build a local domain map is the list of nodes available within the domain, provided by the Domain Manager, and the relations between each one of them. This is the basic matrix structure we use for the map. An additional component is needed to collect detailed information on each node and on the network (line) characteristics, to fill the matrix with information. This component is the Network Monitoring Module [18].
The main problem within such a domain, and thus the complete network, is its possibly highly dynamical behavior and its heterogeneity. At anytime additional nodes may be added on while other nodes may drop out. The performance and line characteristics of nodes may be completely different, reaching from PDAs to high performance servers with good bandwidth. Actually measured values of line characteristics may oscillate quite intensively over time.
To get an up to date view of the current network situation, a lot of measurement experiments have to be done at short and regular time intervals. The more frequently, the better the data. However, network load is also increased and a good compromise has thus to be found.
The amount of data which has to be stored is increased by the number of nodes. If there should be information available on line characteristics for each node-to-node connection, the amount of required memory rises to the square with the number of nodes. This data can not be handled efficiently for a large set of unstructured nodes. The use of the Tracy Domain Service gives us thus the chance to limit the number of nodes to a fitting size (domains are limited in size, as they correspond usually to logical sub-networks).
As mentioned before, between domains there is a possibility to exchange data (domain maps) with compressed information. Of course, it is not so important to have an up-to-date map of remote domains as it is for the local domain. For this reason, and because of memory efficency, the map within the Map Module is split into a local domain map and
NodesList (DynamicList)
Node S	Node	Node
AgentServer i Properties ;	s AgentServer :: i Properties	:: AgentServer ! Properties
- :<ii-
zz
Node
AgenlServer Properties
lAne Infoniuuion -,
immmimmmmmmsmmsm
t

Matrix Elements (DynamicList)
apEntry MapEntry MapEntry
■
E
It
apEntry
Matrix Elemenis (DynamicList)
MapEntry		MapEntry		MapEntry	■iill	MapEntry
Matrix Elements (DynamicList)
MapEntr;		■MapEntry		M.i|i| nil,.		MapEntry
iiPil
M.ii]i\ I kiiKnl^ iD^ii.iiiii^I Isti
MapEntr)		MapEntry	it!	MapEntry	IBIII	\i.ipi iiiK
Figure 3: Map Data Structure
The Line Information part is mainly a connection matrix Mc:
' Cii Ci2 C21 C22
Mc =
\
\
/
with entries Cjj that are each a list of values vi,... ,Vk for the different line characteristics, representing the directed connection from node i to node j.
Map information is located on every agency. Within the matrix Mc, there is one row i (Line Information part) which represents the own local agency i. This row contains the line information entries to all known hosts within the local domain. The list values Vi within the entries are
extracted from measurement experiments.
The current values we provide for our agents planning purposes are always forecasts for the next expected result, based on a time-line of multiple existing measurements, and not simply the last available measurements. This helps to prevent an agent from extracting a possibly singular value that is wide off the usual scale and bases the agent's prognosis on a more solid and larger set of values.
To measure a certain line characteristic, we use software sensors. Such a sensor is located on every node and conducts experiments with the sensors located on other nodes within the same domain. If the local node is a Domain Manager, the sensor makes experiments also with remote Domain Managers. The precondition to do experiments is a list of known hosts, received from the Domain Manager or, on the domain level, from the Domain Master.
Within the matrix Mc there is only one row filled with entries by experiments of the local node. This line must be communicated to the other nodes within the local domain to fill their respective line structures, representing the remote matrices. We use the ongoing bandwidth experiments to transmit these data as well as Node Information. Once a local node has received data from all other nodes within the local domain, its local domain map is complete. The same procedure is done at the domain level to fill the map's structure for remote domains. As a result, every node (agency) has quite detailed information with regards to the local domain and a rough knowledge on known remote domains.
4.3 Route Planer Module
The Route Planer Module uses the Dynamic Domain Maps created by the Map Module. The task of the module is to generate a path through the network which can be used by the agent to visit all targets within nearly minimal migration time. Therefore, the agent has selected interesting nodes from the map or even has its own list of interesting agencies which will be a starting point to calculate the path through the net. For this calculation the module has to query the Map Module for line characteristics which can be used to weight connections in-between nodes.
The calculation for the whole trip can be done only, if the agent travels within a known area, like the local domain. If the agent visits remote domains, information is incomplete, i. e. the exact targets within the remote domains are not known yet and the line qualities within the remote domains are not available in detail. Hence, the agent should use the Route Planer again to re-calculate the path.
The route planing process itself is basically the Traveling Salesman Problem which is an NP complete type of problem. As a consequence, getting an optimal solution in practical application is ruled out. There are some heuristic algorithms (such as local search algorithm, genetic algorithm, simulated annealing, neural networks etc.) that have been applied extensively for solving such problems [12]. The comparative performance of the algorithms depends on the problem and the given detailed circumstances.
To be able to test different algorithms, the Route Planing Module has an interchangeable calculation component (see fig. 1). Our current results indicate that classical local optimization algorithms (2-Opt, 3-Opt, Lin-Kernighan) seem to be the best fit for our application ([13] compares various algorithms). Thereby, in order to improve a feasible tour it is modified by deleting some edges, thus breaking the tour into paths, and then reconnection those paths in another possible way [14]. Basically the algorithms in this family differ in the number of edges which will be exchanged: 2-Opt uses 2 edges, 3-Opt uses 3 edges, and Lin-Kernighan uses a variable number of edges.
These algorithms are designed to handle symmetrical distance/weight matrices. There are also algorithms for asymmetrical distance/weight matrices [11], which we will test in more detail at the moment.
4.4 IVTigration Planer IVIodule
This module plans the agent's actual migration process from a technical perspective. In contrast to the other modules, it is a very Tracy specific enhancement and will not be covered in any real depth in this paper.
Tracy provides adaptable Migration Strategies, i. e. the way how migrations are processed. In general, an agent consists of different parts: the agent's state and its variable data parts (the serialized agent), and the agent's byte code (several class files). To execute an agent on a remote platform, at least the state and the data part have to be transmitted. The byte code files have to be transmitted only, if they are needed on the remote platform and are not available there.
Based on this basic observation, different types of migration strategies are possible: push strategies, pull strategies, and its variations. A push strategy means to transmit byte code along with the agent or even before the agent arrives at the remote platform. The opposite is a pull strategy where the agent loads code on demand.
The purpose of the use of different migration strategies is to transmit only code units which are needed (see fig. 4) and thus to cause minimal network load [7]. The granularity level in this question ranges from a set of class files to single classes, or even to single methods (planned) [9].
The decision which type of migration strategy should be used in a given situation is mainly influenced by three facts: network characteristics, node properties/contents, and the agent's own characteristics and behavior.
An example for a network characteristic based decision would be an agent starting from a node which is not connected to the web all the time, like a dial-up host or a mobile device. It should be transmitted as a whole because a dynamical download from code parts on request is only possible if the node is online. In general, this holds for all types of instable connections.
For a first realization of the Migration Planner Module, we currently limit our efforts to network characteristic based decisions and a class based granularity. Due to this
Agent
	
	
	
Unit Migration
Unit Migration
NodeB
Node A
Node C
Figure 4: Migration of selected agent parts
limitations the module is also less Tracy specific and could be ported relatively easily to other available environments.
5 Conclusion
We have established an infrastructure of Dynamic Domain Maps as a basis to handle and structure a potentially slowly dynamical and very large network of agencies. This infrastructure enables mobile agents to navigate through a partially unknown network of nodes. With the help of additional modules at each agency, we enable mobile agents to plan an efficient itinerary and to adapt to changing environments, if they choose to do so. This solution respects each agent's autonomy, its independence and, at the same time, offers additional value and intelligence whenever necessary.
Tracy is by now a fully operational middle-ware for mobile agents that will be outsourced into a small company within the next year. The realization of the additional modules discussed in this paper (fig. 1 ) is ongoing research - the Map Module is already implemented and the Route Planer is well under way. We already started with the development of the Migration Planer based on the possibilities of Tracy's Migration Strategies. Furthermore, we plan to run more experiments to get an improved feeling for the potentials and possible drawbacks of our framework.
References
[1]	Tracy - The Mobile Agent System. http://tracy.informatik.uni-jena.de, 2002.
[2]	J. Baumann, F Hohl, N. Radouniklis, K. Rothermel, andM. Straßer. Communication Concepts for Mobile Agent Systems. In Rothermel [17], pages 123-135.
[3]	J. Bradshaw, editor. Software Agents. The MIT Press, MenloPark, CA, 1996.
[4]	P. Braun, J. Eismann, C. Erfurth, and W. Rossak. Tracy - A Prototype of an Architected Middleware to Support Mobile Agents. In Proceedings of the 8th Annual IEEE Conference and Workshop on the Engineering of Computer Based Systems (ECBS), Washington D.C (USA), April 2001, pages 255-260,2001.
[5]	P. Braun, J. Eismann, and W. Rossak. A Multi-Agent Approach To Manage a Network of Mobile Agent Servers. Technical Report 12/01, Friedrich-SchillerUniversität Jena, Institut für Informatik, 2001.
[6]	P. Braun, C. Erfurth, and W. Rossak. An Introduction to the Tracy Mobile Agent System. Technical Report Math/Inf/00/24, Friedrich-Schiller-Universität Jena, Institut für Informatik, Sept. 2000.
[7]	R Braun, C. Erfurth, and W. Rossak. Performance Evaluation of Various Migration Strategies for Mobile Agents. In Fachtagung Kommunikation in verteilten Systemen (KiVS 2001), Hamburg (Germany), February 2001, 2001.
[8]	C. Erfurth and W. Rossak. Characterization and Management of Dynamical Behaviour in a System With Mobile Agents. In H. Unger, T. Böhme, and A. Mik-ler, editors. Innovative Internet Computing System - Second International Workshop, IICS 2002, Kühlungsborn (Germany), June 2002, volume 2346 of Lecture Notes in Computer Science, pages 109-119. Springer-Verlag, 2002.
[9]	C. Fensch. Class Splitting as a Method to Reduce Network Traffic in a Mobile Agent System. Master's thesis, Friedrich-Schiller-Universität Jena, Institut für Informatik, 2001.
[10]	G. Gutin and A. P. Punnen, editors. The Traveling Salesman Problem and its Variations. Kluwer Academic Publishers, May 2002.
[11]	D. S. Johnson, G. Gutin, L. A. McGeoch, A. Yeo, W. Zhang, and A. Zverovitch. Experimental Analysis of Heuristics for the ATSP. In Gutin and Punnen [10], pages 445-488.
[12]	D. S. Jonhson and L. A. McGeoch. The Traveling Salesman Problem: A Case Study in Local Optimization. In E.H.L.Aarts and J.K.Lenstra, editors, Local Search in Combinatorial Optimization, pages 215310. John Wiley and Sons, Ltd., 1997.
[13]	D. S. Jonhson and L. A. McGeoch. Experimental Analysis of Heuristics for the STSP. In Gutin and Punnen [10], pages 369-444.
[14]	S.Lin. Computer Solutions of the Traveling Salesman Problem. Bell System Technical Journal, 44:22452269, 1965.
[15]	A. Omicini, E Zambonelli, M. Klusch, and R. Tolks-dorf, editors. Coordination of Internet Agents: Models, Technologies, and Applications. Springer-Verlag, 2001.
[16]	E. D. Pietro, O. Tomarchio, G. lannizzotto, and M. Villari. Experiences in the use of Mobile Agents for developing distributed applications. In Workshop su Sistemi Distribuiti: Algoritmi, Architetture e Linguaggi (WSDAAL'99), L'Aquila (Italy), September 1999, 1999.
[17]	K. Rothermel, editor. Proceedings of the First International Workshop on Mobile Agents (MA'97), Berlin (Germany), April 1997, volume 1219 of Lecture Notes in Computer Science, Berlin, 1997. SpringerVerlag.
[18]	S. Schreiber. Beschreibung und Analyse von dynamischen Netzen für Agentensysteme. Master's thesis, Friedrich-Schiller-Universität Jena, Institut für Informatik, Apr. 2002.
[19]	G. Vigna. Mobile Agents and Security, volume 1419 of Lecture Notes in Computer Science. SpringerVerlag, New York, 1998.
[20]	D.E.White. A Comparison of Mobile Agent Migration Mechanisms. Senior Honors Thesis, Dartmouth College, June 1998.
[21]	J. E. White. Mobile agents. In Bradshaw [3], pages 437^72.
User Profiling to Support Internet Customers: What Do You Want to Buy Today?
Giovanni Semeraro, Marco Degemmis and Pasquale Lops
Computer Science Department, Università' di Bari, Via Orabona 4, Bari, Italy
semeraro@di.uniba.it, degemmis@di.uniba.it, lops@di.uniba.it
Keywords: User Profiling, Personalized Services, Intelligent Search Received: July 27, 2002
In the recent years, the astonishing growth of the Internet and the considerable advances of Web technologies have promoted the development of electronic commerce. While e-commerce has not necessarily allowed businesses to produce more products, it has allowed them to provide consumers with more choices. Instead of tens of thousands of books in a superstore, consumers may choose among millions of books in an online store. Increasing choice has also increased the amount of information that scrupulous customers want process before they are able to select which items meet their needs. One way to address this information overload is the use of personalized systems able to support customers in retrieving information about products they are really interested in. Personalization has become an important strategy in Business-to-Consumer electronic commerce, where a user explicitly wants the e-commerce site to consider his or her own information, such as preferences, in order to improve access to relevant product information. In this paper, we propose a scheme to learn user profiles to support Internet customers. The proposed scheme is designed to handle different levels of users' interests simultaneously. Experimental evaluations show the promise of the approach.
1 Introduction
In the era of Internet, huge amount of data are available to everybody, in every place and at any moment. Even though this is extremely useful and exciting, the ever-growing amount of information at disposal generates cognitive overload and even anxiety, especially in novice or occasional users. Consumers have to spend more time to browse the net in order to find the information needed. Sometimes the contents of the Web pages are irrelevant to the consumers' expectation, but they have to read them in order to filter and get what they really want. The main challenge is to support web users in order to facilitate navigation through web sites and to improve searching among the extremely large Web repositories, such as Digital Libraries, online product catalogues or other generic information sources. A possible way to overcome this problem is the development of intelligent systems to provide personalized information services [Schäfer et al., 2001]. A remarkable example arises in e-commerce Internet sites (marketplaces, electronic shops, and others), which provide information on thousands, even millions, of products and services and where it is well known that the process of buying products and services often implies a high degree of complexity and uncertainty. Effectively supporting user search and browsing over such large repositories entails the problem of properly understanding user needs, filtering out not relevant items, helping the user to formulate the most appropriate queries, and presenting results ranked according to their presumed relevance. The complexity of the problem
could be lowered by the automatic construction of machine processable profiles that can be exploited to deliver personalized content to the user, fitting his or her personal choices. This is called user modelling process.
This paper discusses a method to model customers' preferences to offer personalized Internet services. Our approach relies on a two-step profiles generation process: in the first step, the system learns coarse-grained profiles in which the preferences are the product categories the user is interested into. In the second step, the profiles are refined by adding a probabilistic model of each preferred product category, induced from the descriptions of the products the user likes in these categories. The final outcome of the process is a more specific fine-grained profile able to discriminate between interesting and uninteresting products for the user.
The remainder of the paper is organized as follows. Section 2 discusses how the need to acquire new customers has made web personalization an indispensable part of ecommerce and how intelligent techniques are needed to achieve effective personalization. In Section 3, after introducing the use of learning techniques to construct models of users' preferences, a detailed description of the two-step profile generation process is provided. Section 4 presents some experimental results. Finally, conclusions and future works are drawn in Section 5.
2 Background
2.1 User profiling for web-content personalization
Internet services have evolved rapidly, leading to a constantly increasing number of modern Web sites. Enterprises are developing new business portals and providing enormous amounts of product information, which in many cases is heterogeneous, not structured and needs to be dealt with in a personalized way. It is particularly relevant to give the customers personal advice which reflects their individual needs and interests. Personalization has become an important strategy in Business-to-Consumer e-commerce, where a user explicitly wants the e-commerce site to consider his or her own information, such as preferences, in order to improve access to relevant products. A key issue in web site personalization is the automatic construction of accurate machine processable user profiles. This process is called user modelling and consists in ascertaining a few bits of information about each user, processing that information quickly and providing the results to applications, all without intruding upon the user's consciousness. User modelling is nothing more than a fancy term for automated personalization. Any application that behaves differently for different users employs a user model. The models themselves can be big or small, complex or simple, rich or sparse. They often have different names: personality profiles, psychographic profiles, or consumer databases. They are collections of information about an individual. Such collections of information are at best embryonic precursors of an ideal user model, which would possess an intimate and thorough knowledge of the user it refers to. In short, the user model should be able to recognize the user, know why the user did something, and guess what he or she wants to do next. Profiles could be used to deliver personalized content to the user, fitting his or her personal choices.
According to [Tasso and Omero, 2002], the main advantages of using the one-to-one personalization paradigm based on user profiling are:
-	Making the site more attractive for users - A web site that takes into account user preferences is able to make recommendations reflecting user needs. Specifically in the e-commerce area, this will probably turn a significant part of browsers into buyers;
-	Obtaining trust and confidence - Users will not be requested to explicitly insert information concerning their preferences, tastes, etc., but they will be able to participate in the management and updating of their personal profile. This will result in an increase of trust and confidence in a system able to automatically collect data about their preferences;
-	Improving loyalty - The effectiveness of a personalization system improves in the long run. Every time a user interacts with the web site, the personalization
mechanism collects new data about his or her preferences, so that a more and more satisfactory service can be offered. Even if a competitor uses a personalization system, it has to learn a lot of information about the new customer to be able to offer the same satisfactory service.
Among issues the personalization community is dealing with, the acquisition of a good user model is of special importance.
Machine learning techniques represent a very promising solution by which personalization in adaptive systems can be achieved. In fact, they have proved successful in cases where large data sets were available by providing tools for retrieval and filtering useful information and they applied to the definition of models of users interacting with an information system [Webb et al., 2001],
2.2 Related work
During recent years several systems have been designed to offer personalized services and to deliver user-tailored Web content. In this context, various learning approaches have been applied to discover user preferences and construct user profiles. A text categorization method is adopted by Mooney and Roy [Mooney and Roy, 2000] in their LIBRA system. It makes content-based book recommending by applying automated text categorization methods to product descriptions in Amazon.com, using a naive Bayes text classifier.
A similar approach is adopted by Syskill & Webert [Pazzani and Billsus, 1997], which tracks the users browsing to formulate user profiles. The system identifies informative words from Web pages to be used as boolean features and learns a naive Bayesian classifier to discriminate interesting Web pages on a particular topic from uninter-esfing ones.
Data mining methods are used by the l:lPro system [Adomavicius and Tuzhilin, 2001] in order to construct individual profiles made up of two sections. One part of the profile contains facts about a customer, and the other part is made up of rules describing the customer's behavior. The behavioral part of the profile is derived from transactional data, representing purchasing and browsing activities of each user.
A multistrategy machine learning approach is adopted in [Billsus and Pazzani, 1999] for the induction of accurate interest profiles that consist of separate models for long-term and short-term interests. This strategy is applied in designing an agent that learns about a user's interests in daily news stories. The use of this approach is due to the fact that the model must be capable of representing a user's multiple interests in different topics and must be flexible enough to adapt to a user's changing interests reasonably quickly, even after a long preceding training period.
3 Learning User Profiles for Intelligent Information Access
3.1 Exploiting Learning Techniques to construct models of users' preferences
By user profile we mean all the information collected about a user that logs to a Web site, in order to take into account his or her needs, wishes, and interests. Roughly, a user profile is a structured representation of the user's needs which a retrieval system could exploit in order to autonomously pursue the goals posed by the user. In a user profile modelling process, we have to decide what has to be represented and how this information is effectively represented. Generally the information stored in a user profile can be conceptually categorized in several classes, according to their source:
-	Registration Data, such as name, address, email address, phone number, title, etc., useful to better serve the customer, for example to contact him or her either electronically or otherwise. Systems using personal data must abide to privacy laws [Kobsa, 2001];
-	Questions & Answers (Q&A), that reveal a set of topics of possible interest for the customer;
-	Legacy Data, gathered from external data sources such as CRM systems or ERP systems, etc.;
-	Past History, that is data related to the browsing activity of users, or data stored in databases such as which of registered online customers responded to an e-mail campaign;
-	3rd Party, gathered from marketing databases, demographic analysis, etc.;
-	Current Activity, that contains the set of actions performed by the customer in the current session. This gives the flexibility and responsiveness to address rapidly changing customer needs.
A user profile is given by a list of attribute-value pairs, in which each attribute is given the proper value on the ground of the specific user it refers to. Each attribute-value pair represents a characteristic of that user. The list of attributes must be finite as well as the possible values related to each attribute. Examples of attributes in that list are: last name, first name, age, address, job, annual income, preferences, etc. The attribute list is the same for all the users. These attributes or features can be divided into three categories:
-	Explicit, whose values are given by the user himself or herself (Registration Data or Q&A) ;
-	Existing, that can be drawn by existing applications (e.g. job);
- Implicit, are elicited from the behavior of the user, through the history of his or her navigation or just from the current navigation.
A simple approach to acquire user preferences is the manual construction of a user profile: buyers have to fill an initial form that asks for personal data and some specific information (such as product categories of interest among the list of categories available in the store). In this way, only a limited amount of information can be acquired (customers might not be able or willing neither to fill large forms nor to provide personal details and preferences). The main problem of this process is its dependency on users willing to update their preferences. If users do not remember or do not want to spend their time in updating preferences, the personalization service will exploit unreliable or wrong data.
For these reasons, we adopt an approach that dynamically updates the user model by considering data recorded on past visits to the store (transactions). The approach uses rules - describing the behavior of a user - learned from customers' transactional data to construct individual profiles.
Profiles generated from a huge number of transactions tend to be statistically reliable.
In this paper we try to combine the rule-based approach with a text categorization method applied to semi-structured text - the product descriptions [Mooney and Roy, 2000] - in a two-step generation process in order to build more detailed user profiles. The two steps are described in the next subsections.
3.2 Discovering user preferences from transactional data
The first step of the profiling extraction process we propose is performed by the Profile Extractor (Figure 1), a module built upon an intelligent middleware component, called Learning Server, developed in the context of a digital library service [Semeraro et al., 2000, Semeraro et al., 2001]. It has in charge the induction of the rules to create the coarse-grained profiles containing the product categories the user is interested into.
The Profile Extractor employs supervised learning techniques to dynamically discover users' preferences from transactional data recorded during past visits to the ecommerce web site. Preferences are stored in a customer profile that could be useful to generate individual recommendations.
From our point of view, the problem of learning user's preferences can be cast to the problem of inducing general concepts from examples labelled as members (or non-members) of the concepts [Mitchell, 1997]. In this context, given a finite set of categories of interest C= {ci, C2, ..., c„}, the task consists in learning the target concept Ti "user interested in the category ci ".
In the training phase, each user represents a positive example of users interested in the categories he or she likes and a negative example of users interested in the categories he or she dislikes.
Moreover, we chose an operational description of the target concept Ti, using a collection of rules that match against the features describing a user in order to decide if he or she is a member of Ti.
In the COGITO project, the system was tested on the German virtual bookshop of the Bertelsmann Online company (www.bol.de). In this context, the set of categories of interest (translation in English of the German word is reported in parenthesis) is C= {Belletristik (Fiction), Computer_und_Internet (Computer and Internet), Kinderbücher (Children's Books), KulturjundjGeschichte (Culture and History), Nachschlagewerke (Reference Books), Reise (Travel), Sachbuch_und_Ratgeber (Monographs and Guidebooks), Schule_und_Bildung (School and Education), Wirtschaft_und_Soziales (Economics and Lmw), Wis-senschaft_und_Technik (Science and Technique)}. The members of C are the ten main book categories the BOL product database is subdivided into and represent the preferences of the users accessing the BOL Web site. Moreover, we needed to establish a formal description of the features (attributes) that represent each example. The complete set of attributes is listed in Table 1.
Transactional data about customers, stored in an XML file (Users' History), are used to set up the examples in order to train the system. A domain expert classifies each instance as member or nonmember of each book category, depending on the values of the attributes.
The architecture of the Profile Extractor (Figure 1) is made up of several sub-modules:
-	XML I/O Wrapper - it extracts from the Users' History the data required to set up the instances used to train the learning component;
-	Profile Rules Extractor - it processes the instances and induces a classification rule set for each book category. An example of a classification rule set inferred for a specific category of the BOL Web site is presented in Figure 2. The core of the module is WEKA [Witten and Frank, 1999], a tool that provides implementation of state-of-the-art learning algorithms, developed at the University of Waikato (New Zealand) and written in Java. The learning algorithm adopted in the rule induction process is PART [Frank and Witten, 1998], that produces rules from pruned partial decision trees, built using C4.5's heuristics [Quinlan, 1993].
-	Profile Manager - it uses the rule sets inferred by the Profile Rules Extractor to predict whether a user is interested in each book category. All these classifications, together with the user's transactional data, are gathered to form the user profile. It is composed of two main frames: factual, containing personal and transactional data, and behavioral, containing the learned preferred book categories ranked according to the degree of interest computed by the learning system (Figure 3).
In the COGITO context, user profiles are used to personalize the search in the BOL product database according to the user's interests [Abbattista et al., 2002].
3.3	Exploiting the profiles for intelligent search
In the COGITO prototype, the coarse-grained profiles are exploited by a natural language assistant named Susanna. It offers a better support to customers using the BOL search engine to find interesting books. This improves the usability of the BOL web site, as demonstrated in the following scenarios.
Scenario I : unknown user
A user is known by the COGITO system if he or she completes the BOL registration procedure. This step provides each customer with a personal identification number that is necessary both for recognizing a user and for collecting data on his or her preferences and for generating or updating his or her profile.
In the first scenario, an unknown user asks the chatterbot for a book by the author named King. Susanna finds several books through a remote call (deep linking) to the search engine available on the BOL Web site, and displays them as shown in Figure 4.
Notice that the books that rank at the top aie authored by Stephen King. Books by other authors are found further down the list, which means that the user must scroll down a long list if he or she is not looking for a book by Stephen King. The customer not looking for a book by Stephen King can now choose to either refine the search by using an advanced search function or continue to chat with Susanna about different fields of interest.
Scenario 2: registered user
In the second scenario, the user has already been chatting to Susanna about some of his or her interests. Therefore, a profile of this user is available to the system, which can exploit it to accomplish a more precise search in the product database. Consider that the profile of such a user is the one presented in Figure 3 and the query submitted by the user is the same as in the previous scenario.
Now, the first book displayed is a book about Windows 2000 co-authored by Robert King (Figure 5). This result is due to the fact that the original query about King has been automatically expanded into "King AND Com-puter&Internet" (highlighted by the circle in Figure 5), since Computer_und_Internet is the category with the highest degree of interest in the profile of the user (Figure 3). This process is called query expansion. These scenarios highlight the dependence of the result set on the profile of the user that issued the query.
3.4	Refinement of profiles by learning from textual descriptions
The profiles inferred by the COGITO system are coarsegrained: they contain the book categories preferred by
a user. Our intention was to enhance the profiles by taking into account the user's preferences in each category, in order to achieve more precise book recommendations. Thus, we adopted a probabilistic learning algorithm to classify the textual descriptions of the books, the naive Bayes classifier [Mitchell, 1997, Sebastiani, 2002]. Naive Bayes has been shown to perform competitively with more complex algorithms and has become an increasingly popular algorithm in text classification applications [Pazzani and Billsus, 1997, McCallum and Nigam, 1998].
Our prototype, called ITem Recommender (ITR), is able to classify books belonging to a specific category as interesting or uninteresting for a particular user: for example, the system could learn the target concept "book descriptions the user finds interesting in the category Computer and Internet".
Bayesian reasoning provides a probabilisfic approach to inference. It is based on the assumpfion that the quantities of interest are governed by probabilistic distributions and that optimal decision can be made by reasoning about these probabilities together with observed data.
In our learning problem, each instance is represented by a set of slots. Each slot is a textual field corresponding to a specific feature of a book. The slots used by ITR are: title, authors and textual annotation. A simple pattern-matcher, the Item Extractor (Figure 6), analyzes the book descriptions and extracts the words, the tokens to fill each slot (it also eliminates stopwords and applies stemming). The text in each slot is a collection of words (a bag of word, BOW) processed taking into account their occurrences in the original text. Thus, each instance is represented as a vector of three BOWs, one for each slot. Moreover, each instance is labelled with a discrete rating (from 1 to 10) provided by a user, according to his or her degree of interest.
According to the Bayesian approach to classify natural language text documents, given a set of classes C= {ci,c2, ..., C|c|}, the conditional probability of a class Cj given a document d is calculated as follows:
To calculate (1), we need to estimate the probability terms P{cj) and P{tk\cj,Sm), from the training data, where each instance is weighted according to the user rating r:
r-l
w, =
w:
= l-wi
(2)
The weights in (2) are used for estimating the two probability terms according to the following equations:
\TR\
E
= w
P{tk\Cj,S,n) =
\TR\ ^ E w'jUkir.
i=l
\TR\
E wiK. i=l
(3)
(4)
In (4), Tikim is the number of occurrences of the term tk in the slot Sm of the instance, and the denominator denotes the total weighted length of the slot in the class cj.
This approach allows for the refinement of the profiles by including those words that turn out to be most indicative of user preferences for each preferred book category the system was trained on. An example of a fine-grained profile obtained by rating books about "Computer and Internet" is given in Figure 7.
The extraction phase from the BOL site is performed by the Item Extractor, and produces a local database of book descriptions. A profile-driven interface to the database has been developed in order to test the effectiveness of the finegrained profiles in retrieving interesting items. When a user submits a query q to the system, the products in the result set Rq are ranked by the preferred category, as in the COGITO retrieval process, and by the classification value
In our problem, we have only 2 classes: c+ represents the positive class (user-likes, corresponding to ratings from 6 to 10), and c_ the negative one (user-dislikes, ratings from 1 to 5). Since instances are represented as a vector of three documents, one for each BOW, the conditional probability of a category Cj given an instance di is computed using the formula:
m=l fc=l
(1)
where S= {si, S2, ..., S|5|} is the set of slots, bim is the BOW in the slot Sm of the instance di, Ukim is the number of occurrences of the token t k in bim.
P{c+\d) deRg
computed according to equation (1). This kind of profile gives a more precise products ranking with respect to the one returned by the coarse-grained profiles (Figure 8).
4 Experimental Results
Two different experiments were performed: the former consisted in observing the accuracy of ITR, the latter was conducted to evaluate the combination of the COGITO profiles with the ITR ones. For both experiments, 5 book categories at uk.bol.com were selected: for each category, the system has been trained by a specific COGITO user that rated approximately 90 books. In this way, a dataset of roughly 450 classified instances is obtained (Table 2).
The dataset was analyzed by means of a 10-fold cross-validation and several metrics were used in the testing
phase. In addition to Precision (Pr), Recall (Re) and F-measure (F), we also adopted:
Normalized Distance-based Performance Measure (NDPM) - the distance between the ranking imposed by the user ratings and the ranking predicted by the systein. Values range from 0 (agreement) to 1 (disagreement) [Yao, 1995];
Spearman's Rank Correlation (Rs) - a statistic measure used to establish whether there is any correlation between two sets of data. Its value falls between -1 and 1. A correlation coefficient of 0.3 to 0.6 is considered as moderate and above 0.6 is considered strong;
Error (E) - li is calculated as an average of the absolute difference between the user ratings and those predicted by the system.
Results of the experiment are reported in Table 3.
Values of Pr, Re and F provide evidence that ITR system produces accurate recommendations. NDPM is fairly consistent, while looking at Rs we observe that there is at least a moderate correlation for each category.
In the second experiment, each user was requested to submit 3 different queries to ITR. Then, a feedback is given to the system by rating the 20 top ranked books in each result set. The experiment has been modelled on the basis of two different scenarios. In the first scenario, books are ranked according to the COGITO profile (by category only), whereas in the second scenario the ranking is performed using the COGITO profile integrated with the ITR one. For both scenarios feedback evaluation results are given in Table 4.
For pairwise comparison of methods, the non-parametric Wilcoxon signed rank test is used [Orkin and Drogin, 1990], since the number of independent trials is relatively low and does not justify the application of a parametric test, such as the t-test. In this experiment, the test is adopted in order to evaluate the difference between the effectiveness of the different profiles by means of the metrics pointed out in Table 4, requiring a significance level p <0.05.
On the basis of the values of the W statistic calculated above, we can deduce that there is a consistent statistically significant difference in performance among the two different profiles.
5 Conclusions and Future Work
In this paper we discussed the potential influence of using customers' profiles in retrieving relevant items in extremely large Web repositories, such as online catalogues. The success of the retrieval process can be measured in terms of the percentage of relevant and extraneous information retrieved. It is difficult to identify qualitatively the effectiveness of the retrieval process because only an individual user can determine what is truly relevant.
One way to address this issue is the use of techniques that automatically learn a personal profile containing a user's
preferences by analyzing his or her browsing and purchasing history. In our approach, the added value of the profiles is to provide an intelligent search support in retrieving information customers are really interested in.
In this work, we propose a two-step profile generation process: in the first step, the system learns coarse-grained profiles in which the preferences are the product categories the user is interested into. In the second step, the profiles are refined by a probabilistic model of each preferred product category, induced from the descriptions of the products the user has labelled as interesting. The final outcome is a more specific fine-grained profile able to rank products according to the preferences it contains. Experimental results demonstrate the effectiveness of the strategy proposed.
In each classification problem, it is necessary to give many pre-classified examples, many of which positive, in order to leam a good classifier. This pre-classification process is often expensive and noise because it is done by a user who must classify each instance chosen to train the classifier, as in the second step of our profiling process. A possible solution for avoiding this hand-labelling job is using unlabelled examples in the learning phase maintaining the same classification process. The underlying idea is to use an algorithm for learning from few labelled examples and a large pool of unlabelled examples. In the future, we are planning to combine the Expectation-Maximization (EM) [Dempster et al., 1977] technique with the naive Bayes classifier as in [Nigam et al, 2000]. EM is a class of iterative algorithms for maximum likelihood estimation in problems with incomplete data. The algorithm first trains a classifier using the available labelled documents, and probabilistically labels the unlabelled documents. It then trains a new classifier using the labels for all the documents, and iterates. In this way, a method for incorporating unlabelled data into supervised learning process is provided.
Acknowledgement
The COGITO project is funded by the European Commission under contract 1ST-1999-13347. The consortium is led by FhG-IPSI and comprises the following partners: Logica pdv Untemehmensberatung GmbH (Hamburg, Germany), BOL Medien GmbH (Rheda-Wiedenbrück, Germany), the System Analysis Department at Ris0 National Laboratory (Roskilde, Denmark), the LACAM Laboratory at the University of Ban (Bari, Italy), and Sword ICT S.r.l. (Bari, Italy).
References
[Abbattistaetal., 2002] Abbattista, F., Degemmis, M., Licchelli, O., Lops, P., Semeraro, G., and Zambetta, F. (2002). Improving the usability of an e-commerce web site through personalization. In Ricci, F. and Smith, B., editors. Recommendation and Personalization in Ecommerce, 2"'' International Conference on Adaptive Hy-
permedia and Adaptive Web Based Systems, pages 2029.
[Adomavicius and Tuzhilin, 2001] Adomavicius, G. and Tuzhilin, A. (2001). Using data mining methods to build customer profiles. IEEE Computer, 34(2):74-82.
[Billsus and Pazzani, 1999] Billsus, D. and Pazzani, M. J.
(1999).	A personal news agent that talks, learns and explains. In Etzioni, O., Müller, J. P., and Bradshaw, J. M., editors. Proceedings of the International Conference on Autonomous Agents (Agents'99}, pages 268275, Seattle, WA, USA. ACM Press.
[Dempster et al., 1977] Dempster, M. M., Laird, N. M., and Jain, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of Royal Statistical Society: Series B, 39:1-38.
[Frank and Witten, 1998] Frank, E. and Witten, I. (1998). Generating accurate rule sets without global optimization. In Proceedings of the International Conference on Machine Learning, pages 144-151. Morgan Kaufmann.
[Kobsa,2001] Kobsa, A. (2001). Tailoring privacy to users' needs. In M.Bauer, Gmytrasiewicz, P., and Vas-sileva, J., editors. User Modeling, volume 2109 of Lecture Notes in Artificial Intelligence, pages 303-313. Springer, Berlin.
[McCallum and Nigam, 1998] McCallum, A. and Nigam, K. (1998). A comparison of event models for naive bayes text classification. In Proceedings of the AAAI/ICML-98 Workshop on Learning for Text Categorization, pages 41^8. AAAI Press.
[Mitchell, 1997] Mitchell, T. (1997). Machine Learning. McGraw-Hill, New York.
[Mooney and Roy, 2000] Mooney, R. J. and Roy, L.
(2000).	Content-based book recommending using learning for text categorization. In Proceedings of the ACM Conference on Digital Libraries, pages 195-204, San Antonio, US. ACM Press, New York, US.
[Nigam et al., 2000] Nigam, K., McCallum, A. K., Thrun, S., and Mitchell, T. M. (2000). Text classification from labeled and unlabeled documents using EM. Machine Leammg, 39(2/3): 103-134.
[Orkin and Drogin, 1990] Orkin, M. and Drogin, (1990). Vital Statistics. McGraw-Hill, New York.
R.
[Pazzani and Billsus, 1997] Pazzani, M. and Billsus, D. (1997). Learning and revising user profiles: The identification of interesting web sites. Machine Learning, 27(3):313-331.
[Quinlan, 1993] Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann.
[Schafer et al., 2001] Schafer, J., Konstan, J., and Riedl, J. (2001). E-commerce recommendation applications. Data Mining and Knowledge Discovery, 5( 1/2): 115153.
[Sebastiani, 2002] Sebastiani, F (2002). Machine learning in automated text categorization. ACM Computing Surveys, 34(1): 1-47.
[Semeraro et al., 2000] Semeraro, G., Esposito, F., Fanizzi, N., and Ferilli, S. (2000). Interaction profiling in digital libraries through learning tools. In Borbinha, J. and Backer, T., editors, Research and Advanced Technology for Digital Libraries, volume 1923 of Lecture Notes in Computer Science, pages 229-238. Springer, Berlin.
[Semeraro et al., 2001] Semeraro, G., Ferilli, S., Fanizzi, N., and Abbattista, F. (2001). Learning interaction models in a digital library service. In M.Bauer, Gmytrasiewicz, P., and Vassileva, J., editors. User Modeling, volume 2109 of Lecture Notes in Artificial Intelligence, pages 44-53. Springer, Berlin.
[Tasso and Omero, 2002] Tasso, C. and Omero, R (2002). Personalization of web content: e-commerce, i-access, e-government (in Italian). Franco Angeli, Milano.
[Webb et al., 2001] Webb, G. L, Pazzani, M., and Billsus, D. (2001). Machine learning for user modeling. User Modeling and User-Adapted Interaction, 11:19-29.
[Witten and Frank, 1999] Witten, I. H. and Frank, E. (1999). Data Mining: Practical Machine Learning Tools and Techniques with Java Implementations. Morgan Kaufmann Publishers, San Francisco.-
[Yao, 1995] Yao, Y. Y. (1995). Measuring retrieval effectiveness based on user preference of documents. Journal of the American Society for Information Science, 46(2):133-145.
Attribute	Description
User_id	Unique identifier of each user
Access_date	Identifies the date of the last access performed by the user
Connections_num	Total number of connections to the site performed by the user
S&axc.\\_n\im<CategoryNanie>	Number of searches for a specific category
Search_freq< CategoryName >	Frequency of searches for a specific category
Vuvcha.sQ_R\im<CategoryName>	Number of purchases for a specific category
V\ivc\\SiSe._ive(\<CategoryName>	Frequency of purchases for a specific category
Table 1: Description of the attributes used to represent examples. <CategoryName> denotes each one of the ten main book categories of the BOL Web site.
Category	Book descriptions	Books with annotation	Avg. annotation length	User Id
Computer & Internet	5414	4190 (77%)	42.39	Userl
Fiction & literature	6099	3378 (55%)	35.54	User2
Travel	3179	1541 (48%)	28.29	User3
Business	5527	3668 (66%)	42.04	User4
SF, horror & fantasy	667	484 (72%)	22.33	User5
1 Total	1 20886	1 13261		1 1
Table 2: Dataset information.
1 Category	1 Pr 1	Re	1 F 1	NDPM 1	Rs	1 E 1
Computer & internet Fiction & literature Travel Business SF, horror & fantasy	0.8500 0.5971 0.8100 0.7364 0.4695	0.5476 0.7033 0.8900 0.6800 0.7833	0.6660 0.6459 0.8481 0.7070 0.5871	0.3241 0.4458 0.3322 0.3741 0.3583	0.5499 0.0676 0.4683 0.3466 0.3970	0.3498 0.3489 0.2885 0.3576 0.4105
1 Avg.	1 0.6926 !	0.7209	1 0.6909 1	0.3670	1 0.3659	1 0.3611 1
Table 3: Results of the 10-fold cross vaUdation.
User
Query
Pr
NDPM
Rs
COGITO ITR COGITO ITR COGITO ITR
1	Java	0.50	0.90	0.594	0.423	-0.288	0.300
1	Graphics	0.30	0.70	0.465	0.328	0.156	0.490
1	Security	0.80	0.75	0.636	0.410	-0.412	0.278
2	Realism	0.35	0.50	0.421	0.400	0.258	0.329 2	romanticism	0.60	0.55	0.505	0.636	-0.053	-0.362
2	Science fiction	0.65	0.55	0.468	0.476	0.042	0.109
3	Islands	0.65	0.90	0.600	0.536	-0.288	-0.136 3	Guides	0.40	0.60	0.539	0.694	-0.130	-0.581
3	restaurants	0.30	0.35	0.505	0.415	0.037	0.338
4	Business manager	0.35	0.60	0.513	0.494	-0.074	0.018 4	enterprise solution 0.20	0.30	0.365	0.292	0.405	0.595
4	investment	0.50	0.70	0.547	0.605	-0.118	-0.312
5	s_king	0.30	0.60	0.589	0.197	-0.261	0.806 5	Space	0.10	0.40	0.447	0.184	0.178	0.839
_5_ßng_0.70	1.00	0.550	0.326	-0.154	0.517
Avg. W=
0.45
130
0.63 0.516 0.428 -0.047 0.215
-74
72
Table 4: Results of the comparison between the COGITO and the ITR profiles.
P«r&onaI details

Irrtjeractwn <(«(«s

11				
M				■SliSsÄKSiii^i»
	Proni* Rut M	ProtSìe P/bragef	Usage Paclems Extractor .....i..........^:^:	Vlir«ka Leamins SuK« XM. comptian
Figure 1: The Architecture of the Profile Extractor. In addition to the profihng sub-modules, the figure shows the Usage Patterns Extractor, that groups usage sessions in order to infer usage patterns exploited for understanding trends useful to further market studies.
There are 11 rules extracted for class Kinderbücher:
if search_number_Kinderbucher > 1.0
and purchase_number_Schule_Und_Bildung <= 38.0 and search_freq_Schule_Und_Bildung <= 0.26 then Class: yes
else if purchase_freq_Kinderbucher > 0.24
and purchase_number_Computer_Und_Intemet <= 29.0 and purchase_number_Computer_Und_Intemet> 5.0 then Class: yes
else if purchase_freq_Kinderbucher > 0.25
and purchase_number_Schule_Und_Bildung <= 24.0 and purchase_number_Kultur_Und_Geschichte <= 18.0 then Class: yes
else if search_num_Kinderbucher <=1.0 and search_number_Reise <=3.0 and search_freq_Nachschlagewerke <= 0.33 and purchase_freq_Nachschlagewerke >0.14 and search_number_Reise <= 2.0 and search_number_Sachbuch_Und_Ratgeber > 0.0 then Class: no
end if
Otherwise Class: no
Figure 2: An example of classification rules for the book category "Kinderbücher"
416 Informatica 26 (2002) 407-418
G. Semeraro et al.
Profile for User: 11
CONNECTIONS NUM	II 23 1		jBelletmtìk	i|yes|!0J)902:|iu) |0i]098l	
'sEAIlCH_NI]™eÌÌetristÌB£	i 3 1		[c«inputer_imd_Intemet		(itoljoJD j
SEARCH_FREQBelletristìk	.JD^J		[Kindeifaucher	JI^M :	jno.flX)
PURCHASE NimffiellelrUtìk	II 23 1		[Kultur und Geschichte	i|yes:l07902	|na|0:2098
PURCHASE FREQBeUetristìk	Ijo^si		jNachschlagewerke	j|^s,|ojo_ ;	[nojlJO ,
SEARCH_NUMCoiii9Uter_imđ_Iiiteme4	il 1 1		jReùe	j|yesjl0X)038'	lnoi|05962
SEARCH_FEEQ Coii^utEi_md_hilemet	JIMJ		|SatJibui;h_und_Ra1gebei	i|yes|i0j5702.	|iuii|0:3298
PURCHASE_NUMCoii¥uter_imd_Inteiii®l	Il '1		1 SchuleundBildung	jbsjiOJJ.........	|n«i!|lJO J
) PtIRCHASE_FREQCon^<iter_iuiđ_Ijitemet	j|0.24i		|röitscliiilit_uiid_Recht	jlyesiiOi)	IndjlX)
SEARCH NUMKindeifcucher	II 0 i		iTOssenjchaftjmdjrechmkj lyes ; io i)		|no:|IjD
ilcr-nrH	il n ;				
Figure 3; A COGITO coarse-grained profile.
Siglala
ffc >teana	ftrfa« atuxaU ?______jQ
Vorschlage:
Suchergebnlss
Die Treffer Ihrer Suchanfragenack iVutor Kìriq (b-jiinhalt«nd) Di« Ergsbniis« sind lorti^rl R«l9vsn?.
aiqckHou»8
A Nc-^tfl
Autori: ^ptgphft» Kir>0 & Parar Stratjl:) Gebunden | Harper Cofins, Lordon | 20Qi Preis: C25.S1 Haute varsartdfértig.
Duddlrs. tìmftmcAXcMM Rfltniin. Aus d, ArrtóiiUn. v. JCiChof» Schwei; Auton s^ep^ep i^m iGebunden | Ullstein. Bb. 1 2001 Preii: C2S.D0 Heule vsrs3f5dli3fOq. mehr '•'wmwwtwwBTj i
é] VttWKeiltcM«

« —
Figure 4: Susanna offers a long list of books belonging to several categories by authors whose last name is King.
I II II, ■■■■■■iiiiii^^ . jm

• läa dì «e™ ffli'rf^«	-'Ql
J
Suchergebnisse • hefertaneJltej
Ol» TrefTtr üirer Suchanfrage nadkT'Äutor: King (b^inhialt^nd)
Computer Ek Internat
Dìo Ergebnis« lind lortisrt osch Ralovtnt.
Vorschläge:
with W-ROrn
Autoren: Hübcrt King ft (^arVvgpv^fHj,; Gebunden | wsley, x>hn & SONS LT | 2D0i Preis: C 70.60
VetičTdfertig in2 Wochen.^ mehr...	Ì
Komporientan'Etitwìc5;!ung fur verteilte Anwendungen, (r
BAutoren: £gCiB£-5bgati9r,g & GiC^Uliog
Gelwnden | MICROSOFT PfiESS OEUTSCHLANO | 1999
Preis: C4S.90
Hdulö VersandCertig. mehr.,, .........

Figure 5: List of books by authors whose last name is King and that belong to the book category "Computer & Internet"


g
hl,'
........Title .... I
{jm:l, begin
Mil_11

— hutlicis I {l._schildt:l},,
	(tecliiijobg:l,
Aiiiotatioiis 1	
	
Figure 6: The BOW extraction process.
User ID: 117 Catego.iy: Computer & Internet
Class Priors:
P(yiS)=
0,61€?947952023346
Slot- title
OlUt. UUt- 0.3830052047574654
Feature	Strengdi
.getn~........ .1415571 «51975797
àrem 2.2707400973131233 .cirtctviir ,1 6517008S89069003
[edit
Jgöö
1.5504909869753871 ........''r482:s:7'?fe948Ìr6
|tb	"1-2,303970881190259
Wvs ■•2 303970831190259 "lee	-2474i9203103012tì4
Figure 7: An example of ITR profile. The features are ranked according to a measure that indicates the discriminatory power of a word in classifying a book.
418 Informatica 26 (2002) 407-418
G. Semeraro et al.
ria a T'à Si «f al -T® • a ;
>yslem
AuthBi<f>

Abstract
rrhù tuie «bould piovute Ü« tools for crefttii« & cor&pkt« s\üt« of pio^:»:» foi thne-idimensioQAl compaler ttniniAtion, modelling and intag: synthesb. The ten ta]g» the teader ftora [the conslrwiioR of polygpnal models of objects through ngid body uùmolioii into hiererchisl Ichancter uiimalion, andfbkAUydownthe tendeiing pipeline for the syiubesisof re«listic iim&ses.Htf fust pail of tbe book, Basic Piinc^ks. covers the Xey concepts of 3D compter igraphicfl. The second part of the book k intend for the pnftssbnaJ "plogbi* and game ergine jdevelo pet «ui provides a coUsction of useful algpiilhms.The «Igorithras are introduced m suclia Fei^on, R, Stuart [way that they can be put nadilyinto practice; where possi))e, thealgsiithnvs an accompanied ibycomputer code. Tlu final pari of tliebook is devoted to sample programs produced with tlie jwidely available 3D graphics Ubnries, OpenOL and Direct 3D.The book should be useful for janyorte who wants to lean about 3D graphic« programming game pti^tunrwis, movie laniraatots, and anyone beginning a career in an industry associated with computer graphic». It [also servw as a reference to a wiia iwp of practical algpritliras. The level extends to the needs iof mote experienced professkjnd software devebpers. for example tk»se who wish to wnte Iplugin modules for any 3D application pto^m or conuneicially «v&ü^le gune en^.
Rank this item: f^ NolhingCl C 2 O 4 C 5 C 6 C 7 T-S C 9 O 10 ' Submit |
ijSystem iirate'.
Jafve Professunal L&rary' contains handover editions of four Java references. 'Java in a iNutshell* xs a quick leferettfo for aO Java pnjgrammers. This accelerated mtnxluctbn to the Java iUngua^i ani its key ^Plsalbwc seasoned progcuomsTS toatut writing co<k tight away, The ]lhird editbn covers Java 1.2 and Java 1,3."Java Examples in a Nutshell. 2nd Editbn" provides !Jav« piogramming examples Ih&l devebpets can both lean ftom and modify for their own use. iThe second editbn coveis Java 1.3 and contains IM complete, practical examples: over 17.900 of densely commented, piofessbnally written Java code covering 20 distinct Java APIs, v^rawiora	Servlets, JavaServtr Pages, XML, Swing, and Java 2D.FinalJy. "Java Foundalbn
'•Classes in a HulsheQ" is a companbn vohune to "Java in a HulshcU* for piogtawmers who write iapplicatbns that use graphics or graphical user interfaces. This intiodoction to the Java APIs :that comprise the Java Foundatbn Classes (JFC) is also a quick reference to all of the classes in [the javax.sw^andjava.awt packages and Iheirmimetous subpackagiis.
David Flanagan Jim Farley
Ranklhisitem: HolWng C 1 02 C3 T 4 C 5 C 6 Ol C S T 9 C tO Submit [
Figure 8: An example of result set obtained by the ITR profile.
Informon—An Emergent Conscious Component
Anton P. Zeleznikar
Volaričeva ul. 8, Ljubljana, SI-1111
s51em@hamradio.si
Keywords: complexity, decomposition, emergentism, informational, informon, informonic consciousness, intention, learning curve, metatheory of informational, step curve of consciousness
Received: July 3, 2002
This article deals with a conscious entity being the building block of conscious systems. Such an informational entity is called informon, with its local and global function. The idea of informon as a conscious unit roots in the property of a sufficient complexity and learning capabilities, realized by different sorts of informational interpretation, formalism, general and metaphysicalistic decomposition concerning informon.
Emotional and cognitive informons show the complexity, intention, and capability of consciously, subconsciously, and self-consciously autonomous entities informing within a conscious system.
1 Understanding informon a
Consciousness seems to be an informational phenomenon emerging within an individual (physical, biological, phenomenal) brain. This kind of conscious informing is grasped as an instantaneous informational process of temporary active attention concerning definite matters in in-formationally complex ways, coming into the foreground out of possible conscious background with conscious potentialities. The search for an realizable concept of consciousness concerns the possibility of design leading to something which could be called artificial, machine-like, or robotic consciousness.
To be conscious means to be involved informationally in a matter (entity, problem, process, event) of awareness, in an instantaneous way, in a moment of attention informationally concerning a distinct matter, that is, by an immediate concern that is spontaneous and transitory simultaneously, being a form of an informational event, happening as a matter of individual experience. This state of conscious possibility and possible experience concerning a determined or verbally named matter a is symbolized and informationally formalized by the notion of informon a, published for the first time in German [9] and described before in [8].
To be more clear let's take the example of an emotion. In English, up to 2,500 names or name phrases for emotions can be distinguished. In Slovene, we can probably identify some hundreds names marking emotional states'. Taking a concrete emotion, for instance, named anger, information-ally formalized by aanger> we search for the meaning of this word in English. Using dictionaries, thesauri, encyclopedias, and our own imagination concerning anger, we can write a dissertation on anger. The meaning of anger is now
determined by a collection of anger explaining sentences, paragraphs, psychological sketches, and the like. We understand that the collected meaning is in no way a final result and can be continued, refined, advanced, etc. The acquired meaning given by the linguistic collection concerning anger we call anger informon or informon, concerning anger, denoting it symbolically by Oanger-
What do we have in mind by the obtained result of the concept? The name danger, a pure marker or basic informational operand concerning the English word anger, is now expanded or informationally propagated within the language to a meaning of anger, the still emerging informon denoted by agnger- In the complexity of language, this meaning is expressed by meanings of other emotional, cognitive, and other sorts of informons. The complexity becomes interweaved —informonically perplexed. In general, for such a situation, we can introduce the informon notations named a in a formula-dependent way.
— Val«; "2
oti
or, in a formula-system way,
^ ^ (Q; 02; ... ; •..)
The third possibility would be to express the informational dependence of operands by the so-called informational concerning of the form

a
a, ai, 02, ..., ai, ...
' I believe that a study of this sort was not made until now. I tried something in this direction by translating known English terms for emotions, however, it happens that many distinguished English terms fell into the domain of one and the same Slovene term.
The first expression is something we know as mathematical expression of a function or formula depending on its operands (variables) a, o^, ..., ou, ... . In case of an informational formula, parenthesis-like floor delimiters and 'J' are used instead of '(' and ')', respectively. The last parenthesis pair is used for delimiting subformulas occurring in informational formulas and, simultaneously, delimiting or enclosing the
420 Informatica 26 (2002) 419^31
A.P. Železnikar
operands (elements) of a formula system ^^ as shown in the second expression. In the third expression, infor-mon a concerns informationally in a complex manner the name a and informons	..., ou, ... emerged dur-
ing a system informing. It is essentially to stress that operands a, o^, ..., ... certainly possess common operands and, in this way, become circularly structured, that is, inform circularly in one or another way. Detailed meanings of such expressions can be found in the study [8] and elsewhere in its references.
Let us show the examples of the discussed three infor-monic system notations Let the emotion of anger, Clanger, inform dominantl^Hn a situation within the conscious system, together with cognition concerning anger and some other involved emotions. All of these components are informonic (complex and consciously structured by themselves) and build up the informonic system of anger as the dominant intention. In the formula dependent way it means, according to the first kind of expression.
- ganger

aanger
Oangeri ^cognition franger
trage langer j Ssadness [langer j ^depression langer
^aanger (^Clanger; Ccognition Clanger], •••,
trage [oanger j .Ssadness langer , ^depression langer
d)
In the third, informational-concerning expression.
^ »anger ^ danger Ccognition [langer
trage danger ; ^sadness danger

lepression
langer
possess the conscious capability within its own informational organization. The hypothesis of the necessary complexity [2], exponential process of learning [3], and informational organization of emerging [8], offers a smart and believable approach to make informational entities intentionally conscious in an artificial way, that is, outside a natural biologically founded consciousness.
In this expression components of the informonic formula langer linked by operators. In the second, formula
system expression, components are linked through common operands, that is.
the system becomes circularly perplexed according to the dominating informon Oanger- It becomes evident that the three systems marked by $ aanger represent nothing other
than the actual informon ttanger within a complex conscious system.
2 Consciousness versus complexity, learning versus time, and decomposition versus emerging of informon
The main problem of informon concept is the implementation of its autonomous conscious function. In principle, each informational entity represented by an informational operand has the property to function consciously per se, to
Learning of skills
Skill mastering Professionalism
Knov\/ledge of a professional
Informational consciousness, 3 o)
o c Q) O)
(b)
Development time
[Linear measure]
Threshold of self-conscious
Informonism
Province of unconscious
Province of conscious
Operand-operator complexity
Intentional complexity
(C)
Area of technical complexity
[Logarithmic measure]
Threshold of conscious complexity
Area of enormous
complexity
Number of decomposition steps
[Logarithmic measure]
Figure 1: (a) The exponential acceleration of an entity's knowledge through learning versus development time (Kurzweil [3] p. 34, the learning curve yj. (b) The step curve J^of emerging of informational consciousness versus operand-operator complexity (Buttazzo [2]). (c) The complexity acceleration versus the number of informational decomposition steps (A, OT, 3, and other sorts of decomposition).
Fig. 1 shows the learning versus time exponentialism (a) and the emerging of consciousness versus component complexity (b) being necessary for the occurrence of a conscious system. The learning curve (a) shows how the state of a skill mastering advances through learning versus time from the knowledge of a beginner. In this part of the curve the acquiring of knowledge has an exponential nature when the beginner grasps a lot of new knowledge and begins to use it as his or her own experience. In a point of development, the critical knowledge is reached putting the beginner near the threshold of becoming the master of the learned skill. Finally, the skill mastering is reached, characterizing the skill of a professional, which then can be still improved in an advanced manner. The point of critical knowledge is a kind of the donkey bridge and, when crossed, the way to the professional side becomes open. One of the best examples of the learning curve is the exponential acceleration in computer technology, where complexity of components and their speed of operation advance exponentially through time. It is understood, that the critical knowledge of the complex technology is already reached, so the advancement to an artificial conscious system becomes possible through the next decades.
The step curve J~m (b) is crucial for the occurrence of consciousness or conscious system. Human brain is an example of neuron and synapse complexity being sufficient for the real occurrence of consciousness in man. In primitive biological systems, the unconscious nervous activity takes care for the functioning of the necessary conditions of life. At some complexity of the neuronal system, intelligence can be observed being a manifest of the species successfully fitting to the environment. The state of the beginning of intelligence can be observed in machines using the technology of artificial intelligence (house equipment, today expert systems, and the like). As we see in Fig. 1 (b), this kind of intelligence is still far under the complexity needed for the most primitive form of consciousness. When complexity raises, a kind of jump to the conscious ability occurs. In man, some synapses and neurons constitute the conscious function being already substantially above the limit of consciousness occurrence.
Finally, the third curve (c) shows how a part of intentional complexity depends on the number of decomposition steps. The point of critical complexity, critical knowledge (a), and the beginning of intelligence (b) coincide. This coincidence is the condition for an informational system to become conscious, as the step curve (b) shows. The transition from unconscious to conscious happens along the "step" in curve (b). The other part of complexity might be conditioned by the number of physical components available for the machine in which a conscious system is embedded.
By the acquiring and emerging of knowledge the complexity of conscious system raises. Vice versa, the enlarged complexity accelerates the acquiring of knowledge. Under artificial or machine-like circumstances, artificial consciousness can widely surpass a biological consciousness
in the brain. A silicon or quantum-technology brain will substantially surpass the biological brain in complexity, speed, interior and exterior communication (informational connectivity), and in (unforgettable) memory, being capable to accumulate everything happened to a conscious system in the past.
Functional and componetial complexity, building up the intentional informational complexity, remains one of the major requirements in the design and implementation of artificial consciousness. The informon as a component of conscious system needs the complexity in itself and in its environment.
3 To verbal and formal etymol- ogy of the word informon, informoron, and cogniton
For a physicist or an electronic engineer, the comparison between the electron concept and the informon concept may be instructive. For instance, electron is in no way a static, definite, or even transparent notion. On the contrary, electron is a dynamic phenomenon imagined in the brain of a physicist constructing the atom theory. It is not only a particle with mass and charge, but a moving particle within the atom organization and outside of the atom as an electrically charged particle. The invisible trajectories of electron within an atom concept exist just as a logical predisposition of the atomic model. Mainly, electron is characterized by its charge, mass, and the dynamics originating in the charge moving along an electro-magnetic field. The point of this story concerning electron is that its definition is in no way as simple as it would be seen at the first glance. The origin of the word electron is Greek and means amber.
Can we expect a simple story of an informon etymology after all? There will be several additions determining infor-mon's nature in the world of informational. The word informon is a fusion of the Latin informo and the Greek on (being, entity). Translated into English, the meaning of informon is informational being or Being of the informational. Informational entity is the most general term to which the term informon belongs. It comes fore as the intentionally extremely complex entity connecting, interweaving, and in this manner including other informational entities and, in this way, informing and being informed consciously by itself. Informational entities, as they occur in conscious systems, are conscious, self-conscious, and subconscious as informational units. In this respect, informon brings a new meaning into the discourse of conscious structuring and organization, and understanding of its informational constitution. A conscious system —biological or artificial—can be imagined as an informonic organization, possessing initially a basic shell structure, complexly connected to various and numerous other entities in a circular way, ensuring the conscious maintaining and development of the involved informational entities.
422 Informatica 26 (2002) 419-431
A.P. Železnikar
Informing complexity
u
z
Local informon
Global informon
Local informoron
Local cogniton
Operand's informing
a
a ^ (a; a; a)
(^q; XS ; Ca ; %)
OL

oi^Ea
Intentional informing

^a i ^T- i ^T- i ^T
'a -^ot
^^ (x^; C^)

Counter-informing


■■a
T


CQ ^ fco
Informational embedding
e.




^ f f „
Table 1 : This table shows, how the initial ( sublocai) components a, 1«, Ca, and Sa of informon a become local informons o^ Xa, Ca, and £a and, in this sense, perform as individual informons by themselves. The similar is shown for the global inforìnónT
Nouns
Adjectives Adverbs Verbs Participles
Information, a Informon, a Informoron, a Cogniton, 5
informational	informonic	informoronic	cognitonic
informationally	informonically	informoronically	cognitonically
to inform	to informonize	to informoronize	to cognitonize
informing	informonizing	informoronizing	cognitonizing
Table 2: An overview of meaning concerning the words information, informon, informoron, and cogniton and their English derivations.
Etymologically, it can be useful to structure the informon into a more detail. For instance, it could be quite appropriate to distinguish the so-called informing-counterinforming subsystem and cognitive subsystem in the initial and further developed informon organization. For such a purpose, two new terms can be coined: informoron and cogniton, respectively. In this texture, informon a is an informational fusion of the informon's namena, informon's informoron a, and informoron's cogniton ä, that is, a formula system a ^ (a- ä; a).
The informon comparison table Tab. I can be useful: it forces us to rethink the recursive definitions of informon and make the introduced symbolism transparent for the common use in the future. According to the table, in a concrete case, the informon a components are informonically constituted as Iq ^ (la,	;	(informon's
informing), ^ (c^, C^, Cf^j (informon's
counterinforming),andfa {£a\	^f»)
formon's informational embedding, called also cogniton).
Evidently, the informon definition is recursive and can reach any reasonable informational depth of a situation decomposition.
In a similar way, this happens to the initially local components a, la, Ca, and Sa (being named operands only), which in the framework of the global informon a become global informons a, Xg, Ca, and 5a as presented in Tab. 1.
Adequately to the local informorons and local cognitons in Tab. 1, global informorons can be defined in the form
; ^^ (^xg; Čg^

and global cognitons in the fqrm
a ^ ta;
'Xa'

Ca'

£a
Global cognitons leave the question "What do they represent?" open. They certainly include a global understanding of the name (title, topic) a, its derivatives, and sub-names, acquired through the informational decomposition
a
propagation in global circumstances, for instance, including the meaning of a in different languages and image presentations and processing [5], Such a sort of cogniton can emerge probably on a natual and an artificial level of consciousness, when the meaning material is assembled and systematically ordered and treated in a nervous system [6] and an informational machine.
To keep in mind the introduced symbols the following list with supplementary explanations could be helpful:
a Name of entity, operand
a Informon, as consciously organized complex entity in local or
individual environment a Inlormoron, a complex component of informon, a subsystem of intentional and counterinformational part
Cogniton, a complex component of informon, a subsystem representing the cognitive part Q Global informon, as consciously organized complex entity in global environment, expanding over local or individual informational borders
Ct Global informoron, a complex component of global informon, a subsystem of intentional and counterinformational part
~ Global cogniton, a complex component of global informon, a QL subsystem representing the cognitive part
Into the etymological view of informon additional words can enter, as shown in Tab. 2. The meaning of adjectives, adverbs, verbs, and participles must be understood in a pragmatic commonsense of the English language. For instance, informonic means to have an informational organization of informon. To informonize means to make an informational entity informon-like. Informoronic means to have an interior organization of informing and counterin-forming simultaneously, etc.
4 Understanding consciousness in an innovative way
Usually, references concerning the topics of consciousness or mind root in some kind of traditional philosophy, for instance, such as Chalmers' conscious mind. In the last couple of years, questions touching problems of an artificial consciousness implementation come to the foreground (e.g., Buttazzo, 2001 [2]). The characteristic of references is a typical reductionism rooting on one side in an abstract philosophical discourse and, on the other side, in a too small number of properties pertaining to conscious systems. To such a conceptualism, usually a hierarchical organization of consciousness is studied and being disputed, rooting in the structure of brain and its functional locations.
For instance, the main question of the traditional consciousness studies remains if and when the design and implementation of an artificially conscious system would be possible. So, the experienced properties of conscious mind are rather very abstract concepts of unity, representation, supervenience, being in relation with, complexity, learning, and the like. A descriptive and exact formalism of conscious phenomena is practically not known, with few ex-
ceptions found in an exhaustive formalistic study of artificial consciousness in [8], In this ongoing research some of the characteristic concepts or properties of conscious systems—biological or artificial —are systematically listed in Tab. 3.
The right column of the table is dedicated to the innovative philosophy, formalism, and methodology, all being relevant for the future artificial consciousness implementation. As the reader might observe, a substantial number of criteria for conscious system is identified—some of them by an entirely new approaches, for instance, how to generate, acquire, embed the meaning of solutions as a sort of results coming up in conscious systems. So, we can discuss some items of the table additionally and complemetarily.
The main point of this paper is to discuss the constitution of informon and show its functional (structural, organizational) sufficiency to operate or inform consciously, that is, as an independent conscious subsystem, coming into the conscious foreground upon the inner and environmental impulses, and putting it into the background, when other, for a particular informon relevant impulses, force other in-formons into the conscious foreground.
The concept of informonic consciousness does not reflect the physical and biological organization of the human brain with a specific and hierarchically structured architecture impacting and conditioning the information-functional specificity of human mind. It concentrates on informational possibility of conscious phenomena as they appear to an individual consciousness and can freely inform within a machine.
5 A concise definition of informon 5.1 The hypothetic background
Formal informational definition of informon needs a complex and environmentally perplexed definition of a formula system. Informon as an informational entity needs its verbal and formal definition. Verbally a lot concerning informon was said. What we need is a unique symbolism for informon a rooting in its name a.
The name a, representing for instance a word, notion, concept, existent, imagined, etc., carries an intention of its meaning. The intentional means something which is already informonic in structure, organization, and propagation of a's meaning through and into informational environment (space):
-	Structure of that which emerges out of the initial name a is informonic, e.g., basically metaphysicalistic in such or another way.
-	Organization of the emerged is spontaneous and emergent in the propagation of the a's meaning. We usually say that the meaning of a arises in complexity, however in an intentionally consistent (predominant) way.
Item
Commonplace mind philosophy
Informational consciousness philosophy
(1) (2)
(3)
(4)
(5)
(6)
(7)
(8) (9)
(10) (11)
(12)
(13)
(14)
(15)
(16)
(17)
(18) (19)
(20) (21)
(22)
(23)
(24)
(25)
(26)
(27)
(28)
Properties of conscious mind
Unity of consciousness
Consciousness representation: language,
image, sound, etc. Being in relation with other components Complexity of brain (neurons), mind Phenomenal intentionality: in cognition,
emotions, sensory domain Conscious spontaneity, occasioning,
happening, being Direct and circular causality Conscious entity, event, process, experience
Parallelism of conscious events,
processes, experiences Conscious seriaiism, consequentialness, with simultaneous conscious parallelism of events
Conscious emergentism, experiential
happening Conscious structure and organization, embodied in the brain architecture and its properties, depending on it
Metaphysics of consciousness, the inner
organization of conscious systems Conscious ontogenesis
Conscious solving of problems
Events: conscious, sub-conscious, self-conscious, and not being conscious
Conscious counterfactual relations
Conscious components, as subsystems of C.S., e.g., cognitive, emotional, atten-tional, behavioral, metastatic, etc. Biological, individual consciousness Physical embodiment of consciousness
Brains
Conscious system
Formalism: exact, physicalistic, scientific,
mathematical Methodology: scientific and experimental
Authenticity of consciousness Qualia
Pure consciousness
Properties of informational consciousness I. systematism, informational formula systems I. operands (q), i. operators ([=), i. formulas (ip), i.f. systems
I. causality captured by i. formulas, common operands Complexity of i. formula systems: operands, operators I. intentionality in meaning, goals, aims, i. stability,
i. perseverance, i. orientation, ideology I. unforeseeability, unpredictability, emergence of the
informationally unexpected, possible I. causal relationship of i. entities I. entity is: 1. operand, i. operator, i. formula, i. formula system, emergence of meaning, understanding, expressed formally, generated by i. decomposition I. parallelism of operands, operators, formulas, formula
systems, represented formally Non-circular and circular serial, reverse serial, biserial, split biserial 1. formulas (<P>)> and uniform and nonuniform i. formula systems [ip'J^, where v e {A, o} and > € {->,	<-)}
I. arising of entities causally, unforeseeably, occasionally,
happening accidentally, environmentally dependent I. structure and organization, expressed by 1. entities, informing continuously in an intentional, emotional, and cognitive way
I. metaphysicalism, with m. shell 9JZt> [a], where a e
|/3, , , $1 ; /3 marks a primitive operand
I. heaping, accumulating, enlarging of complexity, entity
relatedness, causality, meaning Producing of i. entity specific meaning, informationally
fused in the solution f. systems Informing entities are operands, operators, formulas, and f. systems, representing conscious and other events, processes, and experiences
„oil
H.
Counterinforming of 1. entities, in decomposition with counterinforming components (La and c«
Informons, informing in parallel: ^ ;=± where i = 1,2,...; a,- is the name, a ^X^; C^j is informoron, and 5 ^ S^ is cogniton Artificial, computational, 1. individual consciousness 3 I. embodiment in i. machines, robots, local and global
webs, informationally accessible archives I. machines, global web computing systems Informonic systems, systems of informing informons I. formalism: embodied in i. axiomatism, informons, by operands, operators, formulas, and f. systems I. methodology: decomposition {A,dn), parenthesizing (•13), schematizing (6), gestaltizing (F), rotation of operands (SH), i. axiomatism, inferentialism Intentional i. emergentism, individualism, creativity, objectivism
Individual, unique, sensory and metaphysical experience Artificial consciousness as informon 3, concentrating on consciousness name 3 as such
Agenda: c. —conscious, f. —formula, i. —informational, m. —metaphysicalistic, s. —system
Table 3: A correspondence concerning the commonplace concepts and the informational concepts of consciousness. For details see [8],
-	The propagation of a's meaning comes into being by informational decomposition of that what emerged through previous processes of decomposition. Decomposition itself depends of the intentional-ity (meaning) of the name a, beginning from the initial shell of informon a, where the initial intention of the meaning is being captured.
-	Decomposition of something is informational propagation of something's meaning through the informational space.
Hypothesis 1 (Complexity and initial shell) An informational entity (operand, formula, or formula system) is said to inform consciously, if and only if
1.	it possesses a sufficient amount of complexity, that is, sufficiently large or larger number of informational components (operands, formulas, formula systems), and
2.	has an initial informonic structure of organization, that is, an informonic shell, from which the complexity can start according to the intention, given by the entity's name, e.g., a. An example of such a simple initial shell is informational metaphysicalism (decomposition) Tl\ a].	□
Complexity is comprehended as substance (e.g., being material, brain-like) and spiritual (e.g., mental, phenomenal, mind-like, informational, informonic). Complexity emerges intentionally, as a consequence of intention in orientation, materialization, informational decomposition, meaning the propagation of informational intention in the complex informational space.
Hypothesis 2 (Learning and decomposition) A conscious entity develops, that is, enlarges and advances its organization by
-	learning, happening through the sensory system of the conscious system (e.g., nervous system including memory), and
-	informational decomposition, meaning interpretation, deduced from intentional name and its semantic derivatives.	□
An informonic shell has, by definition, its intentionally informing part concerning the informon's name. Complexity, intentionality, learning, and decomposition are all emergent informational phenomena, being structured and organized spontaneously in a serial, biserial, parallel, and circular way of informing.
5.2 Definitions and a consequence concerning informon
We need a couple of definitions determining the concept of informon gradually, from the initial determination
to its conscious informational function. The necessary conditions are, certainly: intentionality (informon's name a), complexity (in local and global informational circumstances), emerging of complexity by decomposition processes and, finally, the constitution of an informonic conscious system.
Definition 1 (Local and global formula system) Informon is a complex and perplexed local or global formula system, denoted by a or ^ respectively, with the operand name (argument) a, possessing a conscious structure of informational organization.	□
Definition 2 (Local and global informon of in-
FORMONS) a local informon a informs in an information-ally localized area, e.g., in an individual brain or machine and its sensory environment. A global informon a is meant to inform in a global area up to the cosmological dimensions and, in this respect, represents an informonic system of other, informationally involved informonic systems. Both local and global informon are systems of informons. □
Definition 3 (General initial informonic decomposition [ad Initial informonic decomposition of operand (name) a, called informonic shell, is, in its most general form, a circular biserial formula system, expressed schematically by
a 1=
t= a;
e
■-t®" r 1
3> M
where t> £ {—, (->, The corresponding graph of this schema is presented in Fig. 2A.	□
The scheme 6
3t> M
covers the graph in Fig. 2A. We see how by considering this graph many different formula schemes can be formed. The graph corresponds exactly to the primitive formula system
3> M
^a	a	a	\
a =1 a =j a =1
H2:« /
As expressed by the general form of informonic decom-
position [a], particular cases are the following: serial informonic decomposition fa], reverse serial in-
Oll'
formonic decomposition [a], (proper) biserial informonic decomposition 3^ [o^l' split biserial infor-
■ ,	• •	r T
monic decomposition 3_+,<_ IQI ■
The informonic organization presented in Fig. 2A is understood as the maximal form of basic (initial) informonic
Intentional informational embedding Local informon's cogniton ä
Informon a's name a
-a ] Intentional counterinforming
(B)
Outputs to the local and global area, a and ^ respectively
Initial metaphysicalistic decomposition
la Initial metaphysicalistic intentional informing
Initial informoron 5, captured by the circular system scheme
|=i« Nc^a 1=^«;
< [al
\
3c. h i« N 3a; €a\=Ca^
Inputs from the local and global area, a and ^ respectively
£a Initial metaphysicalistic intentional embedding
Initial cogniton 5, captured by the circular scheme [= e» |=
Metaphysicalistic shell name a of the initial informon a.
Ca Initial metaphysicalistic intentional counterinforming
Figure 2: (A) The possible graph structure presents a maximal biserial form, called also the bicircular informational supervenience of informing, counterinforming, and informational embedding [8]. (B) Serial metaphysicalistic decomposition of a is an example of the initial shell of informon, developing into the local and global informonic area.
structure. Also, connections between the components of the graph are multitudinous and biserial. What is the minimum form of informon still guarantying the development from an initial informonic state to the conscious informing? The only condition of the informonic graph could be that all the components are circularly linked, thus, some of the connections sketched in Fig. 2A may not occur. Different forms of informational metaphysicalism satisfy this criterion [7, 8, 9]. The simplest form would be the initial serial loop with a, Hai ^ai ^d £a, that is, a circular causality concerning all the informon's components.
The graph in Fig. 2B shows the initial metaphysicalistic
>011
variant of informonic structure marked by . One can see how informonic components a, Xa, Ca, and £a are configured and connected within the graph. By a slightly modified graph to Fig. 2B, of the form
an intentionally stronger controlled initial cogniton is obtained. Intention informs the cognitive informing (£« being directly involved into the emergence of the resulting cognition e^. The graph is structured in a stronger way than
the graph Ö [a] in Fig. 2B. The primitive initial informon for the stronger graph, as presented above, can be expressed transparently by the system of subsystems of basic transitions corresponding to informonic componetns, that is.
Ql ^
3« hi«;
=1 ia;
i« N i« N VCa H^a y
representing, transparently.
q;
\
Ca N Ca; C-a H Cai Ca H \ea\= Ca j
Ca 1= 'Sai
N e«; H f«;
e« N
ia h (Sa;
. \
//

Definition 4 (Conscious system) A conscious system 3 ii a system of informonic components i = 1,2,..., where some of the components, corresponding to a situation and time, represent the contents being currently in the conscious foreground (attention).	□
Consequence 1 (Consciousness of informons) An informational entity (formula system) named ß informs consciously if and only if it is initially structured in an intentional way according to Defs. 1—4, and if it, through sufficient number of decomposition steps, became structured and informationally organized to a sufficient degree of complexity. In this case, the named entity ß has reached the informonic, that is, conscious organization ß.	□
Proof 1 Although the proof of Cons. 1 is evident, its proving interpretation might be truly useful. How did the initial name ß expand or arrive to informon /3? At the beginning, the first step toward the informonizing of ß is the initial, the so-called informonic shell decomposition of ß, marked by Ol"/?! (see, for instance, Fig. 2A or Fig. 2B). By the initial decomposition 3[/3], the initial informon ß ^ Df/^l is coming into existence. This decomposition gives to ß the necessary intentionally informing, intentionally counterin-forming (emotional), and intentionally embedding (cognitive) organization of the shell. In the next procedures of decomposition, the inner components of the initial informon will propagate and expand into the environmental, intentionally related tissue of the informational space. Thus, the complexity of ß will rise and, gradually, will reach the complexity, being necessary for /3's conscious behavior. Defs. 1 —4 concerning ß will gradually, by decompositions considering the /3's intention, structure and organize in a complex way the informon ß for its conscious function.
To stress, decomposition considers learning and, with ß as the meaning structure and organization of intentional meaning concerning the name ß, the acquired knowledge is accumulated as the /3's experience up to the moment, when in a situation, ß enters into the occurring conscious domain. Virtually, this proves the value of the consequence. □
5.3 A new philosophy with informonic consciousness
Informon is an informational unit being in the foreground of the instantaneous consciousness. Consciousness is nothing else than a momentary happening among informons, some of them coming to the conscious surface (attention). In a time interval, consciousness is understood to be a system of actively informing informons, that corhe into conscious existence, that emerge from the unconscious background of consciousness into the conscious existence; they emerge as momentary conscious entities, out of the subconscious domain, where they inform, and depend emer-gently and crucially on the temporary sensory situation. Consciousness is an informonic theater in which informons play their informational roles for the observing and informationally acting conscious system. In this view, conscious system happens as a momentary sequence of groups of informons marching through the time of conscious experience. Consciousness is experienced as a column of informon groups in the understanding of past, present, and future situations. In this sort of grasping, consciousness is just an informon by itself and to itself. The consciousness informon 3 understands currently active informons 3i, 32, ■ • • and specific informons 31,^,... understand (are aware of) consciousness 3.
Each informon is a conscious entity per se. As such, an informon consciously grasps other informons and itself. For instance, aanger grasps consciously itself and other, the informon informationally accompanying informons as, for mstance, Ssadness ; Cembarrassmenl; -Ssurprise! bhate> P plot, etc. In such an angry conscious happening, the informational interplay of the listed and other informons takes place. Thus, anger or any other emotional or cognitive conscious components can pull into the conscious orchestration several other emotional and cognitive components. In a moment of conscious happening, a distinct orchestra of informons plays it informational prelude into the next conscious happening.
The distinction to the common understanding of consciousness is in the ability of conscious informing of each single informon together with other informons. Consciousness 3 is just a specific informon Cconsciousness and nothing more. In the framework of this understanding, informon a means a specific name a, propagated into the possible meaning of a, concerning intentionally and attentionally the entire conscious system. As we know, a is constituted by system complexity, a-intentionality, learning or informedness coming from other informons, and the own ability of decomposition or interpretation accelerating the complexity of the momentary conscious situation.
Now, let's say the said about a conscious system again by other words, with the aim, to acquire the additional clarity in informonic understanding of consciousness. The informonic concept does not stress so much the so-called hierarchically organized system being characteristic for the functioning of the human brain (informational machine). Inten-
tionalities of complexly linked informons make the spontaneous and circular informing of the current conscious matter possible.
Consciousness about something emerges as an informational overlapping of informons, being currently relevant for a specific situation and time. A conscious event is constituted by the currently informing informons, as they occur spontaneously in a distinct time interval. The consciousness about something consists of yet consciously informing informons, transiting from a current situation to the next situation, where the next informons are coming to the conscious surface, that is, in the conscious foreground. In such a happening of conscious events, the hierarchy of the in-formationally relevant is not so much in the game as the informons' intentions and their interweavedness with current situations and time.
This kind of conscious mind comes fore as a spontaneous sequence of conscious events, that is, informons, depending on interior and exterior circumstances, occurring randomly but, certainly, in an intentionally unforeseeable and unpredictable way. Each individual consciousness about something confronts itself with a spontaneous stream of conscious, sub-conscious, self-conscious, and unconscious events, with an existing and entirely new experience, with learned and just learning facts, objectivities, irrationalities, beliefs, attention, cognition, and emotions. Such a conscious happening is offered through an informational overlapping and time-transiting of informons, constituting the feeling, unwinding, and revealing of consciousness about something.
6 Substructuring informon schematically, graphically, and meaningly
Capturing the inner organization of informon, its syntactic (formula-like) and semantic (perenthesis-setting) structuring, can be useful for informational decompositions, starting from the beginning informon's shell components. Decomposition of a current informon development of the form a into a more sophisticated and more complex form of the emerging entity, now in a new form a, hides the basic informon's intentional orientation, given to it by the initial meaning, at that time an informon's pure shell form
a (^a; Z« ; C« ; Ea^, characterized intentionally by the meaning of its name a.
The emerging of informon is reflected in the initial (starting) pure informon's shell, at that time a 3[a], and, after some decompositional steps, advanced to a more developed meaning of a, then a and, finally, to the currently developed form a, in which the maximal amount of meaning was gathered (accumulated) up to now. This meaning, a, is the semantic representation of the name a, structured schematically out of the graphical representation © [a]. Informational schemes form a formula scheme sys-
tem 6|"$], which can finally be parenthesized, ^Pf© ["$]], so the final and precise form of meaning concerning a is obtained, however, merely up to now.
New and new decompositions of the obtained informon's situation are only parts of an infinitesimal procedure, approaching to a more and more precise meaning of a, however never reaching a final or exact meaning, but coming near to a virtually final meaning as close as possible [7].
In this respect, an informon reflects informationally to some extent the slogan One in all and all in one (OA&AO). An informon propagates into the informational space gathering a specific (intentional) meaning concerning the informon's name. Using the graph of an informon, ©[a], the OA&AO problem can be made more transparent. First, let's remind that a system graph <01"$] is equivalently described by the primitive system This yields
Second, we introduce a relation of graphical inclusion,
with the meaning,
This means that all the primitive transitions of system a^ enter into the primitive system a'^. Precisely, by a primitive transition a \= ß, the operand a, the subscribed operator t= (e.g., t=subscript). and the operand ß is meant.
The way from system $ ^ , (/Jj , • • •, Vn^ ) to the corresponding graph	that is, to leads via the
de-parenthesizing of system formulas (6|"$]) and, then, via the primitive partition of formula schemes into primitive system An original formula G $ of length (number of operators in ip^), has been dissolved in the formula scheme 6 [tpj] and definitively lost in the possibility of	) differently parenthesized formulas. Further,
in the transifion from the system scheme 61"$] to the formula schemes are definitively lost. From a primitive system (graph ©r$l), nr=i ^ (X"' ) different formula systems can be derived for each reasonable n$ (a complete overlapping which covers the graph).
Let's mark the informing all (cosmos) by a^n, a part of the all by PpartInaili'	informational shell encir-
cling the part of the all by Sgj^gH Ppartf^'alll • Using the language of graphs, the following can be said:
Ö [a] -< © ©fa] ^ © © ["al ©
^aiil;
Ppart["alll : ® Ppartf^alll -^^roalll;
Sshell
©
Ppart t^all
^shell
Ppartfoalll
©
PpartKlll
The interpretation of this situation is presented topologi-cally [7] in Fig. 3. Namely, expressed by the corresponding
Figure 3: The solution as an informon's meaning, that is, the up-to-the-minute developed informon—the one —represented graphically: (A) The informon as a possibility of the all. (B) The informon as a possibility of a part of the all. (C) The informon as a possibility of a shell surrounding a part of the all.
primitive formula systems, there is o^Ca^l,;
^ PpartT^alll; Ppart^^alll ^ a^,,;
C ®shell
Ppart ®shell
«alll Ppart
"all
CPpartf^all
This set-theoretical presentation helps to make the graphical situation in Fig. 3 definitely transparent.
The meaning of something means to have an informational expression for something called the informational solution on something. For instance, in searching a solution of something a, the at-hand solution is a sort of decomposition, in general, A [a]. However, out of a formula (p or formula system $ in which a occurs, the solution concerning a can be explicitly expressed. Let mark the formula parenthesizing (setting of parenthesis pairs), IH the operand rotation in a circular formula, & the schematizing (deleting the parenthesis pairs) of a formula, and [ai,... ,ai-i,ai, Oi+i,..., a„J a circular formula. In this case it is possible to solve formula (pi,lai,...,ai-i,ai,ai+i,...,a„] on each of its operands ai,..., Qì_i, a,, Qj+i,..., a„, that is, to get n different solutions, expressing the meaning of each particular operand. Then, for a solution, using the operand rotation principle, there is
o I	I
(p^ lai,ai+i,... ,an,ai,.. .,ai-ij ^
i = 1
6

It is understood that the solution upon operand ai is already [aiai_ 1, a,, Qj+i,..., a„J, obtained by an
operand oi decomposition, for instance, etc.
A solution upon an operand can be obtained from a complex formula system taking into account all its formulas. That what comes out of such a consideration are informons
7 The pure informon
The pure informon, iintormon, is the searching, propagating, and determining the meaning of the newly coined and introduced word informon in the informational space. By the acquired meaning, the concept of informon is fortified and made familiar and understood in communities of researchers, scientists, and publicists. For instance, the new word can enter into an English or other dictionary and be regularly explained together with its etymology.
Additionally, the concept of informon can be grasped formally by informational formalism. In fact, at the beginning, we concentrate on a name (phrase, named informational entity) marking it by a. What will be the informon named a and how will it develop informationally? According to the discussed definitions, initially.
( Iintormon W; \
linformon [a]

informon
m;
^'informon T"! ' y ^'informon T«! j
In general, an initially decomposed informon, e.g., 3 [iinformon [o^l 1 > attains the required complexity through the linkage to other informons. For the informonic components of the pure informon there is, evidently, in case of
430 Informatica 26 (2002) 419-431
A.P. Železnikar
intentional component,
'informon
( ^'informon T"! '
intormon
r«! 1
^^'informon '


informon ''"I
in case of counter-intentional component,
/ C	r T
^'informon r«!'
^informon

informon


infoirnonlfL!. '

informon
Fol
and in case of cognitive component.
'informon
/C	.
■^■informon T«! '
^'informon '
informon
fcT I

informon
fol
/
^a; la, Ca, S^, there is, evidently,
The next, being extremely interesting, is the intentional contents of each of informonic components. Beside the in-formons' names, the informing of intentional contents concerns informons and their components. The general properties of each informon and its components are the following:
1.	the existence of the name-a-specific intention iintentionM, the name-a-specific counter-intention Ccounter-intention , and the name-a-specific intentional cognition iintentional_cognition [<^1 ;
2.	the development of the name-a-specific intentional complexity Ccomplexity ^intention [«ll. the name-a-specific counter-intentional complexity CcomplexityPccounter-intention ["11. and the name-a-specific complexity of intentional cognition ^complexity ^intentional_cognition HI; and
3.	the emerging of the name-a-specific intentional decomposition 3[iintentionMl. the name-a-specific counter-intentional decomposition DTccounter-intention ["11. and the name-a-specific complexity of intentional cognition
3 lintentionaLcognition fall.
Within this circle of emergence, the intention is refined, stepping into greater details, and complexity is enlarged, being more and more interweaved into the informational space. The consequence is the enriching of meaning concerning the informon's name a, that is, the name-specific intentionality iintention • For an informon a
^ iintention ', ^complexity Tiintention 1 !
Dr^intentionTo^lli • • •
(^counter-intention W;	\
t^complexity [^counter-intention Ml; 3 [Ccounter-intention Ml; • ■ • /
/ 'intentional_cognition M;
^complex ty | MntentionaLcognition fai V 3 [iintentionaLcognitlon Ml; ••■ y
8 Conclusion
The concept of informon calls for a different, in some way new understanding of consciousness or, precisely, conscious system. It comes close to that what a conscious observer experiences on the own consciousness, its happening in a time slice and in the sequence of conscious time slices. It enables a straightforward reasoning in the direction of artificially conscious systems constituted by informons. Informon seems to be finally just an informational entity depending on complexity, intention, learning, and decomposition concerning meaning of the intentional name. The required complexity seems to be a problem of future technological development, and the remaining necessary requirements are already in the visible scope of mastering them philosophically and technically.
After all, a meme [1] seems to be nothing else than a specific self-replicating informonic entity^ in individual brains which breaks down the reasonable immunity of conscious systems against an informational excess, exaggeration, plethora, or surfeit. By the philosophy of informon, a concrete meme can be studied as an authentically conscious sort of informon, in an innovative general and meaning-specific way. E.g., totalitarian memes have informed as politically local informons.
Regarding the building block of conscious system, called informon, it might be reasonable to understand it as a consequence of macroscopic quantum nature of consciousness—which is pointed out in Perus [5], as well as in Raković [6], from where it follows that informons might be related to quantum eigenstates of the conscious macroquantum sysrem. The whole story should be then related closely to quantum algebra, and naturally explains the nonlocal (holistic) aspect of consciousness.
References
[1] Blackmore, S. 1999. The Meme Machine. Oxford University Press. New York.
^An informon informs in a self-replicating way by informational decomposition considering the informon's intention.
[2]	Buttazzo, G.C. 2001. Artificial consciousness: Utopia or real possibility? IEEE Computer 34:7:2430.
[3]	Kurzweil, R. 1999. The Age of Spiritual Machines. When Computers Exceed Human Intelligence. Penguin Books. New York.
[4]	MORAVEC, H. ì999.Robot. Mere Machine to Transcendent Mind. Oxford University Press. New York.
[5]	PeruŠ, M. 2001. Image processing and becoming conscious of its results. Informatica 25:575-592.
[6]	RakoviĆ, d. 2002. Hopfìeld-like quantum associative neural networks and (quantum) holistic psychosomatic implications. B. Reljin, Ed.: Proc. 6-th NEUREL. IEEE Yugoslavia Section. Belgrade.
[7]	železnikar, A.P. 1998. Topological informational spaces. Informatica 22:287-308.
[8]	železnikar, A.P. 2002. An Introduction to Artificial Consciousness. An Informational Approach, Formalism, and Implementation. Available free in PDF/Acrobat at location http:// www.artifico.org.
[9]	železnikar, A.P. 2002. Informon—ein bewußter Bauteil des Bewußtseins. Grundlagenstudien aus Kybernetik und Geisteswissenschaft —Humankybemetik43 (in press).
A List Scheduling Heuristic for Allocating the Task Graph to Multiprocessors
Janez Brest and Viljem Žumer University of Maribor
Faculty of Electrical Engineering and Computer Science
Smetanova 17, 2000 Maribor, Slovenia
E-mail: janez.brest@uni-mb.si, http://marcel.uni-mb.si/janez
Keywords: parallel processing, compiler, static scheduling Received: July 2, 2002
In this paper we propose a new static scheduling algorithm for allocating the task graph without communication costs to fully connected multiprocessors. A global comparison is carried out for the proposed algorithm and three reported scheduling algorithms. The proposed algorithm outperforms the previous algorithms in terms of the generated schedule length using Standard Task Graph set.
1 Introduction
To efficiently execute a program on a multiprocessor system [11, 19, 20, 8, 18], it is essential to solve a minimum execution time multiprocessor scheduling problem [16, 13, 14, 2, 4, 5], which determines how to assign a set of tasks to processors and in what order those tasks should be executed to obtain the minimum execution time. The tasks can then be scheduled to the processors for execution by using a suitable scheduling algorithm, static in compile-time or dynamic in run-time [7, 9, 3]. The optimal static scheduling, except for a few highly simplified cases, is an NP-complete problem. Thus, heuristic approaches are generally sought to tackle the problem. Traditional static scheduling algorithms attempt to minimize the schedule length through iterative local minimization of the start times of individual tasks. On the other hand for example the Dynamic Level Scheduling (DLS) algorithm dynamically selects tasks during the scheduling process [15]. As optimal scheduling of tasks is a strong NP-hard problem, many heuristic algorithms have been introduced in the literature [6].
In this paper we proposed a low time complexity multiprocessor static scheduling algorithm called MCP/CLR without communication costs, which is based on critical path (CP) algorithm, such as, for example, the MCP [21] algorithm. It generates high quality scheduling solutions.
The remaining paper is organized as follows: In the next section, we present a brief overview of various approaches that have been proposed for the DAG scheduling problem. In Section 3, we present the proposed algorithm, and discuss its design principles. We present the experimental results in Section 4, and conclude the paper with some final remarks in Section 5.
2 The Multiprocessor Scheduling Problem
In static scheduling, a parallel program is presented by a directed acyclic graph (DAG) [19]. In a DAG, G = (V, E), y is a set of V nodes, representing the tasks, and is a set of e directed edges, representing the communication messages. Edges in a DAG are directed and, thus, capture the precedence constraints among the tasks. The cost of node rii, denoted as w{ni), represents the computation cost of the task. The cost of the edge, emerges from the source node Ili and incidents on the destination node rij, denoted by Cij, represents the communication cost of the message. The source node of an edge is called a parent node, while the destination node is called a child node. A node with no parent is called an entry node and a node with no child is called an exit node. A node can only start execution after it has gathered all of the messages from its parent nodes. The b-level of a node is the length (sum of the computation costs only) of the longest path from this node to an exit node. The t-level of a node is the length of the longest path from an entry node to this node (excluding the cost of this node).
The objective of scheduling is to minimize the schedule length, which is defined as the maximum finish time of all the nodes, by properly assigning tasks to processors such that the precedence constraints are preserved.
The existing scheduling algorithms are classified into four categories by Ahmad and Kwok [2, 14]:
1.	Bounded Number ofProcessors(BNP) Scheduling: A BNP algorithm schedules a DAG to a limited number of processors direcdy. The processors are assumed to be fully connected without any regard to link contention and scheduling of messages. The proposed algorithm belongs to this class.
2.	Unbounded Number of Clusters (UNC) Scheduling: An UNC algorithm schedules a DAG to an unbounded
number of clusters. The clusters generated by these algorithms may be mapped onto the processors using a separate mapping algorithm. These algorithms assume the processors to be fully connected.
3.	Arbitrary Processor Network (APN) Scheduling: An APN algorithm performs scheduling and mapping on an architecture in which the processors are connected via a network topology. An APN algorithm also explicitly schedules communication messages on the network channels, taking care of the link contention factor.
4.	Task-Duplication-Based (TDB) Scheduling: A TDB algorithm duplicates tasks in order to reduce the communication overhead. Duplication, however, can be used in any of the other three classes of algorithms.
For our purpose, we will compare the proposed algorithm with three other BNP scheduling algorithms.
In a traditional scheduling algorithm, the scheduling list is statically constructed before node allocation begins, and, more importantly, the sequencing in the list is not modified.
The Earliest Task First (ETF) algorithm [10] uses static node priorities and assumes only a bounded number of processors [16,17]. The High Level First with Estimated Time (HLFET) algorithm [1] assigns the nodes in a DAG to the processors, level by level.
Similar to the ETF and HLFET algorithms, the Modified Critical Path (MCP) algorithm [21] constructs a list of tasks before the scheduling process starts. The MCP algorithm uses the ALAP (As-Late-As-Possible) start time of a node as the scheduling priority. The MCP algorithm first computes the ALAP times of all the nodes, then constructs a list of nodes in ascending order of ALAP times. Ties are broken by considering the ALAP times of the children of a node. The MCP algorithm then schedules the nodes on the list one by one so that a node is scheduled to a processor that allows the earliest start time using the insertion approach. The MCP algorithm looks for an idle time slot for a given node. The algorithm is briefly described in Figure 1 [21, 16, 14]. The complexity of the MCP algorithm is logw).
(1)	Compute the ALAP time of each node.
(2)	For each node, create a list which consists of the ALAP times of the node itself and all its children in descending order.
(3)	Sort these lists in ascending lexicographical order. Create a node list according to this order.
Repeat
(4)	Schedule the first node in the node list to a processor that allows the earliest execution, using the insertion approach.
(5)	Remove the node from the node list.
Until the node list is empty.
Figure 1 : The MCP algorithm.
sor which contains the most suitable place in time for a selected node.
The MCP algorithm schedules the selected node to a processor that allows for the earliest start time. The proposed algorithm has another processor selection criteria and they are described as follows.
3.1 The MCP/CLR Algorithm
Bmld_ALAP{y,
Sort_ALAP{)-
//v is number of tasks
for (i = 0; i < u; Ì++) {
ti = est(alap(ni));
if a processor j exists where SLj (i) < ti
then
schedule node n^ to a processor j where SLj{i) — ti is minimal
else
schedule node rii to a processor that allows the earliest execution
Figure 2: The MCP/CLR algorithm.
3 The Heuristic Algoritlim
In this section we discuss some of the principles used in the design of proposed algorithm. To minimize the final schedule length, we select a node as it is selected in the MCP algorithm. At each step of the scheduling process, the first node is removed from the list of nodes (the list of nodes is sorted in increasing lexicographical order of the latest possible start times) and it is scheduled to a processor. While we are able to identify a selected node, we still need a method to select an appropriate processor for scheduling that node into the most suitable idle time slot. At each step, the algorithm needs to find the most suitable proces-
The function Build_ALAP{) computes the ALAP time of each node and creates a list, which consists of the ALAP times of the node itself and all its children in descending order. Function Sort_ALAP{) sorts these lists in ascending lexicographical order as in the MCP algorithm.
Assumed that, in the scheduling process there are already scheduled i — 1 nodes. Next selected node is Ui. SLj{i) is the schedule length of the step i of the scheduling process on the processor j. The MCP/CLR {MPC/Close-Left-Right) algorithm (see Fig. 2) tries to find a processor j for the selected node n». It is needed to distinguish two cases of the processor selection step. If a processor exists, say j, which satisfy that SLj{i) is less or equal to the earliest start time {EST) of the selected node rii, our
algorithm assigns the selected node n» to the processor j with the smallest value SLj{i) — t^. Otherwise it assigns the selected node ni to a processor that allows the earliest execution (like the MCP algorithm), using non-insertion approach. The complexity of the MCP/CLR algorithm is logt;), too.
3.2 Scheduling Example
In this section, we present an example to demonstrate the operation of the proposed algorithm using the task graph shown in Fig. 3. The task graph was drawn using the Graphlet Tool {http://www.fini.uni-passau.de/Graphlet). The schedules of the algorithms are shown in Fig. 4. The entry and exit node are dummy. The MCP algorithm creates a list of edges and schedules the task graph onto the multiprocessor machine with 2 processors (processing elements) in the order: m, ng, ns, ns, ng, ny, 714, nu, rijo, ng, ne, ni2. The HLFET and MCP/CLR schedule the nodes in the same order as thè MCP algorithm. The ETF algorithm schedules the nodes in the order:
ng,nii,ni2. The order of nodes n4,nr,ns and the processor selection during the scheduling process, have caused different schedules of the task graph, and therefore also different schedule lengths.
(a)
PEOi'	ÜB			SSPP^		
PE 1	5	BI iPIS^Ìlillllli 1®	!#! 1 '^-lei	f |g ,	I^Bil-Äs	
						
			. tio ;	'40		tü
(b)
PEO	2		llllil'ii^Bt	B^BIilBllllll^BII	
PE 1	5' .	liii^iimi			
	_______... . ... . . . ______ .. ........................				
1				, bo	
(c)




Figure 4: The schedules of the task graph on Fig. 3 generated by: (a) ETF algorithm (schedule length = 67 time units); (b) HLFET and MCP algorithms (schedule length = 64 time units); and (c) MCP/CLR algorithm (schedule length = 63 time units).
Table 3: Number of times the optimal schedule is found, and a global error
	Optimal		Global
Algorithm	schedule	%	error
ETF	33	12.94	5577
HLFET	50	19.61	3189
MCP	57	22.53	1531
MCP/CLR	167	65.49	257
Figure 3: An example of a task graph with 12 nodes.
4 Results
In this section, we present the performance results of the proposed algorithm and compare them with the results of the HLFET, ETF and MCP algorithms.
We have implemented the scheduling algorithms on a SUN workstation using C/C++. They were evaluated by using a Standard Task Graph set:
http://www.kasahara.elec.waseda.ac.jp/schedule/. The Standard Task Graph set has 900 task graphs with 50 to 2700 tasks.
The results obtained in our experiments are shown in Table 1. The second and third columns indicate the name of the task graph instance and number of nodes, respectively. In next four columns results of the schedule length for the all of algorithrns are shown, respectively. In the last column the optimal schedule length value is shown. If the optimal; schedule is found,, the schedule length value is boldface. For some problem instances, the optimal schedule length is not known.
In order to rank all the algorithms in terms of the schedule lengths, we made a global comparison [17]. We observed the number of times each algorithm performed better, worse or the same compared to each of the other algorithms. This comparison is presented in Fig. 5, where some boxes have the left and the right side. Each left side of the box compares two algorithms - the algorithm on the left side and the algorithm on the top. Each left side of the box contains three numbers preceded by ">", "<", and "=" signs which indicate the number of times the algorithm on the left performed better, worse, or the same, respectively, compared to the algorithm shown on the top. Each comparison is based on the total of 300 task graphs. Each right side of the box contains the number of times when one of algorithms, the algorithm on the left side or the algorithm on the top, find the optimal schedule length. Optimal sched-



tOlOtOtO^——	—
Ulto —

S2-2-SS°°°SSSS2.SSS§-2.SSSSSSS2sSsS2.2S
oooooooooooooooooooooooooooooooo
t0t0N)r0N)t0K)l0t0t0K)N)N)t0N>t0N>t0K)N)OOOOOOOOOOOOOOOOOOOO
-----------------------------------------------........
\0 oo	- -	- -	-....................
K> — o VO CO
VD 00
TOcretreTOaQcisoqoqoocrQmTOoqoqootretroOQTOOoTOtrQCrotrQOooQTOoqaQ
IOI010ION)tOK)tO(OtO — — — — — — — — — — — — — — — — — K>u> — — — oooj — —
— t-otooo — j^i.Lft^cxiOouivoL'i
^Oto— hJOSMDVOvovooovovCUi
g
— ^
Ln Cs vo ^ Lft -J Ji. VO U> (O
— tO-JM — — — 00K>
— N) —
Co Co ^ ^ Cj è ft
--1	V_'	-f^ VJJ I-- INJ I--
— ^Joovovo^:^a^N> — ujoooou)

—	K>-JtO — — — 00(0
—	^ Ln W W Ji.
Ln ^ — vo to u> o o ^
O^ — OLntoj^to ■^^lOLnoto^^to
èg
—	to	UJ
OJ Ji.	—
u> to	UJ
00 VO	—
ooPSuiOi-" — t0a\0\Sui oSsooovovototoi^-vo'^on — tooo — ooo'^-t^
ON ^
o ii
^ 1.1 k^i nn nft i	t ij m
— IO 00 .
K ^ ° 00 s
ÌÀ
oo 00 Ln — to \0 VD 00

^to — — — ooto — — — —

o tji — to OS ON
—	^ '-'J t^	^ +=>■ ^ ^ ^ (Ti t^	— iCi ^ fii	^	^ ^ — ^ -ll r-S o ^	—I ^'n r^i
u> a\ ui
— -a to (/1 oo ~ tn u) tn Ol ^ U) \0 bi 4^	s
—	~J	to —	— h—	-^a	tyi	to 4i.	o o	ON Hi	Ui to	t*)	W 00	vo to
OVOIOh-tOONSON
to W „ K> ^ H- to fe ß § 11 -
_ tn — tooNONS OOO^oo — — 4i. — o — U)4iO""
ON 4^ 5
a* o ON § NO
00 - Ul g 1-
IJl ON o 00 NO to
4^ t*> ON ui g W W H
fi. — -jtotnoo^Tui 06 4i. to (/1 4^ s
NA r^l KJ KJ ^^ u ^
(/)tON©tOIOIOO\LO
oc
—	to~Jto — — — ooto — — — —
—	4^^UiU>tO4^Ol-n42.N0NDwS 4^^00N — Lfl^ovoto — tOLn^
—	— LOUiOO^tOVO —
^ -
to to to OJ
ss
uioivi — tooNONÄuiS; ooooooo — of^oPj 4^0000 — tON0^4^^
NO
00 3
OJ ON Oi 4^
UJ to ^ 00 4:^ — NO
to
t-n UJ ^ U\ U>	OJ
a^ — ^
o s
■a
3-
a-
&0 o 3" o D. C
o
00
•-1 p
'O
o
■a 3
C
3

to
U)
r
co
n p
Table 2: Schedule length with respect to the optimal solution
	Quality of the solution (Error)	ETF	HLFET	MCP	MCP/CLR
1	0% (optimum)	33	50	57	167
2	< 5%	178	182	195	88
3	5%- 10%	30	21	3	0
4	10%- 15%	11	2	0	0
5	15%-20%	3	0	0	0
6	Optimum not known	45	45	45	45
	Total	300	300	300	300
Figure 5: A global comparison of four algorithms in terms of better, worse, and equal performance.
these experiments, all the algorithms can be sorted in the following order: MCP/CLR, MCP, HLFET and ETF. The same order of the MCP and ETF algorithms can be found in [17], where communications are also assumed among the tasks.
5 Conclusion
This paper presents the static task scheduling algorithm which can schedule directed acyclic graphs (DAGs) with a complexity of log v), where v is the number of tasks in the DAG. The algorithm schedules the tasks and it is suitable for the graphs with arbitrary computation and without communication costs, and is applicable to the system with homogeneous fully connected processors. The performances of the proposed algorithm has been observed by comparing it with other existing bounded number of processor (BNP) scheduling algorithms in terms of the schedule length.
ule lengths are known for 255 of all 300 task graphs. They were computed on a parallel machine using the ISH algorithm [13, 12]. For example, the MCP/CLR algorithm performed better than the MCP algorithm in 238 cases, never performed worse, and performed the same in 62 cases. The MCP/CLR algorithm or the MCP algorithm or both of them found optimal solution of the schedule length in 167 cases. An additional box for each algorithm compares that algorithm with all other algorithms combined.
The experimental results of the quality of the schedule length are summarized in Table 2. For example, the MCP/CLR algorithm found the optimal schedule length in 167 cases and, additionally, the solution within 5% in 88 cases.
Table 3 shows number of times the algorithm has found the optimal schedule, and global error which is defined as difference between the sum of all the optimal schedule values and the sum of all the schedule values generated by the algorithm.
It can be noticed that the proposed MCP/CLR algorithm outperformed three other well known algorithms. Based on
References
[1]	T. L. Adam, K. M. Chandy, and J. R. Dickson. A comparison of list schedules for parallel processing systems. Communications of the ACM, 17(12):685-690, December 1974.
[2]	L Ahmad and Y.-K. Kwok. On parallelizing the multiprocessor scheduling problem. lEEETPDS: IEEE Transactions on Parallel and Distributed Systems, 10, 1999.
[3]	J. Brest, V. Žumer, and M. Ojsteršek. Dynamic scheduling on a network heterogeneous computer system. LNCS 1557, pages 584-585,1999.
[4]	J. Brest and V. Žumer. A Performance Evaluation of List Scheduling Heuristics for Task Graphs without Communication Costs. Proceedings of the International Workshop on Parallel Processing (ICPP'OO), pages 421^28,2000.
[5]	J. Brest, J. Jejčič, A. Vreže and V. Žumer. An Approximation Algorithm for the Static Task Schedul-
ing on Multiprocessors. VECPAR'2000 4th International Meeting on Vector and Parallel Processing, Vol. 1, pages 46-56,2000.
[6]	D. Darbha'and'D. P. Agrawal: Optimal scheduling algorithm for distributed-memory machines. lEEET-PDS: IEEE Transactions on Parallel and Distributed Systems, 9, 1998.
[7]	M, M. Eshagian, editor. Heterogeneous Computing. Artech House, Inc., Norwood, MA 02062, ISBN 089006-552-7,1996.
[8]	I.Foster. Designing and Building Parallel Programs. Addison-Wesley, ISBN 0-201-57594-9,1995.
[9]	E. Haddan. Load Balancing and Scheduling in Network Heterogeneous Computing. In M. M. Eshagian, editor, Heterogeneous Computing, pages 224-276, Norwood, MA 02062, ISBN 0-89006-552-7, 1996. Artech House, Inc.
[10]	J. J. Hwang, Y.-C. Chow, E D. Anger, and C.-Y. Lee. Scheduling precedence graphs in systems with inter-processor communication times. SIAM Journal on Computing, 18(2):244-257, April 1989.
[11]	K. Hwang and Z. Xu. Advanced Computer Architecture: Technology, Architecture, Programming. McGraw-Hill, New York, 1998.
[12]	H. Kasahara, H. Honda, and S. Narita. Parallel processing of near fine grain tasks using static scheduling on OSCAR (optimally scheduled advanced multiprocessor). In IEEE, editor, Proceedings, Supercomputing '90: November 12-16, pages 856-864. IEEE Computer Society Press, 1990.
[13]	H. Kasahara and S. Narita. Practical multiprocessor scheduling algorithms for efficient parallel processing. IEEE Trans, on. Computers, 33(M):1023, November 1984.
[14]	Y.-K. Kwok. High-Performace Algorithms for Compile-Time Scheduling of Parallel Processors. PhD thesis. The Hong Kong University of Science and Technology, 1997.
[15]	Y.-K. Kwok and I. Ahmad. FASTEST: A practical low-complexity algorithm for compile-time assignment of parallel programs to multiprocessors. IEEE Transactions on Parallel and Distributed Systems, 10(2):147-159, February 1999.
[16]	Y.-K. Kwok and I. Ahmad. Parallel program scheduling technique. In Buyya Raykumar, editor. High Performance Cluster Computing: Architectures and Systems. Prentice Hall - PTR, NJ, USA, 1999.
[17]	Y.-K. Kwok and I. Ahmad. Dynamic critical-path scheduling: An effective technique for allocating task
graphs to multiprocessors:. IEEE Transactions on Parallel and Distributed Systems, 7(5):506-521, May 1996.
[18]	M. Quinn. Parallel Computing: Theory and Practice. McGraw-Hill, 1994.
[19]	B. Raykumar, editor. High Performance Cluster Computing: Architectures and Systems. Prentice Hall - PTR, NJ, USA, 1999.
[20]	B. Wilkinson and M. Allen. Parallel Programming: Techniques and Applications Using Networked Workstations and Parallel Computers. Prentice-Hall, En-glewood Cliffs, NJ 07632, USA, 1998.
[21]	M.-Y. Wu and D. D. Gajski. Hypertool: A programming aid for message-passing systems. IEEE Transactions on Parallel and Distributed Systems, l(3):330-343,July 1990.
JOŽEF STEFAN INSTITUTE
Jožef Stefan (1835-1893) was one of the most prominent physicists of the 19th century. Born to Slovene parents, he obtained his Ph.D. at Vienna University, where he was later Director of the Physics Institute, Vice-President of the Vienna Academy of Sciences and a member of several scientific institutions in Europe. Stefan explored many areas in hydrodynamics, optics, acoustics, electricity, magnetism and the kinetic theory of gases. Among other things, he originated the law that the total radiation from a black body is proportional to the 4th power of its absolute temperature, known as the Stefan-Boltzmann law.
The Jožef Stefan Institute (JSI) is the leading independent scientific research institution in Slovenia, covering a broad spectrum of fundamental and applied research in the fields of physics, chemistry and biochemistry, electronics and information science, nuclear science technology, energy research and environmental science.
The Jožef Stefan Institute (JSI) is a research organisation for pure and applied research in the natural sciences and technology. Both are closely interconnected in research departments composed of different task teams. Emphasis in basic research is given to the development and education of young scientists, while applied research and development serve for the transfer of advanced knowledge, contributing to the development of the national economy and society in general.
At present the Institute, with a total of about 700 staff, has 500 researchers, about 250 of whom are postgraduates, over 200 of whom have doctorates (Ph.D.), and around 150 of whom have permanent professorships or temporary teaching assignments at the Universities.
In view of its activities and status, the JSI plays the role of a national institute, complementing the role of the universities and bridging the gap between basic science and applications.
Research at the JSI includes the following major fields: physics; chemistry; electronics, informatics and computer sciences; biochemistry; ecology; reactor technology; applied mathematics. Most of the activities are more or less closely connected to information sciences, in particular computer sciences, artificial intelligence, language and speech technologies, computer-aided design, computer architectures, biocybemetics and robotics, computer automation and control, professional electronics, digital communications and networks, and applied mathematics.
ranean Europe, offering excellent productive capabilities and solid business opportunities, with strong international connections. Ljubljana is connected to important centers such as Prague, Budapest, Vienna, Zagreb, Milan, Rome, Monaco, Nice, Bern and Munich, all within a radius of 600 km.
In the last year on the site of the Jožef Stefan Institute, the Technology park "Ljubljana" has been proposed as part of the national strategy for technological development to foster synergies between research and industry, to promote joint ventures between university bodies, research institutes and innovative industry, to act as an incubator for high-tech initiatives and to accelerate the development cycle of innovative products.
At the present time, part of the Institute is being reorganized into several high-tech units supported by and connected within the Technology park at the Jožef Stefan Institute, established as the beginning of a regional Technology park "Ljubljana". The project is being developed at a particularly historical moment, characterized by the process of state reorganisation, privatisation and private initiative. The national Technology Park will take the form of a shareholding company and will host an independent venture-capital institution.
The promoters and operational entities of the project are the Republic of Slovenia, Ministry of Science and Technology and the Jožef Stefan Institute. The framework of the operation also includes the University of Ljubljana, the National Institute of Chemistry, the Institute for Electronics and Vacuum Technology and the Institute for Materials and Construction Research among others. In addition, the project is supported by the Ministry of Economic Relations and Development, the National Chamber of Economy and the City of Ljubljana.
Jožef Stefan Institute
Jamova 39, 1000 Ljubljana, Slovenia
Tel.:-h386 1 4773 900, Fax.:-H386 1 219 385
Tlx.:31 296 JOSTIN SI
WWW: http://www.ijs.si
E-mail: matjaz.gams@ijs.si
Contact person for the Park: Iztok Lesjak, M.Sc.
Public relations: Natalija Polenec
The Institute is located in Ljubljana, the capital of the independent state of Slovenia (or S^nia). The capital today is considered a crossroad between East, West and Mediter-
CONTENTS OF Informatica Volume 26 (2002) pp. 1-439
Papers
Informatica 26:57-64.
Atluri, V. & R. Mukkamala. 2002. Employing an extended transaction model in multilevel secure transaction processing. Informatica 26:105-115.
Avdushin, & V. Serdiouk. 2002. Some approaches to information security of communication networks. Informatica 26:1-10.
Bai, C.-Y., R. Houston & G.-L. Feng. 2002. Practical construction for multicast re-keying schemes using R-S code and A-G code. Informatica 26:263-270.
BAjrc, V.B., S. Tang, H. Han & V. Brusic. 2002. Artificial neural networks based systems for recognition of genomic signals and regions: A review. Informatica 26:389-400.
Bavec, C. 2002. An assessment of the organization vir-tuality with three different reference models. Informatica 26:347-352.
Blockeel, H. & J. Struyf. 2002. Deriving biased classifiers for better ROC performance. Informatica 26:77-84.
Brandi, W. & M. Olivier. 2002. Maintaining integrity within mobile self protecting objects. Informatica 26:117-122.
Brank, J. 2002. Using image segmentation as a basis for categorization. Informatica 26:353-358.
Brest, J. & V. Žumer. 2002. A list scheduling heuristic for allocation the task graph to multiprocessors. Informatica 26:433-438.
Brumen, B., I. Golob, t. Welzer, i. Rozman, m. DružOVEC, & H. Jaakkola. 2002. Data protection for outsourced data mining. Informatica 26:205-210.
Chang, C.-C, J.-C. Chuang & T.-S. Chen. 2002. Recognition of image authenticity using significant DCT coefficients quantization. Informatica 26:359-366.
Claessens, J., B. Preneel & J. Vanderwalle. 2002. Combining World Wide Web and wireless security. Informatica 26:123-132.
Damiani, E., P. Samarati, S. De Capitani Di Vimercati & S. Paraboschi. 2002. XML access control systems: A component-based approach. Informatica 26:133-140.
Datta, a.. 2002. Efficient parallel clustering algorithms. Informatica 26:65-75.
De Win, B., B. Vanhaute & B. De Decker. 2002. How aspect-oriented programming can help to build secure software. Informatica 26:141-149.
Debudaj-Grabysz, a. 2002. a concurrent implementation of the simulated annealing by the method of multiple trials.
Diaz, J.A.P. & D.A. GutiéRREZ. 2002. Protecting the data state of mobile agents by using bitmaps and XOR operators. Informatica 26:367-372.
Erfurth, C. & W. rossak. 2002. Adapting proactive mobile agents to dynamically reconfigurable networks. Informatica 26:401^06.
Erjavec, t. 2002. Compiling and using the US-ELAN parallel corpus. Informatica 26:299-307.
Prigioni, D., L. tarantino & t. Di Mascio. 2002. On formal aspects of zooming in geographic maps. Informatica 26:309-319.
Hung, K.-L. & C.-C. Chang. 2002. An improvement of a technique for color quantization using reduction of color space dimensionality. Informatica 26:11-16.
Isazadeh, a. 2002. Software engineering: Configuration languages. Informatica 26:223-24L
Janaček, J. & R. OstertAg. 2002. Problems in practical use of electronic signatures. Informatica 26:151-157.
Korzhik, v., G. Morales-Luna, D. Marakov & I. Marakova. 2002. Digital semipublic watermarking. Informatica 26:279-285.
Krisper, M. & B. Zupan. 2002. Construction and application of hierarchical socioeconomic decision models. Informatica 26:47-56.
Li, Y., B. Yang & X. Hua. 2002. An efficient anonymous fingerprinting scheme. Informatica 26:287-290.
Mandke, V.V. & M.K. Nayar. 2002. Implementing information integrity technology—A feedback control system approach. Informatica 26:169-180.
Manger, R. 2002. Solving path problems on a network of computers. Informatica 26:91-100.
Margaritis, C., N. Kolokotronis, p. Papadopoulou & D. Matakos. 2002. Securing web-based information systems: A model and implementation guidelines. Informatica 26:159-168.
Mu, Y. & V. Varadharajan. 2002. Group cryptography: Signature and Encryption. Informatica 26:249-254.
pieprzyk, J. & X.-M. Zhang. 2002. Multisecret sharing immune against cheating. Informatica 26:271-278.
Pušnik, M., M.B. jurič & I. Rozman. 2002. Evaluation of technologies for business process automation. Informatica 26:373-380.
Rajaungham, k., d. Chadwick & B. Knight. 2002. Efficient methods for checking integrity: A structured spreadsheet engineering methodology. Informatica 26:181-189.
RakoviĆ, d. & m. dugić. 2002. a critical note on the role of the quantum mechanical "collapse" in quantum modeling of consciousness. Informatica 26:85-90.
Welzer, T. & L. Strous. matica 26:103-104.
2002. IFIP-TC 11. Infor-
Professional Societies
Jožef Stefan Institute. Ljubljana, Slovenia. 2002. Informatica 26:101,243,345,439.
ramani, A., s. vhora & s. sanyal. 2002. generation Internet protocol. Informatica 26:27-45.
The next
Sarkar, D. & P.K. Das. 2002. On mirroring, connected component labelling and topological properties of images enclosed as minimized boolean function. Informatica 26:17-25.
Savanović, a., D. Gabrijelčič, B. Jerman Blažič & S. Karnouskos. 2002. An active networks security architecture. Informatica 26:211-221.
Semeraro, G., M. Degemmis & P. Lops. 2002. User profiling to support Internet customers: What do you want to buy today? Informatica 26:407-418.
Spalka, a., A.b. Cremers & h. Langweg. 2002. Trojan horse attacks on software for electronic signatures. Informatica 26:191-203.
Tai, s.-C., C.-C. Wang & C.-s. Yu. 2002. Visual secret sharing watermarking for digital image. Informatica 26:381-388.
Taniar, D., Y. Jiang, K.H. Liu & C.H.C. Leung. 2002. Parallel aggregate-join query processing. Informatica 26:321-332.
Taweel, a. & P. Brereton. 2002. Developing software across time zones: An exploratory empirical study. Informatica 26:333-344.
Wang, C., Y. Wang & F. Zhang. 2002. An anonymous mobile agents scheme for secure web transaction over the Internet. Informatica 26:291-297.
wei, B., D. Liu & X. Wang. 2002. Analysis of AES S-box with Walsh spectrum. Informatica 26:259-262.
Wu, H., F. Bag & R.H. Deng. 2002. Cryptanalysis of some hash functions based on block ciphers and codes. Informatica 26:255-258.
Železnikar, a.p. 2002. Informon—An emergent conscious component. Informatica 26:419-431.
Editorials
Bergadano, F. & C.-K. Wu. 2002. Cryptology and network Security. Informatica 26:245-248.
INFORMATICA AN INTERNATIONAL JOURNAL OF COMPUTING AND INFORMATICS
INVITATION, COOPERATION
Submissions and Refereeing
Please submit three copies of tiie manuscript with good copies of the figures and photographs to one of the editors from the Editorial Board or to the Contact Person. At least two referees outside the author's country will examine it, and they are invited to make as many remarks as possible directly on the manuscript, from typing errors to global philosophical disagreements. The chosen editor will send the author copies with remarks. If the paper is accepted, the editor will also send copies to the Contact Person. The Executive Board will inform the author that the paper has been accepted, in which case it will be published within one year of receipt of e-mails with the text in Informatica format and figures in . eps format. The original figures can also be sent on separate sheets. Style and examples of papers can be obtained by fe-mail from the Contact Person or from FTP or WWW (see the last page of Informatica).
Opinions, news, calls for conferences, calls for papers, etc. should be sent directly to the Contact Person.
QUESTIONNAIRE
Send Informatica free of charge
Yes, we subscribe
Please, complete the order form and send it to Dr. Rudi Murn, Informatica, Institut Jožef Stefan, Jamova 39, 1111 Ljubljana, Slovenia.
Since 1977, Informatica has been a major Slovenian scientific journal of computing and informatics, including telecommunications, automation and other related areas. In its 16th year (more than five years ago) it became truly international, although it still remains connected to Central Europe. The basic aim of Informatica is to impose intellectual values (science, engineering) in a distributed organisation.
Informatica is a journal primarily covering the European com-' puter science and informatics community - scientific and educational as well as technical, commercial and industrial. Its basic aim is to enhance communications between different European structures on the basis of equal rights and international refereeing. It publishes scientific papers accepted by at least two referees outside the author's country. In addition, it contains information about conferences, opinions, critical examinations of existing publications and news. Finally, major practical achievements and innovations in the computer and information industry are presented through commercial publications as well as through independent evaluations.
Editing and refereeing are distributed; Each editor can conduct the refereeing process by appointing two new referees or referees from the Board of Referees or Editorial Board. Referees should not be from the author's country. If new referees are appointed, their names will appear in the Refereeing Board.
Informatica is free of charge for major scientific, educational and governmental institutions. Others should subscribe (see the last page of Informatica).
ORDER FORM - INFORMATICA
Name: .................................................... Office Address and Telephone (optional):
Title and Profession (optional): ....................................................................
........................................................... E-mail Address (optional): .............
Home Address and Telephone (optional): ....................
........................................................... Signature and Date: ...................
http://ai.ijs.si/informatica/ http://orca.st.usm.edu/informatica/
Referees:
"Witold Abramowicz, David Abramson, Adel Adi, Kenneth Aizawa, Suad Alagič, Mohamad Alam, Dia Ali, Alan Aliu, Richard Amoroso, John Anderson, Hans-Jurgen Appelrath, Ivän Araujo, Vladimir Bajič, Michel Barbeau, Grzegorz Bartoszewicz, Catriel Beeri, Daniel Beech, Fevzi Belli, Simon Beloglavec, Sondes Bennasri, Francesco Bergadano, Istvan Berkeley, Azer Bestavros, Andraž Bežek, Balaji Bharadwaj, Ralph Bisland, Jacek Blazewicz, Laszlo Boeszoermenyi, Damjan Bojadžijev, Jeff Bone, Ivan Bratko, Pavel Brazdil, Boštjan Brumen, Jerzy Brzezinski, Marian Bubak, Davide Bugali, Troy Bull, Leslie Burkholder, Frada Burstein, Wojciech Buszkowski, Rajkumar Bvyya, Netiva Caftori, Particia Carando, Robert Cattral, Jason Ceddia, Ryszard Choras, Wojciech Cellary, Wojciech Chybowski, Andrzej Ciepielevk-ski, Vic Ciesielski, Mel Ó Cinnéide, David Cliff, Maria Cobb, Jean-Pierre Corriveau, Travis Craig, Noel Craske, Matthevi' Crocker, Tadeusz Czachorski, Milan Češka, Honghua Dai, Bart de Decker, Deborah Dent, Andrej Dobnikar, Sait Dogru, Peter Dolog, Georg Dorfner, Ludoslaw Drelichowski, Matija Drobnič, Maciej Drozdowski, Marek Druzdzel, Marjan Družovec, Jozo Dujmovič, Pavol Duriš, Amnon Eden, Johann Eder, Hesham El-Rewini, Darrell Ferguson, Warren Fergusson, David Fiater, Pierre Flener, Wojciech Fliegner, Vladimir A. Fomichov, Terrence Forgarty, Hans Fraaije, Hugo de Garis, Eugeniusz Gatnar, Grant Gayed, James Geller, Michael Georgiopolus, Michael Gertz, Jan Golinski, Janusz Gorski, Georg Gottlob, David Green, Herbert Groiss, Jozsef Gyorkos, Marten Haglind, Abdelwahab Hamou-Lhadj, Inman Harvey, Jaak Henno, Marjan Hericko, Elke Hochmueller, Jack Hodges, Doug Howe, Rod Howell, TomäS Hruška, Don Huch, Simone Fischer-Huebner, Alexey Ippa, Hannu Jaakkola, Sushil Jajodia, Ryszard Jakubowski, Piotr Jedrzejowicz, A. Milton Jenkins, Eric Johnson, Polina Jordanova, Djani Juričič, Marko Juvancic, Sabhash Kak, Li-Shan Kang, Ivan Kapust0k, Orlando Karam, Roland Kaschek, Jacek Kierzenka, Jan Kniat, Stavros Kokkotos, Fabio Kon, Kevin Korb, Gilad Koren, Andrej Krajne, Henryk Krawczyk, Ben Kroese, Zbyszko Krolikowski, Benjamin Kuipers, Matjaž Kukar, Aarre Laakso, Les Labuschagne, Ivan Lah, Phil Laplante, Bud Lawson, Herbert Leitold, Ulrike Leopold-Wildburger, Timothy C. Lethbridge, Joseph Y-T. Leung, Barry Levine, Xuefeng Li, Alexander Linkevich, Raymond Lister, Doug Locke, Peter Lockeman, Matija Lokar, Jason Lowder, Kim Teng Lua, Ann Macintosh, Bernardo Magnini, Andrzej Malachowski, Peter Marcer, Andrzej Marciniak, Witold Marciszewski, Vladimir Marik, Jacek Martinek, Tomasz Maruszewski, Florian Matthes, Daniel Memmi, Timothy Menzies, Dieter Merkl, Zbigniew Michalewicz, Gautam Mitra, Roland Mittermeir, Madhav Moganti, Reinhard Moller, Tadeusz Morzy, Daniel Mossé, John Mueller, Jari Multisilta, Hari Narayanan, Jerzy Nawrocki, Rance Necaise, Elzbieta Niedzielska, Marian Niedq'zwiedzinski, Jaroslav Nieplocha, Oscar Nierstrasz, Roumen Nikolov, Mark Nissen, Jerzy Nogieć, Stefano Nolfi, Franc Novak, Antoni Nowakowski, Adam Nowicki, Tadeusz Nowicki, Daniel Olejar, Hubert Osterie, Wojciech Olejniczak, Jerzy Olszewski, Cherry Owen, Mieczyslaw Owoc, Tadeusz Pankowski, Jens Penberg, William C. Perkins, Warren Persons, Mitja Peruš, Stephen Pike, Niki Pissinou, Aleksander Pivk, Uliin Place, Gabika Polcicovä, Gustav Pomberger, James Pomykalski, Dimithu Prasanna, Gary Preckshot, Dejan Raković, Cveta Razdevšek Pučko, Ke Qiu, Michael Quinn, Gerald Quirchmayer, Vojislav D. Radonjic, Luc de Raedt, Ewaryst Rafajlowicz, Sita Ramakrishnan, Kai'Rannenberg, Wolf Rauch, Peter Rechenberg, Felix Redmill, James Edward Ries, David Robertson, Marko Robnik, Colette Rolland, Wilhelm Rossak, Ingrid Rüssel, A.S.M. Sajeev, Kimmo Salmenjoki, Pierangela Samarati, Bo Sanden, P. G. Sarang, Vivek Sarin, Iztok Savnik, Ichiro Satoh, Walter Schempp, Wolfgang Schreiner, Guenter Schmidt, Heinz Schmidt, Dennis Sewer, Zhongzhi Shi, Mària Smolàrovà, Carine Souveyet, William Spears, Hartmut Stadtler, Olivero Stock, Janusz Stoklosa, Przemyslaw Stpiczynski, Andrej Stritar, Maciej Stroinski, Leon Strous, Tomasz Szmue, Zdzislaw Szyjewski, Jure Šile, Metod Škarja, Jih Šlechta, Chew Lim Tan, Zahir Tari, Jurij Tasič, Gheorge Tecuci, Piotr Teczynski, Stephanie Teufel, Ken Tindell, A Min Tjoa, Vladimir Tosic, Wieslaw Traczyk, Roman Trobec, Marek Tudruj, Andrej Ule, Amjad Umar, Andrzej Urbanski, Marko Uršič, Tadeusz Usowicz, Romana Vajde Horvat, Elisabeth Valentine, Kanonkluk Vanapipat, Alexander P. Vazhenin, Jan Verschuren, Zygmunt Vetulani, Olivier de Vel, Valentino Vranic, Jožef Vyskoc, Eugene Wallingford, Matthew Warren, John Weckert, Michael Weiss, Tatjana Welzer, Lee White, Gerhard Widmer, Stefan Wrobel, Stanislaw Wrycza, Janusz Zalewski, Damir Zazula, Yanchun Zhang, Ales Zivkovic, Zonling Zhou, Robert Zorc, Anton P. Železnikar
EDITORIAL BOARDS, PUBLISHING COUNCIL
Informatica is a journal primarily covering the European computer science and informatics community; scientific and educational as well as technical, commercial and industrial. Its basic aim is to enhance communications between different European structures on the basis of equal rights and international referee-ing. It publishes scientific papers accepted by at least two referees outside the author's country. In addition, it contains information about conferences, opinions, critical examinations of existing publications and news. Finally, major practical achievements and innovations in the computer and information industry are presented through commercial publications as well as through independent evaluations.
Editing and refereeing are distributed. Each editor from the Editorial Board can conduct the refereeing process by appointing two new referees or referees from the Board of Referees or Editorial Board. Referees should not be from the author's country. If new referees are appointed, their names will appear in the list of referees. Each paper bears the name of the editor who appointed the referees. Each editor can propose new members for the Editorial Board or referees. Editors and referees inactive for a longer period can be automatically replaced. Changes in the Editorial Board are confirmed by the Executive Editors.
The coordination necessary is made through the Executive Editors who examine the reviews, sort the accepted articles and maintain appropriate international distribution. The Executive Board is appointed by the Society Informatika. Informatica is partially supported by the Slovenian Ministry of Science and Technology.
Each author is guaranteed to receivc the reviews of his article. When accepted, publication in Informatica is guaranteed in less than one year after the Executive Editors receive the corrected version of the article.
Executive Editor - Editor in Chief
Anton P. Železnikar Volaričeva 8, Ljubljana, Slovenia
s51em@lea.hamradio.si
http ://lea.hamradio.si/"s5lem/
Executive Associate Editor (Contact Person)
Matjaž Gams, Jožef Stefan Institute
Jamova 39, 1000 Ljubljana, Slovenia
Phone: -1-386 1 4773 900, Fax: 4-386 1 219 385
mat jaz.gams@ijs.si
http://ai.ijs.si/mezi/matjaz.html
Executive Associate Editor (Technical tlditor)
Rudi Mum, Jožef Stefan Institute
Publishing Council:
Tomaž Banovec, Ciril Baškovič, Andrej Jerman-Blažič, Jožko Čuk, Vladislav Rajkovič
Board of Advisors:
Ivan Bratko, Marko Jagodic, Tomaž Pisanski, Stanko Strmčnik
Editorial Board
Suad Alagić (Bosnia and Herzegovina)
Vladimir Bajić (Republic of South Africa)
Vladimir Batagelj (Slovenia)
Francesco Bergadano (Italy)
Leon Birnbaum (Romania)
Marco Botta (Italy)
Pavel Brazdil (Portugal)
Andrej Brodnik (Slovenia)
Ivan Bruha (Canada)
Se Woo Cheon (Korea)
Hubert L. Dreyfus (USA)
Jozo Dujmović (USA)
Johann Eder (Austria)
Vladimir Fomichov (Russia)
Georg Gottlob (Austria)
Janez Grad (Slovenia)
Francis Heylighen (Belgium)
Hiroaki Kitano (Japan)
Igor Kononenko (Slovenia)
Miroslav Kubat (USA)
Ante Laue (Croatia)
Jadran Lenarčič (Slovenia)
Huan Liu (Singapore)
Ramon L. de Mantaras (Spain)
Magoroh Maruyama (Japan)
Nikos Mastorakis (Greece)
Angelo Montanari (Italy)
Igor Mozetič (Austria)
Stephen Muggleton (UK)
Pavol Navrat (Slovakia)
Jerzy R. Nawrocki (Poland)
Roumen Nikolov (Bulgaria)
Franc Novak (Slovenia)
Marcin Paprzycki (USA)
Oliver Popov (Macedonia)
Kari H. Pribram (USA)
Luc De Raedt (Belgium)
Dejan Raković (Yugoslavia)
Jean Ramaekers (Belgium)
Wilhelm Rossak (USA)
Ivan Rozman (Slovenia)
Claude Sammut (Australia)
Sugata Sanyal (India)
Walter Schempp (Germany)
Johannes Schwinn (Germany)
Zhongzhi Shi (China)
Branko Souček (Italy)
Oliviero Stock (Italy)
Petra Stoerig (Germany)
Jin Šlechta (UK)
Gheorghe Tecuci (USA)
Robert Trappl (Austria)
Terry Winograd (USA)
Stefan Wrobel (Germany)
Xindong Wu (Australia)
An International Journal of Computing and Informatics
An Assessment of the Organization Virtuality with Three Different Reference Models Using Image Segmentation as a Basis for Categorization
Recognition of Image Authenticity Using Significant DCT Coefficients Quantization
Protecting the Data State of Mobile Agents by Using
Bitmaps and XOR Operators
Evaluation of Technologies for Business Process
Automation
Visual Secret Sharing Watermarking for Digital Image
Artificial Neural Networks Based Systems for Recognition of Genomic Signals and Regions: A Review
Adapting Proactive Mobile Agents to Dynamically Reconfigurable Networks
User Profiling to Support Internet Customers: What do You Want to Buy Today?
Informon—An Emergent Conscious Component A List Scheduling Heuristic for Allocating the Task Graph to Multiprocessors
C. Bavec J. Brank
C.-C.	Chang, J.-C. Chuang, T.-S.Chen
J. A. P Diaz
D.	A. Gutiérrez M. Pušnik, M.B. Jurič,
I. Rozman S.-C.Tai, C.-C. Wang, C.-S. Yu V^B.B^ic^ S.Tang, H. Han, V. Brusic C.Erfiirth, W. Rossak G. Semeraro, M. Degemmis, P. Lops
A.P. Železnikar J. Brest, V. Žumer
347 353 359
367 373
381
389
401 407
419 433