Abdelsalam Almarimi, Pokorný Jaroslav
Schema Management for Data Integration: A Short Survey
In: Acta Polytechnica, Volume: 45, No: 1, Czech Technical University in Prague, Prague, 2005, pp. 24-27.
Schema management is a basic problem in many database application domains such as data integration systems. Users need to access and manipulate data from several databases. In this context, in order to integrate data from distributed heterogeneous database sources, data integration systems demand the resolution of several issues that arise in managing schemas. In this paper, we present a brief survey of the problem of schema matching which is used for solving problems of schema integration processing. Moreover, we propose a technique for integrating and querying distributed heterogeneous XML schemas.
Abdelsalam Almarimi, Pokorný Jaroslav
A Mediation Layer for Heterogenous XML Schemas
In: International Journal of Web Information Systems, Volume: 1, No: 1, Troubador Publishing LTD, 2005, pp. 25-32.
Presented at: iiWAS2004 Information Integration and Web Based Applications & Services, 27-29.09.2004, Jakarta,
Indonesia.
This paper describes an approach for mediation of heterogeneous XML schemas. Such an approach is proposed as a tool for XML data integration system. A global XML schema is specified by the designer to provide a homogeneous view over heterogeneous XML data. An XML mediation layer is introduced to manage: (1) establishing appropriate mappings between the global schema and the schemas of the sources; (2) querying XML data sources in terms of the global schema. The XML data sources are described by XML Schema language. The former task is performed through a semi-automatic process that generates local and global paths. A tree structure for each XML schema is constructed and represented by a simple form. This is in turn used for assigning indices manually to match local paths to corresponding global paths. By gathering all paths with the same indices, the equivalent local and global paths are grouped automatically, and an XML Metadata Document is constructed. An XML Query Translator for the latter task is described to translate a global user query into local queries by using the mappings that are defined in the XML Metadata Document.
Abdelsalam Almarimi, Pokorný Jaroslav
A Mediation Layer for Heterogenous XML Schemas
Presented at: iiWAS2004 Information Integration and Web Based Applications & Services, 27-29.09.2004, Jakarta,
Indonesia.
This paper describes an approach for mediation of heterogeneous XML schemas. Such an approach is proposed as a tool for XML data integration system. A global XML schema is specified by the designer to provide a homogeneous view over heterogeneous XML data. An XML mediation layer is introduced to manage: (1) establishing appropriate mappings between the global schema and the schemas of the sources; (2) querying XML data sources in terms of the global schema. The XML data sources are described by XML Schema language. The former task is performed through a semi-automatic process that generates local and global paths. A tree structure for each XML schema is constructed and represented by a simple form. This is in turn used for assigning indices manually to match local paths to corresponding global paths. By gathering all paths with the same indices, the equivalent local and global paths are grouped automatically, and an XML Metadata Document is constructed. An XML Query Translator for the latter task is described to translate a global user query into local queries by using the mappings that are defined in the XML Metadata Document.
Ali K., Pokorný Jaroslav
XML-based Temporal Models
Technical Report: DC-2006-02, Dep. of Comp. Sc. and Engineering, FEE TU, Prague, 2006, 39 p.
Much research work has recently focused on the problem of representing historical information in XML. This report describes a number of temporal XML data models and provides their comparison according to the following properties: time dimension (valid time, transaction time), support of temporal elements and attributes, querying possibilities, association to XML Schema/DTD, and influence on XML syntax.
Ali K., Pokorný Jaroslav
A Three-Dimensional XML-Based Model
In: SOFSEM 2008: Theory and Practice of Computer Science, LNCS 4910, Springer, 2008, pp. 659-671.
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
Much research work has recently focused on the problem of
representing historical information in XML. In this paper, we describe an
ongoing work to represent XML changes. Our model is a three-dimensional
XML-based model (3D_XML in short) for representing and querying histories
of XML documents. The proposed model incorporates three time dimensions,
valid time, transaction time, and efficacy time without extending the syntax of
XML. We use XQuery to express complex temporal queries on the evolution of
the document contents. We believe that native XML databases (NXDs) present
a viable alternative to relational temporal databases when complex time
dependent data has to be manipulated and stored. So NXDs will be our choice.
Azarová Irina, Sinopalníková Anna
Using Corpus Statistics for Wordnet Structuring
In: Proceedings of the Second International Conference on Corpus Linguistics (Corpora-2004), Saint-Petersburg State University Press, Saint-Petersburg, Russia, 2004, pp. 3-11.
Presented at: Second International Conference on Corpus Linguistics (Corpora-2004), , Saint-Petersburg, Russia.
Azarová Irina, Sinopalníková Anna, Yavorskaya Maria
Guidelines for RussNet structuring (Guidelines for RussNet structuring)
In: Proceedings of the Dialogue 2004 International Conference on the Computational Linguistics and Intellectual Technologies, Moscow: Nauka, Moscow, 2004, pp. 232-241.
ISBN: 5-02-002826-6
Presented at: Dialogue 2004 International Conference on the Computational Linguistics and Intellectual Technologies, 2004, Moscow, Russia.
Bartoň Stanislav
Indexing Structure for Discovering Relationships in RDF Graph Recursively Applying Tree Transformation
In: Proceedings of Semantic Web Workshop at 27th Annual International ACM SIGIR Conference, 2004, pp. 58-68.
Presented at: Semantic Web Workshop at 27th Annual International ACM SIGIR Conference, 25.7.-29.7. 2004, University of Sheffield, Sheffield,
Great Britain.
Discovering the complex relationships between entities is one way of benefitting from the Semantic Web. This paper discusses new approaches to implementing rho-operators into RDF querying engines which will enable discovering such relationships viable. The cornerstone of such implementation is creating an index which describes the original RDF graoh. The index is created by recursive application of a transformation of graph to forest of trees and then to each tree its extended signature is created. The signatures are accompanied by the additional information about transformed problematic nodes breaking the tree structure. The components described by the signatures are assumed as a single node in the following step. The transitions between the signatures represent edges.
Bartoň Stanislav, Zezula Pavel
RhoIndex - An Index for Graph Structured Data
Presented at: 8th International DELOS Workshop on Future Digital Library Management Systems, 29.3.-1.4.2005, Schloss Dagstuhl,
Germany.
The effort described in this paper introduces an indexing structure for path search in the graph structured data called rho-index. It is based on a graph segmentation S(G) that is meant to represent the indexed graph G in a simpler manor yet having similar properties as the graph G had. This is achieved using graph transformations and a special type of a matrix used to represent the transformed graph.
Bartoň Stanislav, Zezula Pavel
Designing and Evaluating an Index for Graph Structured Data
In: Proceedings of ICDM MCD 2006, IEEE Press, Hong Kong, China, 2006, pp. 1-5.
Presented at: The Second International Workshop on Mining Complex Data - MCD'06 - In Conjunction with IEEE ICDM'06, 18.12.-22.12.2006, Hong Kong,
China.
Bartoň Stanislav, Zezula Pavel
Mining Citation Graphs Employing an Index for Graph Structured Data
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 1-9.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
In this paper a correlation between rho-operators and indirect
relations in citation analysis is presented. The rho-operators were defined
to explore complex relationships in graph structured data. Various direct
and indirect relations identified in citation analysis are used to study the
semantics within the citation network. The rho-index was used to implement the rho - path search in the citation network and gained results are
evaluated and discussed.
Bartoň Stanislav, Zezula Pavel
rho-Index - Designing and Evaluating an Indexing Structure for Graph Structured Data
Technical Report: FIMU-RS-2006-07, FI MU, Brno, 2006, 24 p.
Bartoň Stanislav
Searching Indirect Relationships in Citation Analysis Using an Index for Graph Structured Data
In: 2nd Doctoral Workshop on Mathematical and Engineering Methods in Computer Science MEMICS 2006, 2006, pp. 9-16.
Presented at: 2nd Doctoral Workshop on Mathematical and Engineering Methods in Computer Science MEMICS 2006, 27.10.-29.10.2006, Mikulov,
Czech Republic.
Bartoň Stanislav, Dohnal Vlastislav, Sedmidubský Jan, Zezula Pavel
Gauging the Evolution of Metric Social Network
In: 5th International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2007) held at 33rd International Conference on Very Large Data Bases (VLDB 2007), 2007, pp. 12.
Presented at: Fifth International Workshop on Databases, Information Systems and Peer-to-Peer Computing (DBISP2P 2007), 24.9.2007, Vienna,
Austria.
In this paper, we tackle the issues of analyzing the struc-
tural evolution of the metric social network. The metric social network
operates in a P2P environment where peers maintain their own data
and the relationships among them are formed on the basis of the pro-
cessed similarity queries. The evolution is analyzed by traditional social
networking tools the characteristic path length and the clustering co-
efficient. Nonetheless, due to the special structure of the metric social
network, own designed gauges the average overlap and robustness of
description coefficients are presented to analyze the structure of emerg-
ing communities encompassing similar data.
Bartoň Stanislav, Zezula Pavel
Indexing Structure for Graph-Structured Data
In: Studies in Computational Intelligence, Volume: 165, Springer Berlin/Heidelberg, Berlin, 2008, pp. 167-188.
Bartoň Stanislav, Dohnal Vlastislav, Sedmidubský Jan, Zezula Pavel
Building Self-Organized Image Retrieval Network
In: Proceedings of 6th Workshop on Large-Scale Distributed Systems for Information Retrieval (LSDS-IR '08), ACM, USA, 2008.
(in_print)
Basovník Stanislav, Dekár Martin, Jusko Pavol, Mikulík Andrej, Obdržálek David, Pechal Radim, Petrůšek Tomáš, Piták Roman
Logion - A Robot Which Collects Rocks
In: Proc. of International Conference on Research and Education in Robotics, 2008, pp. 276-287.
ISBN: 978-80-7378-042-5
Presented at: EUROBOT 2008: International Conference on Research and Education in Robotics, 21.-24.5.2008, Heidelberg, Germany.
Batko Michal, Skopal Tomáš, Lokoč Jakub
New Dynamic Construction Techniques for M-tree
In: Journal of Discrete Algorithms, Elsevier, Amsterdam, The Netherlands, 2008.
(in_print)
Since its introduction in 1997, the M-tree became a respected metric access method (MAM), while remaining, together with its descendants, still the only database-friendly MAM, that is, a dynamic structure persistent in paged index. Although there have been many other MAMs developed over the last decade, most of them require either static or expensive indexing. By contrast, the dynamic M-tree construction allows us to index very large databases in subquadratic time, and simultaneously the index can be maintained up-to-date (i.e., supports arbitrary insertions/deletions). In this article we propose two new techniques improving dynamic insertions in M-tree—the forced reinsertion strategies and so-called hybrid-way leaf selection. Both of the techniques preserve logarithmic asymptotic complexity of a single insertion, while they aim to produce more compact M-tree hierarchies (which leads to faster query processing). In particular, the former technique reuses the well-known principle of forced reinsertions, where the new insertion algorithm tries to re-insert the content of an M-tree leaf that is about to split in order to avoid that split. The latter technique constitutes an efficiency-scalable selection of suitable leaf node wherein a new object has to be inserted. In the experiments we show that the proposed techniques bring a clear improvement (speeding up both indexing and query processing) and also provide a tuning tool for indexing vs. querying efficiency trade-off. Moreover, a combination of the new techniques exhibits a synergic effect resulting in the best strategy for dynamic M-tree construction proposed so far.
Batko Michal, Novák David, Falchi Fabrizio, Zezula Pavel
Scalability Comparison of Peer-to-Peer Similarity Search Structures
In: Future Generation Computer Systems, Volume: 24, No: 8, Elsevier, Amsterdam, The Netherlands, 2008, pp. 834-848.
Batko Michal, Dohnal Vlastislav, Zezula Pavel
M-Grid: Similarity Searching in Grids
In: Proceedings of International Workshop on Information Retrieval in Peer-to-Peer Networks, ACM Press, Arlington, 2006, pp. 1-8.
Batko Michal, Novák David, Falchi Fabrizio, Zezula Pavel
On Scalability of the Similarity Search in the World of Peers
In: InfoScale '06: Proceedings of the 1st international conference on Scalable information systems, ACM Press, New York, NY, USA, 2006, pp. 1-12.
ISBN: 1-59593-428-6
Due to the increasing complexity of current digital data,
similarity search has become a fundamental computational
task in many applications. Unfortunately, its costs are still
high and the linear scalability of single server implementations
prevents from efficient searching in large data volumes.
In this paper, we shortly describe four recent scalable
distributed similarity search techniques and study their performance
of executing queries on three different datasets.
Though all the methods employ parallelism to speed up
query execution, different advantages for different objectives
have been identified by experiments. The reported results
can be exploited for choosing the best implementations
for specific applications. They can also be used for designing
new and better indexing structures in the future.
Batko Michal, Novák David, Zezula Pavel
MESSIF: Metric Similarity Search Implementation Framework
In: Digital Libraries: Research and Development, Springer-Verlag, LNCS 4877, Berlin, Heidelberg, 2007, pp. 1-10.
ISBN: 978-3-540-77087-9
Batko Michal, Novák David, Zezula Pavel
MESSIF: Metric Similarity Search Implementation Framework
In: DELOS Conference 2007 - Working Notes, Information Society Technologies, Pisa, Italy, 2007, pp. 11-23.
Presented at: DELOS Conference 2007, 13-14.2.2007, Pisa,
Italy.
The similarity search has become a fundamental computational task in many applications. One
of the mathematical models of the similarity the metric space has drawn attention of many
researchers resulting in several sophisticated metric-indexing techniques. An important part of a
research in this area is typically a prototype implementation and subsequent experimental evaluation
of the proposed data structure. This paper describes an implementation framework called MESSIF
that eases the task of building such prototypes. It provides a number of modules from basic storage
management to automatic collecting of performance statistics. Due to its open and modular design it
is also easy to implement additional modules if necessary. The MESSIF also offers several ready-to-use
generic clients that allow to control and test the index structures and also measure its performance.
Batko Michal, Skopal Tomáš, Lokoč Jakub
New Dynamic Construction Techniques for M-tree
In: Journal of Discrete Algorithms, Elsevier, Amsterdam, The Netherlands, 2008.
(in_print)
Since its introduction in 1997, the M-tree became a respected metric access method (MAM), while remaining, together with its descendants, still the only database-friendly MAM, that is, a dynamic structure persistent in paged index. Although there have been many other MAMs developed over the last decade, most of them require either static or expensive indexing. By contrast, the dynamic M-tree construction allows us to index very large databases in subquadratic time, and simultaneously the index can be maintained up-to-date (i.e., supports arbitrary insertions/deletions). In this article we propose two new techniques improving dynamic insertions in M-tree—the forced reinsertion strategies and so-called hybrid-way leaf selection. Both of the techniques preserve logarithmic asymptotic complexity of a single insertion, while they aim to produce more compact M-tree hierarchies (which leads to faster query processing). In particular, the former technique reuses the well-known principle of forced reinsertions, where the new insertion algorithm tries to re-insert the content of an M-tree leaf that is about to split in order to avoid that split. The latter technique constitutes an efficiency-scalable selection of suitable leaf node wherein a new object has to be inserted. In the experiments we show that the proposed techniques bring a clear improvement (speeding up both indexing and query processing) and also provide a tuning tool for indexing vs. querying efficiency trade-off. Moreover, a combination of the new techniques exhibits a synergic effect resulting in the best strategy for dynamic M-tree construction proposed so far.
Batko Michal, Novák David, Falchi Fabrizio, Zezula Pavel
Scalability Comparison of Peer-to-Peer Similarity Search Structures
In: Future Generation Computer Systems, Volume: 24, No: 8, Elsevier, Amsterdam, The Netherlands, 2008, pp. 834-848.
Batko Michal, Kohoutková Petra, Zezula Pavel
Combining Metric Features in Large Collections
In: 1st International Workshop on Similarity Search and Applications (SISAP 2008), IEEE Computer Society, Los Alamitos CA, Washington, Tokyo, 2008, pp. 79-86.
Batko Michal, Falchi Fabrizio, Lucchese Claudio, Novák David, Perego Raffaele, Rabitti Fausto, Sedmidubský Jan, Zezula Pavel
Crawling, Indexing, and Similarity Searching Images on the Web
In: Proceedings of the Sixteenth Italian Symposium on Advanced Database, 2008, pp. 382-389.
Bednárek David
Output-Driven XQuery Evaluation
In: Proc. of 2nd International Symposium on Intelligent Distributed Computing, Studies in Computational Intelligence, (Ed. C. Badica et al.), Volume: 162, Springer-Verlag, Heidelberg, 2008, pp. 55-64.
ISBN: 978-3-540-85256-8
Presented at: IDC 2008: 2nd International Symposium on Intelligent Distributed Computing, 18.-19.9.2008, Catania, Italy.
Bednárek David
Reducing Temporary Trees in XQuery
In: Proc. of 12th Advances in Databases and Information Systems, LNCS 5207, Springer-Verlag, Berlin, 2008, pp. 30-45.
ISBN: 978-3-540-85712-9
Presented at: ADBIS 2008: 12th Advances in Databases and Information Systems, 5.-9.9.2008, Pori, Finland.
Bednárek David, Yaghob Jakub, Zavoral Filip
Fine Grained Access Rights Definition in a Three Tiered Information System
In: Proc. of 5th International Conference on Innovations in Information Technology, IEEE Computer Society Press, 2008.
(in_print)
Presented at: Innovations 2008: 5th International Conference on Innovations in Information Technology, 16.-18.12.2008, Al Ain, United Arab Emirates.
Bednárek David
Extending Datalog to Cover XQuery
In: Proc. of Information Technologies - Application and Theory, (Ed. P. Vojtáš), PONT, Seňa, 2008, pp. 1-6.
ISBN: 978-80-969184-9-2
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2008, 22.-26.9.2008, High Tatras,
Slovakia.
Bednárek David
Statická typová kontrola XSLT programů
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 393-401.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
Bednárek David, Obdržálek David, Yaghob Jakub, Zavoral Filip
Data Integration Using DataPile Structure
In: Proceedings of the 9th East-European Conference on Advances in Databases and Information Systems, Tallin, 2005, pp. 178-188.
Presented at: 9th East-European Conference on Advances in Databases and Information Systems (ADBIS 2005), 12.9.-15.9.2005, Tallin,
Estonia.
One of the areas of data integration covers systems that maintain co-herence among a heterogeneous set of databases. Such a system repeatedly col-lects data from the local databases, synchronizes them, and pushes the updates back. One of the key problems in this architecture is the conflict resolution. When data in a less relevant data source changes, it should not cause any data change in a store with higher relevancy. To meet such requirements, we propose a DataPile structure with following main advantages: effective storage of historical versions of data, straightfor-ward adaptation to global schema changes, separation of data conversion and replication logic, simple implementation of data relevance. Key usage of such mechanisms is in projects with following traits or require-ments: integration of heterogeneous data from sources with different reliability, data coherence of databases whose schema differs, data changes are performed on local databases and minimal load on the central database.
Bednárek David
Turingovské vzory v XSLT programech
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Bednárek David
Optimizing XQuery/XSLT programs using backward analysis
In: Proceedings of ITAT 2007, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), PONT s.r.o., Seňa, 2007, pp. 17-22.
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2007, 21.-27.9.2007, Polana,
Slovakia.
Benda J., Obdržálek David
GFE - Graphical Finite State Machine Editor for Parallel Execution
In: Workshop on Educational Robotics, DIEES, 2006, pp. 41-47.
Presented at: Workshop on Educational Robotics 2006, 1.6.2006, Acireale, Italy.
Beuster Gerd, Neruda Roman
Configuring Computational Agents
In: Knowledge Grid and Grid Intelligence 2004, Halifax, Saint Mary`s University, 2004, pp. 57-62.
ISBN: 0-9734039-8-5
Presented at: International Workshop on Knowledge Grid and Grid Intelligence (2.), 20.-24.09.2004, Peking,
China.
A formalism for the logical description of computational agents and multi-agent systems is given. It is explained how it such a formal description can be used to configure and reason about multi-agent systems realizing computational intelligence models. A usage within a real software system Bang 3 is demonstrated. The logical description of multiagent systems opens Bang 3 for interaction with ontology based distributed knowledge systems like the Semantic Web or the Knowledge Grid.
Bosch Sonja, Fellbaum Christiane, Pala Karel
Derivational Relations in English, Czech and Bantu Wordnet
In: Proc. of Fourth Global WordNet Conference, University of Szeged, Department of Informatics, 2008, pp. 74-90.
Presented at: GWC 2008, 22.-25.1.2008, Szeged,
Hungary.
Bustos B., Skopal Tomáš
Dynamic Similarity Search in Multi-Metric Spaces
In: Proceedings of ACM MIR 2006 (a workshop at ACM Multimedia 2006), ACM Press, Santa Barbara, CA, USA, 2006.
Presented at: ACM MIR 2006, 26.10.-27.10.2006, Santa Barbara,
CA, USA.
Daniel Milan
The minC Combination of Belief Functions: Derivation and Formulas
Technical Report: V-964, ICS AS CR, Prague, 2006, 19 p.
Daniel Milan
Formulas for Min-C Combination of Belief Functions
In: Proceedings of Eight International Conference on Fuzzy Sets Theory and Applications, (Ed. Klement P.), Academy of Armed Forces of General M. R. Štefánik, Liptovský Mikuláš, 2006, pp. 32-33.
ISBN: 80-8040-284-1
Presented at: Eight International Conference on Fuzzy Sets Theory and Applications, 30.1.-3.2.2006, Liptovský Ján,
Slovakia.
Daniel Milan
A Generalization of the minC Combination to DSm Hyper-power Sets
In: Proceedings of Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), Edition EDK, 2006, pp. 420-427.
ISBN: 2-84254-112-X
Presented at: Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), 2.7.-7.7.2006, Paris,
France.
Daniel Milan
Several Comments to Josang`s `normal` Coarsening
In: Proceedings of Czech-Japan Seminar on Data Analysis and Decision-Making under Uncertainty, (Ed. Watada J.), Wasada University, Kitakyushu, 2006, pp. 71-78.
Presented at: Czech-Japan Seminar on Data Analysis and Decision-Making under Uncertainty, 18.8.-22.8.2006, Kitakyushu, Japan.
Daniel Milan
What is it Hybrid DSm Rule for Combination of Belief Functions?
In: Proceedings of WUPES 2006, University of Economics, Prague, 2006, pp. 36-51.
Presented at: 7th Workshop on Uncertainty Processing WUPES'06 , 16.9.-20.9.2006, Mikulov,
Czech Republic.
Daniel Milan
A Comparison of the Generalized minC combination and the Hybrid DSm combination Rules
In: Advances and Applications od DSmT for Information Fusion, (Ed. Smarandache F.), Volume: II, American Research Press, 2006, pp. 113-129.
ISBN: 1-59973-000-6
Daniel Milan
Classical Combination Rules Generalized to DMs Hyper-Power Sets and their Comparison with the Hybrid DSm rule
In: Advances and Applications od DSmT for Information Fusion, (Ed. Smarandache F.), Volume: II, American Research Press, 2006, pp. 89-112.
ISBN: 1-59973-000-6
Daniel Milan
Generalization of the Classic Combination Rules to DSm Hyper-Power Sets
In: Information & Security, Volume: 20, 2006, pp. 50-64.
In this article, the author generalizes Dempster`s rule, Yager`s rule,
and Dubois-Prade`s rule for belief functions combination in order to be applicable
to hyper-power sets according to the Dezert-Smarandache (DSm) Theory. A
comparison of the rules with the DSm rule of combination is further presented.
Daniel Milan
Classical Belief Conditioning and its Generalization to DSm Theory
In: Proceedings of The 6th International Conference on Information and Management Sciences, California Polytechnic State University, Berlin, 2007, pp. 596-603.
Presented at: The Sixth International Conference on Information and Management Sciences (IMS2007), 1.-6.7.2007, Lhasa,
Tibet, China.
Daniel Milan
Several Comments and Questions to Josang`s Smooth Coarsening
In: Proceedings of Czech-Japan Seminar on Data Analysis and Decision Making under Uncertainty, (Ed. Kroupa T., Vejnarová J.), UTIA AV ČR, Praha, 2007, pp. 27-40.
Presented at: Czech-Japan Seminar on Data Analysis and Decision Making under Uncertainty, 15.-18.09.2007, Liblice,
Czech Republic.
Daniel Milan
The DSm Approach as a Special Case of the Dempster-Shafer Theory
In: ECSQARU 2007, (Ed. Mellouli K.), LNAI 4724, Springer-Verlag, 2007, pp. 381-392.
Presented at: ECSQARU 2007, 31.10.-2.11.2007, Hammamet,
Tunisia.
This contribution deals with a belief processing which enables managing of
multiple and overlapping elements of a frame of discernment.
An outline of the Dempster-Shafer theory for such cases is presented,
including several types of constraints for simplification of its large
computational complexity.
DSmT - a new theory rapidly developing the last five years - is briefly
introduced.
Finally, it is shown that the DSmT is a special case of the general
Dempster-Shafer approach.
Daniel Milan
Comments on Josang's Normal Coarsening and Consensus Operator
In: IJICIC, Volume: 4, No: 5, 2008, pp. 1079-1088.
Definitions of two different ways of coarsening of basic belief assignments to
opinions the simple coarsening and the normal coarsening are recalled in this contribution.
A relation of results of combination of the normal opinions using the consensus
operator to belief functions on an original n-element frame of discernment is examined.
A questionable meaning of the normal coarsening is discussed.
Daniel Milan
Contribution of DSm Approach to the Belief Function Theory
In: IPMU 2008, 2008, pp. 417-424.
Presented at: IPMU 2008, 22.-27.06.2008, Malaga - Torremolinos,
Spain.
In this study we engage in belief processing which enables managing
of multiple and overlapping elements of a frame of discernment.
Our focus is directed on DSm approach, which was originally introduced as a
generalization of
the Dempster-Shafer theory. Paradoxically, later it was presented also as a special
case of the Dempster-Shafer approach applied on such frame of discernment.
In this paper we discuss what is new in DSm approach, what are the benefits of DSm
approach, and what is its real contribution to belief function theory.
Daniel Milan
Relations of DSm Belief Conditioning Rules to Dempster-Shafer Approach
In: Proceedings of Czech-Japan Seminar on Data Analysis and Decision-Making under Uncertainty, (Ed. T. Itoh, A. Shioura), 2008, pp. 63-68.
Presented at: 11th Czech-Japan Seminar on Data Analysis and Decision-Making under Uncertainty, 15.-17.9.2008, Sendai - Tohoku University, Japan.
Daniel Milan
Mathematical Structure and Formulas for the minC Combination of Belief Functions
In: Fusion 2008, 2008, pp. 760-767.
Presented at: Fusion 2008, 30.06-03.07.2008, Cologne,
Germany.
Principal ideas of the minC combination are recalled.
A mathematical structure of generalized frames of discernment is formalized.
A generalized schema for a computation of the minC combination is presented.
Conflicting belief masses redistribution among non-conflicting focal elements is
overviewed.
Final general formulas for computation of the minC combination are presented.
Some examples of computation of the minC combination follow.
Related works and approaches are referred in the end.
Dědek Jan, Eckhardt Alan, Vojtáš Peter
Experiments with Czech Linguistic Data and ILS
In: Inductive Logic Programming (Late Breaking Papers), Action M, Prague, 2008, pp. 20-25.
ISBN: 978-80-86742-26-7
Presented at: ILP 2008: Inductive Logic Programming, 10.-12.9.2008, Prague, Czech Republic.
Dědek Jan, Vojtáš Peter
Computing aggregations from linguistic web resources: a case study in Czech Republic sector/traffic accidents
In: Proc. of International Conference on Advanced Engineering Computing and Applications in Science, IEEE Computer Society Press, 2008, pp. 7-12.
ISBN: 978-0-7695-3369
Presented at: ADVCOMP 2008: International Conference on Advanced Engineering Computing and Applications in Science, 29.9.-4.10.2008, Valencia, Spain.
Dědek Jan, Vojtáš Peter
Extrakce informací z textově orientovaných zdrojů webu
In: Znalosti 2008, (Ed. V. Snášel), Vydavatelstvo STU, Bratislava, 2008.
Presented at: Znalosti 2008, 13.-15.2.2008, Bratislava,
Slovakia.
V tomto příspěvku se zbýváme extrakcí informací z webových
zdrojů převážně textového charakteru. K tomuto účelu jsme se pokusili
využít několik lingvistických nástrojů pro zpracování přirozeného
textu v češtině. Jmenovitě se jedná o nástroje pražského projektu PDT
a český WordNet. Cílem příspěvku je přiblížit možnosti, které tyto nástroje
pro extrakci informací z textu poskytují. Extrakcí informací se
zde zabýváme především v kontextu sémantického webu a zkoumáme
možnosti, jak tyto nástroje využít pro automatizaci sémantické anotace
stránek současného webu.
Dědek Jan, Eckhardt Alan, Galamboš Leo, Vojtáš Peter
Sémantický web
In: DATAKON 2008, (Ed. Řepa V., Svatoš O.), Masaryk university, 2008, pp. 12-30.
Presented at: DATAKON 2008, 18.-21.10.2008, Brno,
Czech Republic.
Dohnal Vlastislav, Gennaro Claudio, Zezula Pavel
Efficiency and Scalability Issues in Metric Access Methods
In: Computational Intelligence in Medical Informatics, Springer Verlag, Berlin, Germany, 2008.
ISBN: 978-3-540-75766-5
The metric space paradigm has recently received attention as an
important model of similarity in the area of Bioinformatics. Numerous techniques have been proposed to solve similarity (range or
nearest-neighbor) queries on collections of data from metric domains. Though important representatives are outlined, this chapter is not trying to
substitute existing comprehensive surveys. The main objective is to explain and prove by experiments that similarity searching is typically an expensive
process which does not easily scale to very large volumes of data, thus distributed architectures able to exploit parallelism must be employed.
After a review of applications using the metric space approach in the field of Bioinformatics, the chapter provides an overview of methods used for
creating index structures able to speedup retrieval. In the metric space approach, only pair-wise distances between objects are quantified, so they
represent the level of dissimilarity. The key idea of index structures is to partition the data into subsets so that queries are evaluated without
examining entire collections -- minimizing both the number of distance computations and the number of I/O accesses. These objectives are obtained
by exploiting the property of metric spaces called the triangle inequality which states that if two objects are near a third object, they cannot be too
distant to one another. Unfortunately, computational costs are still high and the linear scalability of single-computer implementations prevents from
searching in large and ever growing data files efficiently. For these reasons, we describe very recent parallel and distributed similarity search
techniques and study performance of their implementations. Specifically, Section 12.1 presents the metric space approach and its applications in the
field of Bioinformatics. Section 12.2 describes some of the most popular centralized disk-based metric indexes. Consequently, Section
12.3 concentrates on parallel and distributed access methods which can deal with data collections that for practical purposes can be arbitrary large, which
is typical for Bioinformatics workloads. An experimental evaluation of the presented distributed approaches on real-life data sets is presented in 12.4.
The chapter concludes in Section 12.5.
Dohnal Vlastislav, Sedmidubský Jan, Zezula Pavel, Novák David
Similarity Searching: Towards Bulk-loading Peer-to-Peer Networks
In: 1st International Workshop on Similarity Search and Applications (SISAP 2008), IEEE, 2008, pp. 87-94.
Presented at: SISAP 2008 - Workshop at ICDE 2008, 11.-12.04.2008, Cancun,
Mexico.
Due to the exponential growth of digital data and its complexity,
we need a technique which allows us to search such collections efficiently.
A suitable solution is based on the peer-to-peer (P2P) network paradigm and
the metric-space model of similarity. When a large volume of data is being
inserted, the P2P network must expand to new peers in order to maintain its
efficiency. Thus, many peers must be split. During a peer split, the data is
halved and one half is migrated to a new peer. In this paper, we study the
problem of peer splits and propose a specialized algorithm for speeding it
up. In particular, we use the structured P2P network called the M-Chord.
Search performance within a single peer is enhanced by the M-tree. In
experimental evaluation, we compare the proposed algorithm with several
straightforward solutions on a real network organizing 10 million images.
Our algorithm provides a significant performance boost.
Dokulil Jiří, Katreniaková J.
Visual Exploration of RDF Data
In: SOFSEM 2008: Theory and Practice of Computer Science, LNCS 4910, Springer, 2008, pp. 572-583.
ISBN: 978-3-540-77565-2
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
Dokulil Jiří, Yaghob Jakub, Zavoral Filip
Evoluce replikačních algoritmů v stohově orientovaných systémech
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 393-401.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
Dokulil Jiří, Yaghob Jakub, Zavoral Filip
Infrastruktura pro dotazování nad semantickými daty
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 10-26.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
Idea sémantického webu je široce diskutována mezi odbornou
veřejností již mnoho let. Přestože je vyvinuta řada technologií, jazyků,
prostředků a dokonce i softwarových nástrojú, málokdo někdy nějaký
reálný sémantický web viděl. Za jeden z hlavních dùvodù tohoto stavu
považujeme neexistenci potřebné infrastruktury pro provoz sémantického
webu. V našem článku popisujeme návrh takové infrastruktury, která je
založena na využití a rozšíření technologie datového stohu a nástrojích
pro něj vyvinutých a jejich kombinaci s webovými vyhledávači a dalšími
nástroji a prostředky.
Dokulil Jiří
Použití relačních databází pro vyhodnocování SPARQL dotazů
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Dokulil Jiří
Evaluation of SPARQL queries using relational databases
In: Proceedings of 5th International Semantic Web Conference, ISWC, 2006, (Ed. Cruz I.), LNCS 4273, Springer Verlag, Athens, FA, USA, 2006, pp. 972-973.
Basic storage and querying of RDF data using a relational
database can be done in a very simple manner. Such approach can run
into trouble when used on large and complex data. This paper presents
such data and several sample queries together with analysis of their performance.
It also describes two possible ways of improving the performance
based on this analysis.
Dokulil Jiří, Tykal J., Yaghob Jakub, Zavoral Filip
Semantic Web Infrastructure
In: Proc. of the First IEEE International Conference on Semantic Computing, IEEE, 2007, pp. 209-215.
Presented at: ICSC 2007, 17.-19.9.2007, Irvine,
California.
The Semantic Web is not widespread as it has been expected by its founders. This is partially caused by lack of standard and working infrastructure for the Semantic Web. We have built a working, portable, stable, highperformance infrastructure for the Semantic Web. This paper is focused on tasks performed by the infrastructure.
Dokulil Jiří, Tykal J., Yaghob Jakub, Zavoral Filip
Semantic Web Repository and Interfaces
In: Proc. of SEMAPRO (Int. Conf. on Advances in Semantic Processing), IEEE, 2007.
Presented at: SEMAPRO (Int. Conf. on Advances in Semantic Processing), 4.-9.11.2007, Papeete,
French Polynesia (Tahiti) .
The Semantic Web is not widespread as it has been
expected by its founders. This is partially caused by
lack of standard and working infrastructure for the Semantic
Web. We have built a working, portable, stable,
high-performance infrastructure for the Semantic
Web. This enables various experiments with the Semantic
Web in the real world.
Dokulil Jiří, Katreniaková J.
Visualization of large schemaless RDF data
In: Proc. of SEMAPRO (Int. Conf. on Advances in Semantic Processing), IEEE, 2007, pp. 243-248.
Presented at: SEMAPRO (Int. Conf. on Advances in Semantic Processing), 4.-9.11.2007, Papeete,
French Polynesia (Tahiti) .
Since many XML documents do not contain any schema definition, we expected that there will be also RDF documents without RDF schema or ontology.Then the data can only be viewed as a general labeled directed graph and the idea to present the data to the user by drawing the graph seems natural. Because the data can be extremely large, it is impossible to display the whole graph at one time. Only a suitable start node is displayed and the rest of the graph can be explored by incremental navigation.To conserve space and show possible directions of further navigation to the user we have come up with a technique called node merging. By combining suitable graph drawing and navigation techniques we get a tool that can give the user good idea about structure and content of the data.
Dokulil Jiří, Tykal J., Yaghob Jakub, Zavoral Filip
Experimental Platform for the Semantic Web
In: Proceedings of ITAT 2007, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), PONT s.r.o., Seňa, 2007, pp. 67-72.
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2007, 21.-27.9.2007, Polana,
Slovakia.
Dokulil Jiří, Katreniaková J.
Vizualizácia RDF dát pomocou techniky zlučovania vrcholov
In: Proceedings of ITAT 2007, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), PONT s.r.o., Seňa, 2007, pp. 23-28.
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2007, 21.-27.9.2007, Polana,
Slovakia.
Dokulil Jiří, Katreniaková J.
Visual Exploration of RDF Data
In: SOFSEM 2008: Theory and Practice of Computer Science, LNCS 4910, Springer, 2008, pp. 672-683.
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
We have developed and implemented [1,2] infrastructure and
RDF storage for the Semantic Web. When we filled it with data the need
for some tool that could explore the data became evident. Unfortunately,
none of existing solutions fulfills requirements imposed by the data and
users expectations. This paper presents our RDF visualizer that was
designed specifically to handle large RDF data by means of incremental
navigation. A detailed description of the algorithm is given as well as
actual results produced by the visualizer.
Duží Marie, Vojtáš Peter
Multi-Criterion Search from the Semantic Point of View
In: EJC`07, (Ed. Jaakkola H. et al.), Juvenes Print-TTY, Tampere, 2007, pp. 21-39.
Presented at: THE 17th EUROPEAN - JAPANESE CONFERENCE ON INFORMATION MODELLING AND KNOWLEDGE BASES , 4.-8.6.2007, Pori,
Finland.
Dvořáková Jana, Zavoral Filip
Xord: An Implementation Framework for Efficient XSLT Processing
In: Proc. of 2nd International Symposium on Intelligent Distributed Computing, Studies in Computational Intelligence, (Ed. C. Badica et al.), Volume: 162, Springer-Verlag, Heidelberg, 2008, pp. 95-104.
ISBN: 978-3-540-85256-8
Presented at: IDC 2008: 2nd International Symposium on Intelligent Distributed Computing, 18.-19.9.2008, Catania, Italy.
Dvořáková Jana, Zavoral Filip
BUXT Engine in Xord: Fragment Buffers for Streaming XSLT Transformations
In: Proc. of 5th International Conference on Innovations in Information Technology, IEEE Computer Society Press, 2008.
(in_print)
Presented at: Innovations 2008: 5th International Conference on Innovations in Information Technology, 16.-18.12.2008, Al Ain, United Arab Emirates.
Dvořáková Jana, Zavoral Filip
Schema-Based Analysis of XSLT Streamability
In: Proc. of International Conference on Advanced Engineering Computing and Applications in Science, IEEE Computer Society Press, 2008, pp. 187-192.
ISBN: 978-0-7695-3369
Presented at: ADVCOMP 2008: International Conference on Advanced Engineering Computing and Applications in Science, 29.9.-4.10.2008, Valencia, Spain.
Dvořáková Jana, Zavoral Filip
A Low-Memory Streaming Algorithm for XSLT Processing Implemented in Xord Framework
In: Proc. of 1st International Conference on the Applications of Digital Information and Web Technologies, IEEE Computer Society Press, 2008, pp. 239-247.
ISBN: 978-1-4244-2624-9
Presented at: ICADIWT 2008: 1st International Conference on the Applications of Digital Information and Web Technologies, 4.-6.8-2008, Ostrava, Czech Republic.
Dvořáková Jana, Zavoral Filip
Determining XSLT Streamability Using New Hierarchical XSD Model
In: Proc. of Information Technologies - Application and Theory, (Ed. P. Vojtáš), PONT, Seňa, 2008, pp. 7-12.
ISBN: 978-80-969184-9-2
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2008, 22.-26.9.2008, High Tatras,
Slovakia.
Eckhardt Alan, Horváth T., Maruščák D., Novotný R., Vojtáš Peter
Uncertainty Issues in Automating Process Connecting Web and User
In: Proc. of Uncertainty Reasoning for the Semantic Web Workshop 2007, (Ed. F. Bobillo), CEUR Workshop Proc., 2007, pp. 1-12.
Presented at: Dateso 2008: Annual International Workshop on DAtabases, TExts, Specifications and Objects, 16.4.-18.4.2008, Desná - Černá Říčka,
Czech Republic.
Eckhardt Alan, Horváth T., Vojtáš Peter
Learning different user profile annotated rules for fuzzy preference top-k quering
In: Scalable Uncertainty Management, Springer, LNAI 4772, Berlin, 2007, pp. 116-130.
Presented at: SUM 2007 International Conference, 10.10.-12.10.2007, Washington,
US.
Uncertainty querying of large data can be solved by providing top-k answers according to a user fuzzy ranking/scoring function. Usually different users have different fuzzy scoring function a user preference model. Main goal of this paper is to assign a user a preference model automatically. To achieve this we decompose user’s fuzzy ranking function to ordering of particular attributes and to a combination function. To solve the problem of automatic assignment of user model we design two algorithms, one for learning user preference on particular attribute and second for learning the combination function. Methods were integrated into a Fagin-like top-k querying system with some new heuristics and tested.
Eckhardt Alan, Vojtáš Peter
Uživatelské preference při hledání ve webovských zdrojích
In: Znalosti 2007, Fakulta elektrotechniky a informatiky, VŠB - Technická univerzita Ostrava, 2007, pp. 179-190.
Presented at: Znalosti 2007, 21.2.-23.2.2007, Ostrava,
Czech Republic.
Eckhardt Alan, Vojtáš Peter
Towards ontology language handling imperfection
In: Proceeding of the 1st Workshop on Intelligent and Knowledge oriented Technologies, 2006, pp. 124-125.
Presented at: 1st Workshop on Intelligent and Knowledge oriented Technologies, 28.11.-29.11.2006, Bratislava,
Slovakia.
Eckhardt Alan
Inductive Models of User Preferences for Semantic Web
In: Proceedings of the Dateso 2007, CEUR Workshop Proc., 2007, pp. 103-114.
Presented at: Dateso 2007 Annual International Workshop on DAtabases, TExts, Specifications and Objects, 18.4.-20.4.2007, Desná - Černá Říčka,
Czech Republic.
User preferences became recently a hot topic. The massive
use of internet shops and social webs require the presence of a user modelling,
which helps users to orient them selfs on a page. There are many
different approaches to model user preferences. In this paper, we will
overview the current state-of-the-art in the area of acquisition of user
preferences and their induction. Main focus will be on the models of user
preferences and on the induction of these models, but also the process of
extracting preferences from the user behaviour will be studied. We will
also present our contribution to the probabilistic user models.
Eckhardt Alan, Pokorný Jaroslav, Vojtáš Peter
Integrating user and group preferences for top-k search from distributed web resources
In: Proc. of DEXA Workshop Decision Support for Structural Health Monitoring and Flexible Query Processing, (Ed. Tjoa A.M., Wagner R.R..), IEEE, 2007, pp. 317-322.
Presented at: DEXA Workshop, 3.-7.9.2007, Regensburg,
Germany.
We discuss models of user and group preferences in social networks and the Semantic web. We construct a model for user and group preference querying over RDF data as well as for ordering of answers by aggregation of particular attribute ranking. We have implemented our methods and heuristics into the Tokaf middleware framework prototype. We describe also experiments with Tokaf.
Eckhardt Alan, Pokorný Jaroslav, Vojtáš Peter
A system recommending top-k objects for multiple users preference
In: Proc. of FUZZ-IEEE 2007 International Conference on Fuzzy Systems, IEEE, 2007, pp. 1101-1106.
Presented at: FUZZ-IEEE 2007, 23.-26.7.2007, London,
UK.
We discuss models of user preferences in Web environment. We construct a model for user preference querying over a number of data sources and ordering of answers by a combination of particular attribute rankings. We generalize Fagin's algorithm in two directions - we develop some new heuristics for top-k search in the model without random access and propose a method of ordering lists of objects by user fuzzy function. To enable different user preferences our system does not require objects to be sorted - instead we use a B+- tree on each of the attribute domains. This leads to a more realistic model of Web services. We implement our methods and heuristics for search of top-k answers into Tokaf middleware framework prototype. We describe experiments with Tokaf and compare different performance measures with some other methods.
Eckhardt Alan, Horváth T., Vojtáš Peter
PHASES: A User Profile Learning Approach for Web Search
In: Web Intelligence, IEEE Computer SocietyScalable Uncertainty Management, Los Alamitos, 2007, pp. 780-783.
Presented at: WI 2007. IEEE/WIC/ACM International Conference on Web Intelligence, 2.11.-5.11.2007, Silicon Valley,
US.
Web search heuristics based on Fagin’s threshold
algorithm assume we have the user profile in the form
of particular attribute ordering and a fuzzy
aggregation function representing the user combining
function. Having these, there are sufficient algorithms
for searching top-k answers. Finding particular
attribute ordering and aggregation for a user still
remains a problem. In this short paper our main
contribution is a proof of concept of a new iterative
process of acquisition of user preferences and attribute
ordering .
Eckhardt Alan, Horváth T., Maruščák D., Novotný R., Vojtáš Peter
Uncertainty Issues in Automating Process Connecting Web and User
In: Proc. of Uncertainty Reasoning for the Semantic Web (URSW 2007), Workshop at ISWC+ASWC 2007, (Ed. deCosta P. et al.), 2007, pp. 97-108.
Presented at: ISWC 2007, 12.11.2007, Busan,
Korea.
Falchi Fabrizio, Gennaro Claudio, Rabitti Fausto, Zezula Pavel
A distributed incremental nearest neighbor algorithm
In: International Conference on Scalable Information Systems, Volume: 304, ACM Press, New York, 2007, pp. 1-10.
Presented at: INFOSCALE 2007, 6.-8.6.2007, Suzhou,
China.
Falchi Fabrizio, Gennaro Claudio, Zezula Pavel
Nearest neighbor search in metric spaces through Content-Addressable Networks
In: Information Processing and Management, Volume: 44, No: 1, Elsevier, 2008, pp. 411-429.
Farský Miroslav, Neruda Martin, Neruda Roman
Mass and energy flows in consequences of company environmental acconting
In: Proceeding of the Environmental Accounting - Sustainable Development Indicators, International Conference EA-SDI 2005, (Ed. Ritschelová I.), Jan Evangelista Purkyně University and Charles University, Ústí nad Labem and Prague, 2005, pp. 356-362.
ISBN: 80-7044-676-5
Presented at: International Conference EA-SDI 2005, 26.9.-27.9.2005,
Czech Republic.
During the implementation of an environmental accounting system in a company, one of the most important pieces of information to obtain is a detailed understanding about material flows (raw materials, semi-finished products, final products and wastes) and flows of different types of energy inputs (buying, selling and wastage) when thinking about the consequences on the company. The authors, in the article: 1) study the question of the quantification of the flows, and the accuracy of their measurement, 2) provide an environmental accounting statement, with help of standards and indices, statistical trends analysis.
Feuerlicht George, Pokorný Jaroslav, Richta Karel
Object-Relational Database Design: Can your application benefit from SQL:2003?
In: The Inter-Networked World: ISD Theory, Practice, and Education, (Ed. Barry C., Lang M., Wojtkowski W., Wojtkowski G., Wrycza S., Zupancic J.), Springer-Verlag, New York, 2008.
ISBN: 978-0387304038
Frolov A., Polyakov P. Y., Húsek Dušan, Řezanková Hana
Neural Network Based Boolean Factor Analysis of Parliament Voting
In: Proceedings in Computational Statistics, Physica-Verlag, Heidelberg, 2006, pp. 861-868.
ISBN: 3-7908-1708-2
Presented at: Compstat 2006, 28.8.-1.9.2006, Rome,
Italy.
Frolov A., Húsek Dušan, Muraviev P. Igor, Polyakov P. Y.
Boolean Factor Analysis by Attractor Neural Network
In: IEEE Transactions on Neural Networks, Volume: 18, No: 3, IEEE, 2007, pp. 698-707.
A common problem encountered in disciplines such as statistics, data analysis, signal processing, textual data representation, and neural network research, is finding a suitable representation of the data in the lower dimension space. One of the principles used for this reason is a factor analysis. In this paper, we show that Hebbian learning and a Hopfield-like neural network could be used for a natural procedure for Boolean factor analysis. To ensure efficient Boolean factor analysis, we propose our original modification not only of Hopfield network architecture but also its dynamics as well. In this paper, we describe neural network implementation of the Boolean factor analysis method. We show the advantages of our Hopfield-like network modification step by step on artificially generated data. At the end, we show the efficiency of the method on artificial data containing a known list of factors. Our approach has the advantage of being able to analyze very large data sets while preserving the nature of the data.
Galamboš Leo
Dynamic Inverted Index Maintenance
In: International Journal of Computer Science, Volume: 1, No: 2, 2006, pp. 157-162.
Galamboš Leo
Inverted Index Maintenance
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 27-38.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
This paper presents a method for dynamization which may
be used for fast and effective inverted index maintenance. Experimental
results show that the dynamization process is possible and that it guarantees the response time for the query operation and index actualization.
Galamboš Leo, Lánský Jan, Chernik K.
Compression of Semistructured Documents
In: International Enformatika Conference IEC 2006, Enformatika, Transactions on Engieering, Computing and Technology, 2006, pp. 222-227.
Galamboš Leo
Vyhledávání na Webu
In: DATAKON 2007, (Ed. Popelínský L., Výborný O.), Masaryk university, 2007, pp. 17-24.
Presented at: DATAKON 2007, 20.10.-23.10.2007, Brno,
Czech Republic.
Galamboš Leo, Lánský Jan, Žemlička M., Chernik K.
Compression of Semistructured Documents
In: International Journal of Information Technology, Volume: 4, No: 1, Elsevier, 2007, pp. 11-17.
EGOTHOR is a search engine that indexes the Web
and allows us to search the Web documents. Its hit list contains URL
and title of the hits, and also some snippet which tries to shortly
show a match. The snippet can be almost always assembled by an
algorithm that has a full knowledge of the original document (mostly
HTML page). It implies that the search engine is required to store
the full text of the documents as a part of the index.
Such a requirement leads us to pick up an appropriate compression
algorithm which would reduce the space demand. One of the solutions
could be some use of common compression methods, for instance
gzip or bzip2, but it might be preferable to develop a new method
which would take advantage of the document structure, or rather, the
textual character of the documents.
There already exist special compression text algorithms and methods
for a compression of XML documents. The aim of this paper is
an integration of the two approaches to achieve an optimal level of
the compression ratio.
Gurský Peter, Horváth T., Novotný R., Vaneková Veronika, Vojtáš Peter
UPRE: User preference based search system
In: Proceeding of the IEEE/WIC/ACM International Conference on Web Intelligence, ACM IEEE WIC, 2006, pp. 4.
Presented at: IEEE/WIC/ACM International Conference on Web Intelligence WI-06, 18.12.-22.12.2006, Hong-Kong
.
Gurský Peter, Vojtáš Peter
Multikriteriálne vyhľadávanie najlepších objektov s podporou viacerých užívateľov
In: Znalosti 2007, Fakulta elektrotechniky a informatiky, VŠB - Technická univerzita Ostrava, 2007, pp. 52-62.
Presented at: Znalosti 2007, 21.2.-23.2.2007, Ostrava,
Czech Republic.
Gurský Peter, Horváth T., Jirásek J., Krajči S., Novotný R., Vaneková Veronika, Vojtáš Peter
Web Search with Variable User Model
In: DATAKON 2007, (Ed. Popelínský L., Výborný O.), Masaryk university, 2007, pp. 111-121.
Presented at: DATAKON 2007, 20.10.-23.10.2007, Brno,
Czech Republic.
Gurský Peter, Horváth T., Jirásek J., Krajči S., Novotný R., Vaneková Veronika, Vojtáš Peter
Knowledge Processing for Web Search - An integrated Model
In: Proc. of the 1st International Symposium on Intelligent and Distributed Computing (IDC 2007), STUDIES IN COMPUTATIONAL INTELLIGENCE, (Ed. Badica C., Paprzycki M.), Volume: 78, Springer, 2007, pp. 95-104.
Presented at: IDC 2007: 1st International Symposium on Intelligent and Distributed Computing, 18.-20.10.2007, Craiova,
Romania.
We propose a model of a middleware system enabling personalized
web search for users with different preferences. We integrate both inductive and
deductive tasks to find user preferences and consequently best objects. The
model is based on modeling preferences by fuzzy sets and fuzzy logic. We
present the model-theoretic semantic for fuzzy description logic f-EL which is
the motivation of creating a model for fuzzy RDF. Our model was
experimentally implemented and integration was tested.
Hájek Petr
Making fuzzy description logic more general
In: Fuzzy Sets and Systems, Volume: 154, 2005, pp. 1-15.
A version of fuzzy description logic based on the basic (continuous t-norm based) fuzzy predicate logic BL is presented. Problems of satisfiability, validity and subsumption of concepts are discussed and reduced to problems of fuzzy propositional logic known to be decidable for any continuous t-norm. For Lukasiewicz t-norm some stronger results are obtained.
Hájek Petr
What does Mathematical Fuzzy Logic Offer to Description Logic?
In: Fuzzy Logic and the Semantic Web, (Ed. Sanchez E.), Elsevier, 2006.
ISBN: 0-444-51948-3
Continuous t-norm based fuzzy predicate logic is surveyed as a generalization of classical predicate logic; then a kind of fuzzy description logic based on our fuzzy predicate logic is briefly described as a powerful but still decidable formal system of description logic dealing with vague (imprecise) concepts.
Hájek Petr, Mesiar R.
On copulas, quasicopulas and fuzzy logic
In: Soft Computing, Volume: 12, No: 12, Springer, 2008, pp. 1239-1243.
Hanks Patrick
The Organization of the Lexicon: Semantic Types and Lexical Sets
In: Proceedings of Euralex Conference, University of Torino, 2006.
Presented at: The 12th EURALEX International Congress, 6.9.-9.9.2006, Torino,
Italy.
This paper reports a new kind of lexicon currently being developed as a resource for
natural language processing, language teaching, and other applications. This is a "Pattern
Dictionary of English", based on detailed and extensive corpus analysis of each sense of each verb in the language. A pattern consists of a verb with its valencies, plus semantic values for each valency and other relevant clues, and is associated with an implicature that associates the meaning with the context rather than with the word in isolation. For each verb, all normal patterns are recorded. The semantic types in each argument slot are linked to actual words via a large ontology.
The dictionary is aimed primarily at the NLP community, but it also has relevance for language teaching. For NLP purposes, matching actual uses of verbs in previously unseen text to patterns in the pattern dictionary offers some hope of solving the “Word Sense Disambiguation (WSD) problem”.
The paper discusses the relationship between A) words as they are actually used and B) semantic types and functions in a theoretical lexicon. An attempt will be made in the full paper to relate empirically observable, corpus-based facts about ordinary word use to the theoretical abstractions of Generative Lexicon Theory of James Pustejovsky. Lexicography and linguistic theory are often uneasy bedfellows, but I shall suggest that in at least these two cases, there is a possibility of a harmonious and productive relationship.
Hanks Patrick
Why Bother with Corpus Evidence
In: Proceedings of the Second International Conference of the German Cognitive Linguistics Association, 2007.
(in_print)
Presented at: Second International Conference of the German Cognitive Linguistics Association, 5.10.-7.10.2006, Munich,
Germany.
Hanks Patrick, Pala Karel, Rychlý Pavel
Using Corpus Analysis to Mapping Lexical Sets onto Semantic Types through Corpus Analysis
In: Proceedings of the Fourth International Workshop on Generative Approaches to the Lexicon, 2007.
(in print)
Presented at: Fourth International Workshop on Generative Approaches to the Lexicon, 10-11.5.2007, Paris,
France.
Hanks Patrick, Pala Karel
Towards an empirically well-founded semantic ontology for NLP
In: Proceedings of the Fourth International Workshop on Generative Approaches to the Lexicon, 2007.
Presented at: Fourth International Workshop on Generative Approaches to the Lexicon, 10-11.5.2007, Paris,
France.
This paper examines some issues involved in
building a corpus-based ontology for use in
determining the meaning of words in text, in the
context of creating a “pattern dictionary”. How do
words cluster in paradigmatic lexical sets in actual
usage (as reflected in a large corpus), and can these
clusters be mapped onto a semantically structured
ontology? What semantic notions need to be
distinguished for this purpose, and what are the
appropriate theoretical foundations? What other
elements are needed for the application of
determining meaning in text?
Hanks Patrick
Editorial: Cognition and the Lexicon
In: Lexicology, (Ed. Hanks P.), Volume: 5, Routledge, Taylor and Francis Group, 2007.
ISBN: 978-0-415-70098-6
Hanks Patrick
Editorial: Formal Approaches to the Lexicon
In: Lexicology, (Ed. Hanks P.), Volume: 6, Routledge, Taylor and Francis Group, 2007.
ISBN: 978-0-415-70098-6
Hlaváčková D., Horák Aleš, Kadlec V.
Exploitation of the VerbaLex Verb Valency Lexicon in the Syntactic Analysis of Czech
In: Text, Speech and Dialogue - Proceedings of the 9th International Conference, TSD 2006, LNCS 4188, Springer-Verlag, Berlin / Heidelberg, 2006, pp. 79-86.
Presented at: Ninth International Conference on TEXT, SPEECH and DIALOGUE TSD 2006,, 11.9.-15.9.2006, Brno,
Czech Republic.
Hlaváčková D., Pala Karel
Surface and Deep Valency Frames in Czech
In: Proceedings of the 25th International Conference on Lexis and Grammar, 2007.
(in_print)
Presented at: The 25th International Conference on Lexis and Grammar, 6.9.-10.9.2006, Palermo,
Italy.
Hlaváčková D., Pala Karel
Computer Processing Derivational Relations in Czech
In: Computer Treatment of Slavic and East European Languages, L. Štúr Institute of Linguistics, Slovak Academy of Sciences, Bratislava, 2007, pp. 198-208.
Presented at: Slovko 2007, 25.-27.10.2007, Bratislava,
Slovakia.
In the paper we deal with the derivational relations in Czech that form typical derivational nests (or subnets). Derivational relations are mostly of semantic nature and their regularity in Czech allows us to describe them in a way suitable for computer processing and then add them to the electronic databases such as WordNet almost automatically. For this purpose we have used the derivational version of morphological analyzer Ajka that is able to handle the basic and most productive derivational relations in Czech. A special derivational interface has been developed in our NLP Lab at FI MU by means of which we have explored the semantic nature of the selected noun derivational suffixes (22) as well as verb prefixes and established a set of the semantically labeled derivational relations, presently 14. With regard to the verbs we have paid attention to the selected verb semantic classes in connection with the derivational relations between selected prefixes (4) and corresponding Czech verbs. As an application we have added the selected derivational relations to the Czech WordNet and in this way enriched it with approx. 30 000 new Czech synsets.
Hliněná Dana, Vojtáš Peter
A Note on an Example of Use of Fuzzy Preference Structures
In: Acta Universitatis Matthiae Belii, Volume: 14, 2008, pp. 29-39.
Holeňa Martin
Získávání logických tvrzení z dat jako významný směr dobývání znalostí z dat
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 311-322.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
Příspěvek se zabývá problematikou získávání logických tvrzení z dat, tedy těmi metodami dobývání znalostí z dat (data mining), jejichž výsledky lze vyjádřit v jazyce nějaké formální logiky. Je podán velmi stručný přehled širokého spektra rozmanitých metod tohoto typu, jak metod vycházejících ze statistických přístupů, tak i metod spočívajících na principech strojového učení, a je poukázáno na specifický charakter metod založených na umělých neuronových sítích. Pro ilustraci jsou podrobněji popsány dvě konkrétní metody získávání logických tvrzení z dat. Jednou z nich je metoda Guha, která vychází z observační logiky a je pravděpodobně nejstarší metodou získávání pravidel z dat vůbec. Druhou je metoda založená na po částech lineárních vícevrstvých perceptronech.
Horák Aleš, Pala Karel, Rambousek Adam, Povolný Martin
DEBVisDic – First Version of New Client-Server Wordnet Browsing and Editing Tool
In: Proceedings of the Third International Wordnet Conference, 2006.
Presented at: The 3rd International WordNet Conference (GWC-06), 22.1.-26.1.2006, Jeju Island,
Korea.
In this paper, we present the new wordnet development
tool called DEBVisDic. It is built
on the recently developed platform for clientserver
XML databases, called DEBii. This
platform is able to cover many possible applications,
from which we concentrate on the new,
complete reimplementation of one of the mostspread
wordnet editor and browser – VisDic.
We argue for the benefits the new DEBii platform
brings to wordnet editing and to XML
databases in general. In the paper, we describe
the state of the implementation, the insides and
interfaces of the DEBVisDic tool. We also discuss
its functionality and some distinctions in
comparison with other dictionary writing systems.
Horák Aleš, Rambousek Adam
Administration Framework for the DEB Dictionary Server
In: Computer Treatment of Slavic and East European Languages, L. Štúr Institute of Linguistics, Slovak Academy of Sciences, Bratislava, 2007, pp. 70-79.
Presented at: Slovko 2007, 25.-27.10.2007, Bratislava,
Slovakia.
This paper presents a new implementation of administration framework for the DEBII dictionary writing system. We present the details and examples of the user management part as well as graphical scenarios for dictionary service setup, adaptation and automatic generation of user application based on the dictionary XML schema.
Tento článek představuje novou implementaci administračního rozhraní systému pro tvorbu slovníků DEBII. V článku je podrobně popsán systém správy uživatelů a také grafikou doplněný postup vytvoření nového slovníku, jeho přizpůsobení a automatické generování uživatelské aplikace pomocí XML schématu slovníku.
Horák Aleš, Rambousek Adam
DEB Platform Deployment - Current Applications
In: RASLAN 2007: Recent Advances in Slavonic Natural Language Processing, Masaryk University, Brno, 2007, pp. 3-11.
In this paper, we summarize the latest development regarding the client dictionary writing applications based on the DEB development platform. The DEB framework is nowadays used in several full grown projects for preparation of high quality lexicographic data created within (possibly distant) teams of researchers. We briefly present the current list of DEB applications with the relevant projects and their phases. For each of the applications, we offer display the view of the interface with overview description of the most important features.
Horák Aleš, Rambousek Adam
Dictionary Management System for DEB Development Platform
In: NLPCS 2007: Proceedings of the 4th International Workshop on Natural Language Processing and Cognitive Science, INSTICC PRESS, Funchal, Portugal, 2007, pp. 129-138.
Presented at: NLPCS 2007, 12.-16.6.2007, Funchal,
Madeira - Portugal.
In the paper, we introduce new dictionary management interface for design, preparation and presentation of generic electronic XML dictionaries using the DEB (Dictionary Editing and Browsing) development platform. The DEB platform provides a strict client-server environment for general dictionary writing systems. So far several successful NLP tools have been implemented on this platform, one of the most known being the DEBVisDic tool for wordnet semantic network editing and visualization. This paper describes a new part of the DEB platform -- the Administration interface that is shared by all DEB applications running on one server machine.
Článek představuje nové rozhraní pro správu slovníků, které umožňuje návrh, přípravu a prezentaci obecných elektronických slovníků ve formátu XML s použitím vývojové platformy DEB (Dictionary Editing and Browsing). Platforma DEB poskytuje prostředí v architektuře klient-server pro obecné systémy pro vytváření slovníků. V současné době bylo na této platformě implementováno několik úspěšných NLP nástrojů, nejznámnější je nástroj DEBVisDic pro editaci a vizualizaci sémantických sítí typu wordnet. Tento článek popisuje novou součást platformy DEB -- administrační rozhraní, které sdílí všechny DEB aplikace spuštěné na jednom serveru.
Horák Aleš, Pala Karel, Rambousek Adam
The Global WordNet Grid Software Design
In: Proc. of Fourth Global WordNet Conference, University of Szeged, Department of Informatics, 2008, pp. 194-199.
Presented at: GWC 2008, 22.-25.1.2008, Szeged,
Hungary.
Horák Aleš, Holan Tomáš, Kadlec V., Kovář Vojtěch
Dependency and Phrasal Parsers of the Czech Language: A Comparison
In: Proceedings of Text, Speech and Dialogue 2007, Springer, LNAI 4629, Berlin, Heidelberg, 2007, pp. 76-84.
Presented at: TSD 2007, 3.-7.9.2007, Plzeň,
Czech Republic.
In the paper, we present the results of an experiment with comparing the effectiveness of real text parsers of Czech language based on completely different approaches stochastic parsers that provide dependency trees as their outputs and a meta-grammar parser that generates a resulting chart structure representing a packed forest of phrasal derivation trees.
We describe and formulate main questions and problems accompanying such experiment, try to offer answers to these questions and finally display also factual results of the tests measured on 10 thousand Czech sentences.
Horák Aleš
Computer Processing of Czech Syntax and Semantics
In:
Horák Aleš, Pala Karel, Rambousek Adam
Tools for Managing Multiligual Lexical Resources
In: Proc. of International Conference Inteligent Information Systems, Polish Academy of Sciences, 2008, pp. 451-460.
Presented at: International Conference Inteligent Information Systems, , Zakopane, Poland.
Húsek Dušan, Pokorný Jaroslav, Řezanková Hana, Snášel Václav
Data clustering: From documents to the Web
In: Web Data Management Practices: Emerging Techniques and Technologies, (Ed. Vakali A., Pallis G.), Idea Group Inc., 2007, pp. 1-33.
The chapter provides a survey of some clustering methods relevant to the clustering document collections and, in consequence, Web data. We start with classical methods of cluster analysis which seem to be relevant in approaching to cluster Web data. The graph clustering is also described since its methods contribute significantly to clustering Web data. A use of artificial neural networks for clustering has the same motivation. Based on previously presented material, the core of the chapter provides an overview of approaches to clustering in the Web environment. Particularly, we focus on clustering web search results, in which clustering search engines arrange the search results into groups around a common theme. We conclude with some general considerations concerning the justification of so many clustering algorithms and their application in the Web environment.
Húsek Dušan, Snášel Václav, Owais Suhail S. J., Krömer Pavel
Using Genetic Algorithms for Boolean Queries Optimization
In: Proceedings of the Ninth IASTED International Conference INTERNET AND MULTIMETIA SYSTEMS AND APPLICATIONS, ACTA Press, 2005, pp. 178-184.
Presented at: Ninth IASTED International Conference INTERNET AND MULTIMETIA SYSTEMS AND APPLICATIONS, 15.8.-17.8.2005, Honolulu,
Hawaii, USA.
Most of information retrieval systems depend on Boolean queries. The performance of an information retrieval system is usually measured in terms of two different criteria, precision and recall. This way, the optimization of any of its components is a clear example of a multiobjective problem. However, although evolutionary algorithms have been widely applied in the information retrieval area, in all of these applications both criteria have been combined in a single scalar fitness function by means of a weighting scheme. In this paper, we deal with using of Genetic algorithms in Information retrieval specially in optimizing of a Boolean query.
Húsek Dušan, Frolov A., Polyakov P. Y., Řezanková Hana
Neural Network Based Boolean Factor Analysis: Efficient Tool for Automated Topics Search
In: Proceedings of International Multiconference on Computer Science and Information Technology CSIT 2006, Applied Science Private University, Amman, 2006, pp. 321-327.
ISBN: 9957-8592-X
Presented at: International Multiconference on Computer Science and Information Technology CSIT 2006, 5.4.-7.4.2006, Amman,
Jordan.
Húsek Dušan, Moravec Pavel, Snášel Václav, Frolov A., Polyakov P. Y.
Comparison of Neural Network Boolean Factor Analysis Method with Some Other Dimension Reduction Methods on Bars Problem
In: Pattern Recognition and Machine Intelligence, (Ed. Ghosh A., De R.), LNCS 4815, Springer, Berlin, 2007.
ISBN: 978-3-540-77045-9
Presented at: PReMI 2007. International Conference (2.), 18.-22.12.2007, Kolkata,
India.
In this paper, we compare performance of novel neural network based algorithm for Boolean factor analysis with several dimension reduction techniques as a tool for feature extraction. Compared are namely singular value decomposition, semi-discrete decomposition and non-negative matrix factorization algorithms, including some cluster analysis methods as well. Even if the mainly mentioned methods are linear, it is interesting to compare them with neural network based Boolean factor analysis, because they are well elaborated. Second reason for this is to show basic differences between Boolean and linear case. So called bars problem is used as the benchmark. Set of artificial signals generated as a Boolean sum of given number of bars is analyzed by these methods. Resulting images show that Boolean factor analysis is upmost suitable method for this kind of data.
Ivanova K., Heid U., Schulte im Walde S., Kilgarriff A., Pomikálek Jan
Evaluating a German Sketch Grammar: A Case Study on Noun Phrase Case
In: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), European Language Resources Association (ELRA), 2008.
Presented at: International Conference on Language Resources and Evaluation, , Marrakech, Morocco.
Jiroušek Radim, Vejnarová Jiřina, Daniel Milan
Compositional Models of Belief Functions
In: ISIPTA'07, Charles University, Faculty of Mathematics and Physics, Prague, 2007, pp. 243-252.
Presented at: ISIPTA'07 - FIFTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITY: THEORIES AND APPLICATIONS, 16.-19.7.2007, Prague,
Czech Republic.
After it has been successfully done in probability and
possibility theories, the paper is the first attempt to
introduce the operator of composition also for belief
functions. We prove that the proposed definition
preserves all the necessary properties of the operator
enabling us to define compositional models as an
efficient tool for multidimensional models representation.
Jusko Pavol, Obdržálek David, Petrůšek Tomáš
Software-Hardware Mapping in a Robot Design
In: Proc. of International Conference on Research and Education in Robotics, LNCS, Springer-Verlag, Heidelberg, 2008, pp. 42-51.
ISBN: 978-80-7378-042-5
Presented at: EUROBOT 2008: International Conference on Research and Education in Robotics, 21.-24.5.2008, Heidelberg, Germany.
Kochánek Jiří, Lánský Jan, Uzel Petr, Žemlička M.
Multistream Compression
In: Proc. of Data Compression Conference, IEEE Computer Society Press, 2008, pp. 557.
ISBN: 978-0-7695-3121-2
Presented at: DCC 2008: Data Compression Conference, 25.-27.3.2008, Snowbird, Utah, USA.
Kovář Vojtěch, Horák Aleš
Reducing the Number of Resulting Parsing Trees for the Czech Language Using the Beautified Chart Method
In: Proceedings of 3rd Language and Technology Conference, Wydawnictwo Poznańskie, Poznań, 2007, pp. 433-437.
Presented at: LTC`07, 5.-7.2007, Poznań,
Poland.
In the paper, we present the beautified chart method used for reducing the number of output derivation trees for the Czech syntax parser synt. We show the evaluation results of the method, describe the appropriate algorithms and the parser internal data structures as well as problems with their implementation.
Článek popisuje metodu beautified chart pro omezení počtu výstupních derivačních stromů syntaktického analyzátoru češtiny synt. Je popsána naměřená úspěšnost metody, příslušné algoritmy, datové struktury a některé problémy při implementaci.
Krušina Pavel
Models of Multi-Agent Systems
In: Doktorandský den `04, MATFYZPRESS, 2004, pp. 58.
ISBN: 80-86732-30-4
Presented at: Institute of Computer Science Ph.D. Student`s Days 04, 29.09.-01.10.2004, Paseky nad Jizerou,
Czech Republic.
Multi-agent systems typically utilize a non-blocking asynchronous communication in order to achieve required flexibility and adaptability. High performance computing techniques exploit the current hardware ability of overlapping asynchronous communication with computation to load the available computer resources efficiently. On the contrary, widely used parallel processes modeling methodologies do not often allow for an asynchronous communication description. At the same time those models do not allow their user to select the granularity level and provide only a fixed set of machine and algorithm description quantities. In this work we addressed this issue and designed a new parallel processes modeling methodology. Its main features include an open set of atomic operations that are calculated and predicted for the algorithm in question, and the computer aided semi-automatic measuring of operation counts and approximation of cost functions. This allows not only for tuning the model granularity as well as accuracy according to user needs, but also to reach a such description complexity that would be very difficult to obtain without any computer aid. We demonstrated that our approach gives good results on the parallel implementation of a selected generalized genetic algorithm. A model was constructed and its predictions compared with the reality on various computer architectures, including one parallel cluster machine. We also designed and implemented an open multi-agent system suitable for the above mentioned experiments and many others. This system synthesizes the areas of high performance computing, multi-agent systems and computational intelligence into an efficient and flexible means of running experiments.
Kudělka Miloš, Snášel Václav, El-Qawasmeh Eyas, Lehečka Ondřej
Semantic Web: Web Patterns in Web Page Semantics
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 39-51.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
This paper introduces a novel method for semantic annotation of web pages. Semantic annotation is performed with regard to
unwritten and empirically proven agreement between users and web designers using Web patterns. This method is based on extraction of patterns which are characteristic for concrete domain. Patterns provide formalization of the agreement and allows assignment of semantics to parts
of web pages. Experimental results verify the e®ectives of the proposed
method.
Kudělka Miloš, Snášel Václav, Lehečka Ondřej, El-Qawasmeh Eyas, Pokorný Jaroslav
Web pages reordering and clustering based on web patterns
In: SOFSEM 2008: Theory and Practice of Computer Science, LNCS 4910, Springer, 2008, pp. 731-742.
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
Kudová Petra, Neruda Roman
Learning in Radial Basis Function Networks and Regularization networks
Presented at: Sheffield Machine Learning Workshop, 7.9.-10.9.2004, Sheffield,
Great Britain.
We discuss two approaches to supervised learning, namely regularization networks and RBF networks, and demonstrate their performance on experiments. We claim that the performance of these two models is comparable, so the RBF networks can be used as a cheaper alternative to regularization networks.
Kudová Petra, Neruda Roman
Kernel Based Learning Methods: Regularization Networks and RBF Networks
In: Proceedings of the Sheffield Machine Learning Workshop, Springer Verlag, 2005, pp. 124-136.
ISBN: 3-540-29073-7
Presented at: Sheffield Machine Learning Workshop, 7.9.-10.9.2004, Sheffield,
Great Britain.
Kernel based learning methods are subject of great interest at present. We discuss two kernel based learning methods, namely the Regularization Networks (RN) and the Radial Basis Function Network (RBF networks).
The RNs are derived from the regularization theory, had been studied thoroughly from a function approximation point of view, and therefore have very good theoretical background.
The RBF networks represent a model of artificial neural networks with both neuro-physiological and mathematical motivation. In addition they may be treated as a generalised form of Regularization Networks, i.e. RN with increased number of kernel functions.
We demonstrated the performance of both approaches on experiments, including both benchmark and real-life learning tasks. We claim that the performance of RN and RBF network is comparable in terms of generalisation error. The RN approach usually leads to solutions with higher model complexity (high number of base units). In this situations, the RBF networks can be used as a ’cheaper’ alternative.
Kudová Petra
Learning with Regularization Networks in Bang
Presented at: TAM06, 14.6.-16.6.2006, Barcelona,
Spain.
In this paper we study learning with Regularization Networks (RN). RN are feedforward neural networks with one hidden layer. Since they have a very good theoretical background, we study their practical aspects and applicability. On experiments we demonstrate the role of the regularization parameter, compare RN with different kernels and parameter settings on benchmark data sets. Then we apply RN to a problem of a flow rate prediction, real data from Czech river Sázava are used. All experiments were made using the system Bang.
Kudová Petra
Learning Algorithms Based on Regularization
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 52-59.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
The problem of supervised learning is a subject of great interest at present. It covers wide range of tasks, such as various classification,
prediction, or forecasting problems, i.e. problems that also often arise in
semantic web applications. We study one approach to this problem - regularization networks. We introduce composite types of kernel functions,
sum kernels and product kernels. On experiments we demonstrate the
role of the regularization parameter and kernel function in the regularization network learning, and compare networks with different types of
kernel functions.
Kůrková Věra, Sanguineti Marcello
Learning with generalization capability by kernel methods of bounded complexity
In: Journal of Complexity, Volume: 21, Elsevier, 2005, pp. 350-367.
Learning from data with generalization capability is studied in the framework of minimization of regularized empirical error functionals over nested families of hypothesis sets with increasing model complexity. ForTikhonov`s regularization with kernel stabilizers, minimization over restricted hypothesis sets containing for a fixed integer n only linear combinations of all n-tuples of kernel functions is investigated. Upper bounds are derived on the rate of convergence of suboptimal solutions from such sets to the optimal solution achievable without restrictions on model complexity.The bounds are of the form 1/sqrt(n) multiplied by a term that depends on the size of the sample of empirical data, the vector of output data, the Gram matrix of the kernel with respect to the input data, and the regularization parameter.
Kůrková Věra
Inverse Problem in Data Analysis
In: Przeglad elektrotechniczny, Volume: 82, No: 4, 2006, pp. 41-47.
It is shown that learning from data modelled as minimaization of error functionals can be reformulated in terms of inverse problems. This reformulation allows to characterize optimal input-output functions of networks with kernel units.
Kůrková Věra
Estimates of Data Complexity in Neural-Network Learning
In: SOFSEM 2007, LNCS 4362, Springer, Berlin, 2007.
Presented at: SOFSEM 2007, 20.2.-26.2.2007, Harrachov,
Czech Republic.
Complexity of data with respect to a particular class of neural networks is studied. Data complexity is measured by the magnitude
of a certain norm of either the regression function induced by a probability measure describing the data or a function interpolating a sample
ofinput/output pairs of training data chosen with respect to this probability. The norm is tailored to a type of computational units in the
network class. It is shown that for data for which this norm is small,
convergence of infima of error functionals over networks with increasing number of hidden units to the global minima is relatively fast. Thus
for such data, networks with a reasonable model complexity can achieve
good performance during learning. For perceptron networks, the relationship between data complexity, data dimensionality and smoothness
is investigated.
Kuthan T., Lánský Jan
Genetic Algorithms in Syllable-Based text Compression
In: Proceedings of the Dateso 2007, CEUR Workshop Proc., 2007, pp. 21-34.
Presented at: Dateso 2007 Annual International Workshop on DAtabases, TExts, Specifications and Objects, 18.4.-20.4.2007, Desná - Černá Říčka,
Czech Republic.
Syllable based text compression is a new approach to compression
by symbols. In this concept syllables are used as the compression
symbols instead of the more common characters or words. This new
technique has proven itself worthy especially on short to middle-length
text files. The effectiveness of the compression is greatly affected by the
quality of dictionaries of syllables characteristic for the certain language.
These dictionaries are usually created with a straight-forward analysis
of text corpora. In this paper we would like to introduce an other way of
obtaining these dictionaries using genetic algorithm. We believe, that
dictionaries built this way, may help us lower the compress ratio. We will
measure this effect on a set of Czech and English texts.
Lánský Jan, Galamboš Leo, Chernik K.
Komprese webového uložiště
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Lánský Jan, Chernik K., Vlčková Z.
Syllable-Based Burrows-Wheeler Transform
In: Proceedings of the Dateso 2007, CEUR Workshop Proc., 2007, pp. 1-10.
Presented at: Dateso 2007 Annual International Workshop on DAtabases, TExts, Specifications and Objects, 18.4.-20.4.2007, Desná - Černá Říčka,
Czech Republic.
The Burrows-Wheeler Transform (BWT) is a compression
method which reorders an input string into the form, which is preferable
to another compression. Usually Move-To-Front transform and then
Huffman coding is used to the permutated string. The original method [3]
from 1994 was designed for an alphabet compression. In 2001, versions
working with word and n-grams alphabet were presented. The newest
version copes with the syllable alphabet [7]. The goal of this article is to
compare the BWT compression working with alphabet of letters, syllables,
words, 3-grams and 5-grams.
Lánský Jan, Žemlička M.
Compression of a Set of Strings
In: Proc. of 2007 Data Compression Conference (DCC 2007), IEEE Computer Society Press, 2007, pp. 390-390.
Presented at: DCC 2007 Data Compression Conference, 27.-29.3.2007, Snowbird, Utah,
USA.
Lánský Jan, Chernik K., Vlčková Z.
Comparison of Text Models for BWT
In: Proc. of 2007 Data Compression Conference (DCC 2007), IEEE Computer Society Press, 2007, pp. 389-389.
Presented at: DCC 2007 Data Compression Conference, 27.-29.3.2007, Snowbird, Utah,
USA.
Linková Zdeňka
Data Integration in VirGIS and in the Semantic Web
Technical Report: V-922, ICS AS CR, Prague, 2005, 11 p.
Integration has been an acknowledged data processing problem for a long time. However, there is no universal tool for general data integration. Because various data descriptions, data heterogeneity, and machine unreadability, it is not easy way. Improvement in this situation could bring the Semantic Web. Its idea is based on machine understandable web data, which bring us an opportunity of better automated processing. The SemanticWeb is still a future vision, but there are already some features we can use. The paper describes how is integration solved in mediation integration system VirGIS and discusses use of nowadays Semantic Web features to improve it. According to the proposed changes, a new ontology that covers data used in VirGIS is presented.
Linková Zdeňka
The Logic Summer School 2004
Technical Report: V-925, ICS AS CR, Prague, 2005, 10 p.
Abstract Logic is the foundational discipline of many sciences. Part mathematics, part philosophy and part computing science, logic remains a core intellectual study and is increasingly relevant to practical concerns. It spreads into planning, into program synthesis, into circuit design and into discourse analysis. It underpins the entire science of artiŻcial intelligence. In order to increase knowledge from the field of logic, I participated in the Logic Summer School. This report covers some information.
Linková Zdeňka
Integrace dat a sémantický web
In: Doktorandský den `04, MATFYZPRESS, 2004, pp. 66-74.
ISBN: 80-86732-30-4
Presented at: Institute of Computer Science Ph.D. Student`s Days 04, 29.09.-01.10.2004, Paseky nad Jizerou,
Czech Republic.
World Wide Web obsahuje data, která jsou pro počítačové programy nesrozumitelná. Následkem toho je na něm obtížné některé věci zautomatizovat. Nedostatky současného webu by měl odstranit sémantický web, ve kterém data budou mít přesně popsaný význam. Zlepšení může přinést také v oblasti integrace, která je v případě dat pocházejících z webu velmi obtížná. Tento článek se zabývá integrací webových dat. Zaměřuje se na relační data ve formátu XML a navrhuje postupy základních integračních operací.
Linková Zdeňka, Nedbal Radim, Řimnáč Martin
Building Ontologies for GIS
Technical Report: V-932, ICS AS CR, 2005, 9 p.
Knowledge representation in geographic information systems (GIS) and associated data processing presents many challenges for researchers. To use ontologies as knowledge representation belongs to the most topical problems to solve. This involves ontology development as well as ontology re-usage. The goal of the research described in this paper is to develop a specific ontology for a given GIS area.
Linková Zdeňka, Nedbal Radim
Building Ontologies for GIS - Part 2
Technical Report: V-938, ICS AS CR, 2005, 12 p.
Ontologies play an important role in knowledge representation. Among various fields, where ontologies can be useful, is the GIS data area. We consider data in a specific GIS domain and develop a new ontology. The result is described in this paper.
Linková Zdeňka, Nedbal Radim
VirGIS Data in Semantic Web Environment
In: Proceedings of SOFSEM 2006, Volume: II, ICS AS CR, Prague, 2006, pp. 120-127.
ISBN: 80-903298-4-5
Presented at: SOFSEM 2006: Theory and Practice of Computer Science, 21.1.-27.1.2006, Měřín,
Czech Republic.
A crucial point in automated data processing is the way in which the data are expressed. One possibility is to employ existing features of the Semantic Web - ontologies. Ontologies play an important role in a knowledge representation.
The aim of the research presented in this paper is to provide more automated VirGIS system. VirGIS is an integration system that works with GIS (Geographical Information Systems) data. As a first step of our research, we describe its data using common Semantic Web techniques and build a VirGIS ontology.
Linková Zdeňka
Data Integration in VirGIS and in the Semantic Web
In: Doktorandský den 05, (Ed. Hakl F.), MATFYZPRESS, Prague, 2005, pp. 87-93.
ISBN: 80-86732-56-8
Presented at: Institute of Computer Science Ph.D. Student`s Days 05, 5.10.-7.10.2005, Nový Dvůr,
Czech Republic.
Integration has been an acknowledged data processing problem for a long time. However, there is no universal tool for general data integration. Because various data descriptions, data heterogeneity, and machine unreadability, it is not easy way. Improvement in this situation could bring the Semantic Web. Its idea is based on machine understandable web data, which bring us an opportunity of better automated processing. The SemanticWeb is still a future vision, but there are already some features we can use. The paper describes how is integration solved in mediation integration system VirGIS and discusses use of nowadays Semantic Web features to improve it. According to the proposed changes, a new ontology that covers data used in VirGIS is presented.
Linková Zdeňka, Nedbal Radim
Building Ontology for VirGIS System
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 233-242.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
Ontologies play an important role in a knowledge representation. It involves ontology development as well as ontology re-use. Among various fields, where ontologies can be useful, is the GIS (Geographical Information System) data area. The goal of the research described in this paper is to develop a specific ontology for a given GIS domain. At first, we describe a general methodology and main tools for ontology development. Then a new ontology that covers data used in a VirGIS integration system is presented. The paper describes the VirGIS specified ontology as well as a list of spatio-temporal data ontologies that are available and possible to use for a general data features description.
Linková Zdeňka
Ontology-based Integration System
In: Doktorandský den 06, (Ed. F. Hakl), MATFYZPRESS, 2006, pp. 57-63.
ISBN: 80-86732-87-8
Presented at: Doktorandský den 06, 20.9.-22.9.2006, Monínec, Sedlec-Prčice,
Czech Republic.
Integration has been an acknowledged problem for a long time. With the aim at combining data from different sources, data integration usually provides a unified global view over these data. A crucial part of the task is the establishment of the connection between the global view and the local sources. Two basic approaches have been proposed for this purpose: Global As View (GAV) and Local As View (LAV).With the Semantic Web and its data description means, there is also another possibility - to employ ontologies for the relationship description in an integration system.
Linková Zdeňka
European Summer School in Information Retrieval ESSIR 2005
Technical Report: V-949, ICS AS CR, Prague, 2006, 8 p.
Information Retrieval (IR) as a process of searching relevant information is a significant discipline of a data processing field. European Summer School in Information Retrieval ESSIR provides students, academic and industrial researchers and developers a grounding in the core objects of IR (models, architectures, algorithms), as well as covering some current topics, e.g. information retrieval from the Web. We have participated its 5th year that was held at Dublin City University in Dublin, Ireland.
Linková Zdeňka
Integrace dat v prostředí Sémantického Webu
In: Sborník workshopu doktorandů FJFI oboru Matematické inženýrství, (Ed. P. Ambrož, Z. Masáková), 2006, pp. 89-98.
Presented at: Doktorandské dny 2006, 10.11.2006 and 24.11.2006, Prague,
Czech Republic.
Datová integrace je uznávaný problém v oblasti zpracování dat. Jejím cílem je obvykle
poskytnout ucelený pohled na několik datových zdrojů. V případě nematerializovaného řešení je
klíčové stanovení vazeb mezi poskytovaným virtuálním pohledem a daty uloženými ve zdrojích.
článek se zabývá řešením stanovení těchto vazeb. Svůj přístup zakládá na ontologiích.
Linková Zdeňka, Nedbal Radim
Ontology approach to integration of geographical data
In: WETDAP 2007, Proceedings of the 1st Workshop Evolutionary Techniques in Data-processing, In Conjunction with Znalosti (Knowledge) 2007, Faculty of Electrical Engineering and Computer Science, VŠB - Technical University of Ostrava, Ostrava, 2007, pp. 35-41.
Presented at: Workshop Evolutionary Techniques in Data-processing, Associated with ZNALOSTI 2007 conference
, 21.-23.2.2007, Ostrava,
Czech Republic.
A key point in modern automated data processing is metadata semantics representation. Employing Semantic Web existing features - ontologies - is a promising option. Ontologies open a novel approach to knowledge representation.
The paper presents a GIS (Geographic Information System) domain application illustrating ontological approach to data integration and data
processing automation in the specific system. This VirGIS system is an integration system that works with spatio-temporal data. We start our
study with developing the data representation based on common Semantic Web techniques and build a VirGIS ontology.
Linková Zdeňka
Ontology-Based Schema Integration
In: Proceedings of SOFSEM 2007, ICS AS CR, Prague, 2007, pp. 71-80.
Presented at: SOFSEM 2007, 20.2.-26.2.2007, Harrachov,
Czech Republic.
Data integration usually provides a unified global view over
several data sources. A crucial part of the task is the establishment of the
connection between the global view and the local sources. For this purpose, two basic mapping approaches have been proposed: GAV (Global
As View) and LAV (Local As View). On the Semantic Web, there can
be considered also an ontological approach.
In this paper, data integration is solved using ontologies of the sources. To
express relationships between the global view and local source schemas,
an ontology for the integration system is built. Thus, a schema integration task is transformed to an ontology merging task.
Linková Zdeňka
Integrace dat na sémantickém webu
In: Doktorandské dny '08, (Ed. F. Hakl), MATFYZPRESS, 2008, pp. 61-68.
ISBN: 978-80-7378-054-8
Presented at: Doktorandské dny 2008, 29.9.-1.10.2008, Jizerka,
Czech Republic.
V tomto příspěvku je popsán přístup k virtuální integraci dat využívající současných principů, metod a nástrojů sémantického webu.
Přístup pracuje s daty ve formátu RDF a předpokládá dostupnost ontologií, které je popisují.
Ontologie jsou základem pro všechny kroky prezentovaného integračního procesu. Jsou využity jak k určení vztahů mezi daty a poskytovaným integrovaným pohledem,
tak i k zápisu nalezených korespondencí. Ty jsou dále použity při zpracování dotazů kladených na integrovaná data.
Linková Zdeňka
Schema Matching in the SemanticWeb Environment
In: Doktorandský den 07, (Ed. F. Hakl), MATFYZPRESS, 2007, pp. 36-42.
Presented at: Doktorandské dny 2007, 17.-19.9.2007, Malá Úpa,
Czech Republic.
The paper deals with one step of non-materialized data integration - schema matching task. It works with data
sources on the Semantic Web; the crucial assumption for the considered task is available ontologies describing data
to integrate. Source ontologies are used to find correspondences between source schemas elements. For this, also
techniques known from ontology alignment and ontology merging field are used.
Linková Zdeňka
Mapování schémat v prostředí Sémantického webu
In: Doktorandské dny na KM FJFI 07, 2007, pp. 117-126.
ISBN: 978-80-01-03913-7
Článek se zabývá úlohami, které je třeba řešit při nematerializované
integraci dat. Zaměřuje se na hledání korespondencí mezi schématy a
mapování schémat. Návrh přístupu řešení těchto úloh na Sémantickém
webu těží z dostupných ontologiích popisujících integrované zdroje.
Ontologie jsou využity jak k hledání mapování, tak i při jejich
popisu.
Linková Zdeňka, Řimnáč Martin
Automatizovaný návrh pravidel pro integraci dat a sémantický web
In: Znalosti 2008, (Ed. V. Snášel), Vydavatelstvo STU, Bratislava, 2008.
Presented at: Znalosti 2008, 13.-15.2.2008, Bratislava,
Slovakia.
Článek se zabývá přístupem, jak se pokusit zautomatizovat mnohdy netriviální úlohu nalezení pravidel pro integraci dat. Předkládaný přístup automaticky generuje kandidáty pravidel včetně jejich
ohodnocení pomocí nepřímé míry definující jejich prioritu. Priorita může
následně být použita buďto návrhářem (člověkem) jako pomocný prvek
pro přípravu návrhu, nebo při automatickém návrhu integračního procesu zahrnující pravidla s maximální prioritou. Studie v příspěvku se
detailně věnuje dvěma základním typům pravidel, ekvivalenci a hierarchii, přičemž ohodnocení kandidátů je založeno na (strukturální) analýze
aktivních domén atributů. V neposlední řadě příspěvek ukazuje možnost
decentralizovaného přístupu k integraci dat, jenž je inspirován webovými
technologiemi.
Lokoč Jakub, Skopal Tomáš
On Reinsertions in M-tree
In: Proc. of 1st international workshop on Similarity Search and Applications, IEEE Computer Society Press, 2008, pp. 121-128.
ISBN: 0-7695-3101-6
Presented at: SISAP 2008: 1st international workshop on Similarity Search and Applications, 11.-12.4.2008, Cancun, Mexico.
Lokoč Jakub, Skopal Tomáš
NM-tree: Flexible Approximate Similarity Search in Metric and Non-metric Spaces
In: Proc. of 19th International Conference on Database and Expert Systems Applications, LNCS 5181, Springer-Verlag, Berlin, 2008, pp. 312-325.
ISBN: 978-3-540-85653-5
Presented at: DEXA 2008: 19th International Conference on Database and Expert Systems Applications, 1.-5.9.2008, Turin, Italy.
Lokoč Jakub, Skopal Tomáš
On Reinsertions in M-tree
In: 1st International Workshop on Similarity Search and Applications (SISAP 2008), IEEE, 2008.
(in_print)
Presented at: SISAP 2008 - Workshop at ICDE 2008, 11.-12.04.2008, Cancun,
Mexico.
In this paper we introduce a new M-tree building method, utilizing the classic idea of forced reinsertions. In case a leaf is about to split, some distant objects are removed from the leaf (reducing the covering radius), and then again inserted into the M-tree in a usual way. A regular leaf split is performed only after a series of unsuccessful reinsertion attempts. We expect the forced reinsertions will result in more compact Mtree hierarchies (i.e., more efficient query processing), while the index construction costs should be kept as low as possible. Considering both low construction costs and low querying costs, we examine several combinations of construction policies with reinsertions. The experiments show that forced reinsertions could significantly decrease the number of distance computations, thus speeding up indexing as well as querying.
Matousek T., Zavoral Filip
Extracting Zing Models from C Source Code
In: SOFSEM 2007, LNCS 4362, Springer, Berlin, 2007, pp. 900-910.
Presented at: SOFSEM 2007, 20.2.-26.2.2007, Harrachov,
Czech Republic.
In the paper, we propose an approach to an automatic extraction of verification models for the C language source code. We primarily focus on the representation of pointers and arrays, which make the extraction from the C language specific. We provide an implementation of the model extractor as a part of our broader effort to develop a verifier of Windows kernel drivers based on the Zing model checker. To demonstrate the feasibility of our approach, we give examples of the extraction results on a practical synchronization problem.
Mlýnková Irena
An Analysis of Approaches to XML Schema Inference
In: Proc. of 4th International Conference on Signal-Image Technology and Internet-Based Systems, IEEE Computer Society Press, 2008.
ISBN: 0-7695-3101-6
(in_print)
Presented at: SITIS 2008: 4th International Conference on Signal-Image Technology and Internet-Based Systems, 30.11.-3.12.2008, Bali, Indonesia.
Mlýnková Irena
Equivalence of XSD Constructs and its Exploitation in Similarity Evaluation
In: Proc. of 7th International Conference on Ontologies, DataBases, and Applications of Semantics, LNCS 5332, Springer-Verlag, Berlin, 2008, pp. 1252-1269.
ISBN: 978-3-540-85712-9
(in_print)
Presented at: ODBASE 2008: 7th International Conference on Ontologies, DataBases, and Applications of Semantics, 11.-13.11.2008, Monterrey, Mexico.
Mlýnková Irena
Similarity of XML Schema Definitions
In: Proc. of 8th ACM Symposium on Document Engineering, ACM Press, Berlin, 2008, pp. 187-190.
ISBN: 978-1-60558-081-4
Presented at: DocEng 2008: 8th ACM Symposium on Document Engineering, 16.-19.9.2008, Sao Paulo, Brazil.
Mlýnková Irena
Current Trends in Testing XMLMSs
In: Proc. of 17th International Conference on Information Systems Development, Springer Science + Business Media, Inc., Berlin, 2008.
ISBN: 978-3-540-85712-9
(in_print)
Presented at: ISD 2008: 17th International Conference on Information Systems Development, 25.-27.8.2008, Paphos, Cyprus.
Mlýnková Irena
XML Schema Inference: A Study
Technical Report: 2008/6, Dep. of Software Engineering, MFF, Charles University, Prague, 2008, 18 p.
Mlýnková Irena
UserMap - an Enhancing of User-Driven XML-to-Relational Mapping Strategies
Technical Report: 2007/3, Charles University, Prague, 2007, 38 p.
As XML has undoubtedly become a standard for data representation, it is inevitable to propose and implement techniques for
efficient managing of XML data. A natural alternative is to exploit features and functions of (object-)relational database systems, i.e. to rely
on their long theoretical and practical history. The main concern of such
techniques is the choice of an appropriate XML-to-relational mapping
strategy.
In this paper we focus on enhancing of user-driven techniques which
leave the mapping decisions in hands of users. We propose an algorithm
which exploits the user-given annotations more deeply searching the
user-specified "hints" in the rest of the schema and applies an adaptive
method on the remaining schema fragments. We describe the proposed
algorithm, the similarity measure designed for this purpose, sample implementation of key features of the proposal called UserMap, and results
of experimental testing on real XML data.
Mlýnková Irena, Pokorný Jaroslav
From XML Schema to Object-Relational Database – An XML Schema-Driven Mapping Algorithm
In: Proceedings of the IADIS International Conference WWW/Internet, (Ed. Isaias P., Karmakar N.), IADIS, 2004, pp. 115-122.
Presented at: IADIS International Conference WWW/Internet 2004, 06.-09. 10. 2004, Madrid,
Spain.
Since XML becomes a crucial format for representing information, it is necessary to establish techniques for managing XML documents. A possible solution can be found in storing XML data in (object-)relational databases. For this purpose most of the existing techniques often exploit an XML schema of the stored XML data, usually expressed in DTD. But the more complex today’s applications are, the more insufficient the DTD becomes and the necessity to use XML Schema language becomes more essential. The paper proposes an algorithm for mapping XML Schema structures to an object-relational database schema (defined by the SQL:1999 standard) using a (modified) DOM interface and an algorithm for storing the valid XML data into relations of the resulting schema. The main aim is to exploit object-oriented features XML Schema has and the advantages of object-relational databases and to preserve the structure as well as semantic constraints of the source schema in the target schema.
Mlýnková Irena, Pokorný Jaroslav
XML in the World of (Object-) Relational Database Systems
In: Information Systems Development Advances in Theory, Practice and Education, (Ed. Vasilecas O. et al.), Kluwer, 2004.
ISBN: 0-387-25026-3
Presented at: 13th International Conference on Information Systems Development, ISD`2004, 9.9.-11.9. 2004, Vilnius,
Lithuania.
Mlýnková Irena, Toman Kamil, Pokorný Jaroslav
Statistical Analysis of Real XML Data Collections
Technical Report: 2006/5, MFF UK, Prague, 2006, 39 p.
Recently XML has achieved the leading role among languages for data representation and thus we can witness a massive boom of corresponding techniques for managing XML data. Most of the processing techniques however suffer from various bottlenecks worsening their time and/or space efficiency.We assume that the main reason is they consider XML collections too globally, involving all their possible features, although real data are often much simpler. Even though some techniques do restrict the input data, the restrictions are often unnatural. In this paper we analyze existing XML data, their structure and real complexity in particular.We have gathered more than 20GB of real XML collections and implemented a robust automatic analyzer. The analysis considers existing papers on similar topics, trying to confirm or confute their observations as well as to bring new findings. It focuses on frequent but often ignored XML items (such as mixed content or recursion) and relationship between schemes and their instances.
Mlýnková Irena
XML Data in (Object-)Relational Databases
In: Diploma Thesis, Charles University, Prague, 2007, pp. 142.
Mlýnková Irena, Toman Kamil, Pokorný Jaroslav
Statistical Analysis of Real XML Data Collections
In: Proceeding of the 13th International Conference on Management of Data - COMAD 2006, (Ed. Lakshmanan, L.L., Roy, P., Tung, A.), Tata McGraw Hill Publ. Comp., Delhi, 2006, pp. 20-31.
Presented at: 13th International Conference on Management of Data - COMAD 2006, 14.12.-16.12.2006, Delhi,
India.
Mlýnková Irena
An XML-to-Relational User-driven Mapping Strategy Based on Similarity and Adaptivity
In: Proc. of SYRCoDIS `07 4th Spring Young Researchers Colloquium on Databases and Information Systems, Volume: 256, CEUR Woskhop Proc., 2007, pp. 9-20.
Presented at: SYRCoDIS`07, 31.5.-1.6.2007, Moscow,
Russia.
As XML has become a standard for data representation,
it is inevitable to propose and implement
techniques for efficient managing of XML
data. A natural alternative is to exploit features
and functions of (object-)relational database
systems, i.e. to rely on their long theoretical
and practical history. The main concern of
such techniques is the choice of an appropriate
XML-to-relational mapping strategy.
In this paper we focus on enhancing of userdriven
techniques which leave the mapping decisions
in hands of users. We propose an algorithm
which exploits the user-given annotations
more deeply searching the user-specified
“hints” in the rest of the schema and applies an
adaptive method on the remaining schema fragments.
We describe the algorithm theoretically,
discussing the key ideas of the approach, chosen
solutions, their reasons, and consequences.
Finally, we overview the open issues related to
implementation of the proposed algorithm and
its experimental testing on real XML data.
Mlýnková Irena, Pokorný Jaroslav
Similarity and XML Technologies
In: Proc. of IADIS International Conference WWW/Internet 2007, (Ed. Isaias P., Nunes M.B., Barroso J.), IADIS, 2007, pp. 277-287.
Presented at: WWW/Internet 2007, 5.-8.10.2007, Vila Real,
Portugal.
As XML technologies have undoubtedly become a standard for data representation, it is inevitable to provide efficient implementations of W3C recommendations. A possible optimization of particular types of techniques can be found in exploitation of similarity of XML data and/or matching of XML patterns. In this paper we provide an overview and classification of such techniques from various points of view. We briefly describe the best known representatives of particular ideas and we discuss their key advantages and disadvantages. The text should serve as a good starting point for proposing an appropriate similarity-based optimization.
Mlýnková Irena, Pokorný Jaroslav
Similarity of XML Schema Fragments Based on XML Data Statistics
In: Proc. of Innovations '07: Proceedings of the 4th International Conference on Innovations in Information Technology, IEEE Computer Society Press, 2007, pp. 243-247.
Presented at: 4th International Conference on Innovations in Information Technology, 18.-20.11.2007, Dubai,
United Arab Emirates.
As XML has become a standard for data representation, it can be found in plenty of information technologies. A possible optimization of XML-based approaches can be exploitation of similarity of XML data. In this paper we propose a technique for evaluating similarity of XML schema fragments focusing on two often omitted aspects - structural level of similarity and tuning of parameters of the similarity measure. In the former case we exploit the results of statistical analysis of real-world XML data. In the latter case we show that the tuning problem is a kind of constraints optimization problem and can be solved using corresponding approaches. We have analyzed (dis) advantages of two of them, genetic algorithms and simulated annealing, and in further experiments we show that appropriate tuning produces a more precise similarity measure.
Mlýnková Irena, Pokorný Jaroslav
UserMap - an Adaptive Enhancing of User-Driven XML-to-Relational Mapping Strategies
In: ADC '08: Proceedings of the 19th Australasian Database Conference, Volume: 75, Australia Computer Society, Wollongong, New South Wales, 2008, pp. 165-174.
Presented at: ADC '08: 19th Australasian Database Conference, 22.-25.01.2008, Wollongong, New South Wales,
Australia.
As the XML has become a standard for data representation, it is inevitable to propose and implement techniques for efficient managing of XML data. A natural alternative is to exploit features of (object-)relational database systems, i.e. to rely on their long theoretical and practical history. The main concern of such techniques is the choice of an appropriate XML-to-relational mapping strategy. In this paper we focus on enhancing of user-driven techniques which leave the mapping decisions in hands of users who specify their requirements using schema annotations. We describe our prototype implementation called UserMap which is able to exploit the annotations more deeply searching the user-specified “hints” in the rest of the schema and applies an adaptive method on the remaining schema fragments. Using a sample set of supported fixed mapping methods we discuss problems related to query evaluation for storage strategies generated by the system, in particular correction of the candidate set of annotations and related query translation. And finally, we describe the architecture of the whole system.
Mlýnková Irena
UserMap - an Exploitation of User-Specified XML-to-Relational Mapping Requirements and Related Problems
Technical Report: 2007/8, Charles University, Prague, 2007, 26 p.
As the XML has become a standard for data representation, it is inevitable
to propose and implement techniques for efficient managing of XML
data. A natural alternative is to exploit features of (object-)relational database systems,
i.e. to rely on their long theoretical and practical history. The main concern
of such techniques is the choice of an appropriate XML-to-relational mapping
strategy.
In this paper we focus on enhancing of user-driven techniques which leave the
mapping decisions in hands of users who specify their requirements using schema
annotations.We describe our prototype implementation called UserMap which is
able to exploit the annotations more deeply searching the user-specified “hints” in
the rest of the schema and applies an adaptive method on the remaining schema
fragments. Using a sample set of supported fixed mapping methods we discuss
problems related to query evaluation for storage strategies generated by the system,
in particular correction of the candidate set of annotations and related query
translation. And finally, we describe the architecture of the whole system.
Nečaský Martin, Pokorný Jaroslav
Conceptual Modeling of IS-A Hierarchies for XML
In: Proc. of 18th European Japanese Conference on Information Modelling and Knowledge Bases, EJC2008 Program Comitee and EJC2008 Program Coordination team, 2008, pp. 65-84.
ISBN: 978-3-540-85712-9
(in_print)
Presented at: EJC 2008: 18th European Japanese Conference on Information Modelling and Knowledge Bases, 2.-6.6.2008, Tsukuba, Japan.
Nečaský Martin
Conceptual Model Based Normalization of XML Views
In: Proc. of DATESO 2008, (Ed. J. Pokorný, V. Snášel, K. Richta), CEUR Workshop Proc., 2008, pp. 13-24.
Presented at: Dateso 2008: Annual International Workshop on DAtabases, TExts, Specifications and Objects, 16.4.-18.4.2008, Desná - Černá Říčka,
Czech Republic.
As the popularity of XML as a format for data representation grows the need for storing XML data in an effective way grows as well. Recent research has provide us with effeective solutions based on storing XML data into relational databases and with new technologies based on storing XML data in the native form. However, design of XML databases has not been studied su±ciently yet. In this paper, we suppose a set of XML schemes that describe XML representation of our data in several types of XML documents. We show that we can not usually store the data directly in this representation because it can contain redundancies. To design an optimal database schema we therefore need to locate these redundancies and eliminate them.We describe two types of redundancies in XML data in this paper and show how to utilize a conceptual schema of the XML schemes to locate such redundancies. We also show how to normalize the XML schemes to eliminate these redundancies.
Nečaský Martin
Conceptual modeling for XML
In: Diploma Thesis, Charles University, Prague, 2007, pp. 153 p..
Nečaský Martin
Conceptual Modeling for XML: A Survey
Technical Report: 2006-3, Dep. of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague, 2006, 54 p.
Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. However, XML as a logical database model has some special characteristics which makes existing conceptual models as E-R or UML unsuitable. In this paper, the current approaches to the conceptual modeling of XML data are described in an uniform style. A list of requirements for XML conceptual models is presented and described approaches are compared on the base of the requirements.
Nečaský Martin
Conceptual Modeling for XML: A Survey
In: Proceedings of the Dateso 2006, CEUR-WS, 2006, pp. 40-53.
Presented at: Dateso 2006 Annual International Workshop on DAtabases, TExts, Specifications and Objects, 26.4.-28.4.2006, Desná - Černá Říčka,
Czech Republic.
Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. However, XML as a logical database model has some special characteristics which makes existing conceptual models as E-R or UML unsuitable. In this paper, the current approaches to the conceptual modeling of XML data are described in an uniform style. A list of requirements for XML conceptual models is presented and described approaches are compared on the base of the requirements.
Nečaský Martin
XSEM – A Conceptual model for XML Data
In: Proceedings of Communications and Doctoral Consortium, 7th International Baltic Conference on Databases and Information Systems, Vilnius, 2006, pp. 328-331.
Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. In this paper, we describe our work on a new conceptual model for XML called XSEM created as a combination of several approaches applied in the area of conceptual modeling for XML.
Nečaský Martin
XSEM - A Conceptual Model for XML Data
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 60-69.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
In this paper we briefly describe a new conceptual model
for XML data called XSEM. The model is a combination of several approaches in the area of conceptual modeling of XML data. The model
divides the process of conceptual modeling of XML data to two levels.
On the first level, a designer designs an overall non-hierarchical conceptual schema of a domain. On the second level, he or she derives different
hierarchical representations of parts of the overall conceptual schema using transformation operators. These hierarchical representations describe
how the data is organized in an XML form.
Nečaský Martin
XSEM - A Conceptual Model for XML
In: Proceedings of the Fourth Asia-Pacific Conference on Conceptual Modelling (APCCM 2007) , (Ed. Roddick J. F., Annika H.), 2007, pp. 37-48.
Presented at: The Fourth Asia-Pacific Conference on Conceptual Modelling (APCCM 2007), 30.1.-2.2.2007, Ballarat, Victoria,
Australia.
We propose a new conceptual model for XML data
called XSEM as a combination of several approaches
in the area of the conceptual modeling for XML.
The model divides the conceptual modeling process of
XML data to two levels. On the first level, a designer
designs an overall non-hierarchical conceptual schema
of a domain. On the second level, he or she derives
different hierarchical representations of parts of the
overall conceptual schema using transformation op-
erators. These hierarchical representations describe
how the data is organized in an XML form.
Nečaský Martin
Using XSEM for Modeling XML Interfaces of Services in SOA
In: Proceedings of the Dateso 2007, CEUR Workshop Proc., 2007, pp. 35-46.
Presented at: Dateso 2007 Annual International Workshop on DAtabases, TExts, Specifications and Objects, 18.4.-20.4.2007, Desná - Černá Říčka,
Czech Republic.
In this paper we briefly describe a new conceptual model for
XML data called XSEM and how to use it for modeling XML interfaces
of services in service oriented architecture (SOA). The model is a
combination of several approaches in the area of conceptual modeling of
XML data. It divides the process of conceptual modeling of XML data to
two levels. The first level consists of designing an overall non-hierarchical
conceptual schema of the domain. The second level consists of deriving
different hierarchical representations of parts of the overall conceptual
schema using transformation operators. Each hierarchical representation
models an XML schema describing the structure of the data exchanged
between a service interface and external services.
Nečaský Martin, Pokorný Jaroslav
Extending E-R for Modelling XML Keys
In: Proc. of IEEE ICDIM 2007: Proc. of The Second International Conference on Digital Information Management, IEEE Computer Society, 2007, pp. 236-241.
Presented at: ICDIM 2007: The Second International Conference on Digital Information Management, 28.-31.10.2007, Lyon,
France.
With the growing popularity of XML there is a need not only to describe the structure of XML data but also its semantics. For the conceptual modelling of XML we can use existing conceptual models. However, special features of XML require extensions of these models. In this paper, we study conceptual modelling of XML keys. We extend the notion of E-R keys to be suitable for modelling the semantics of XML keys and we show how to express them on the XML logical level.
Nečaský Martin, Pokorný Jaroslav
Design and Management of Semantic Web Services using Conceptual Model
In: Proceedings of The 23rd Annual ACM Symposium on Applied Computing (SAC 2008), Volume: 3, Fortaleza, Ceará, 2008, pp. 2243-2247.
Presented at: 23rd Annual ACM Symposium on Applied Computing, 16.-20.3.2008, Fortaleza,
Brazil.
Nedbal Radim
Relational Databases with Ordered Relations
In: Logic Journal of the IGPL, Volume: 13, 2005, pp. 587-597.
Presented at: ERCIM 2004, 12.-17.07.2004, Vienna,
Austria.
The paper deals with expressing preferences in the framework of the relational data model. Preferences have usually a form of a partial ordering. Therefore the question arises how to provide the relational data model with such an ordering.
Nedbal Radim
Relational Databases with Ordered Relations
In: Doktorandský den `04, MATFYZPRESS, 2004, pp. 75-83.
ISBN: 80-86732-30-4
Presented at: Institute of Computer Science Ph.D. Student`s Days 04, 29.09.-01.10.2004, Paseky nad Jizerou,
Czech Republic.
This paper describes an option to express our preferences in the framework of relational databases. Preferences have usually a form of a partial ordering. Therefore the question is how to deliver the semantics of ordering to a database system. The answer is quite straightforward.
Nedbal Radim
General Relational Data Model with Preferences
In: Doktorandský den 06, (Ed. F. Hakl), MATFYZPRESS, 2006, pp. 78-84.
ISBN: 80-86732-87-8
Presented at: Doktorandský den 06, 20.9.-22.9.2006, Monínec, Sedlec-Prčice,
Czech Republic.
The aim of the paper is to present a novel, general approach to preference modelling in the framework
of the relational data model. To allow nonmonotonic operations, the preferences are defined between
sets of relational instances. The aim is the generalization of the relational algebra that is as minimal as
possible, in the sense that the formal fundamentlas of the relational data model are preserved. At the same
time, the extended model should be formal enough to provide a sound basis for the investigation of other
new preference constructors and operations and for new possible applications.
Nedbal Radim
Model of Preferences for the Relational Data Model
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 70-77.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
The aim of the paper is to present a novel, general approach
to preference modelling in the framework of the relational data model.
The preferences are defined between sets of relational instances, which
is a nontrivial generalization of the approach aiming at incorporating
ordered attribute domains into the relational data model. The main goals
are as follows: an effective representation of information representable by
a partial order, an intuitive preference construction and its processing
throughout the query execution plan, and a suitable data structure to
support it all.
Nedbal Radim
Model Preferences over the Relational Data Model
In: Sborník workshopu doktorandů FJFI oboru Matematické inženýrství, (Ed. P. Ambrož, Z. Masáková), 2006, pp. 119-129.
Presented at: Doktorandské dny 2006, 10.11.2006 and 24.11.2006, Prague, Czech Republic.
Nedbal Radim
User Preference and Optimization of Relational Queries
In: Doktorandské dny '08, (Ed. F. Hakl), MATFYZPRESS, 2008, pp. 82-87.
ISBN: 978-80-7378-054-8
Presented at: Doktorandské dny 2008, 29.9.-1.10.2008, Jizerka,
Czech Republic.
The notion of preference poses a new prospect of personalization of database queries. In addition, it can be exploited to optimize query execution.
Indeed, a novel optimization technique involving preference is developed, and its algorithm presented.
Nedbal Radim
Various Kinds of Preferences in Database Queries
In: Doktorandský den 07, (Ed. F. Hakl), MATFYZPRESS, 2007, pp. 49-59.
Presented at: Doktorandské dny 2007, 17.-19.9.2007, Malá Úpa,
Czech Republic.
The paper resumes recent advances in the
field of logic of preference and presents their
application in the field of database queries.
Namely, non-monotonic reasoning mechanisms
including various kinds of preferences are reviewed,
and a way of suiting them to practical
database applications is shown: reasoning including
sixteen strict and non-strict kinds of preferences,
inclusive of ceteris paribus preferences,
is feasible. However, to make the mechanisms
useful for practical applications, the assumption
of preference specification consistency
has to be relinquished. This is achieved in two
steps: firstly, all the kinds of preferences are de-
fined so that some uncertainty is inherent, and
secondly, not a notion of a total pre-order but a
partial pre-order is used in the semantics, which
enables to indicate some kind of conflict among
preferences. Most importantly, the semantics of
a set of preferences is related to that of a disjunctive
logic program.
Nedbal Radim
Algebraic Optimization of Database Queries with Preferences
In: Doktorandské dny na KM FJFI 07, 2007, pp. 157-167.
ISBN: 978-80-01-03913-7
The paper resumes a logical framework for formulating preferences and proposes
their embedding into relational algebra through a single preference operator parameterized by
a set of user preferences of sixteen various kinds, inclusive of ceteris paribus preferences, and
returning only the most preferred subsets of its argument relation. Most importantly, conflicting
set of preferences is permitted and preferences between sets of elements can be expressed.
Formal foundation for algebraic optimization, applying heuristics like push preference, also
is provided: abstract properties of the preference operator and a variety of algebraic laws
describing its interaction with other relational algebra operators are presented.
Nedbal Radim
Non-monotonic reasoning with Various Kinds of Preferences in the Relational data Model Framework
In: Proceedings of ITAT 2007, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), PONT s.r.o., Seňa, 2007, pp. 15-20.
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2007, 21.-27.9.2007, Polana,
Slovakia.
The paper gives an overview of recent advances
in the field of logic of preference and discusses their applicability
in the frame of the relational data model. Namely,
non-monotonic reasoning mechanisms with various kinds
of preferences are reviewed in detail, and a way of suiting
them to practical database applications is presented.
These mechanisms enable to reason simultaneously about
sixteen strict and non-strict kinds of preferences, including
ceteris paribus preferences. To make the mechanisms
useful for practical applications, the assumption of preference
specification consistency has to be loosened. This is
achieved in two steps: firstly, all the preference specifications
are generalized to permit uncertainty, and secondly,
not a total pre-order on worlds but a partial pre-order on
worlds is used in the semantics, which enables to indicate
some kind of conflict among worlds by their incomparability.
Most importantly, the semantics of set of preferences
is related to that of a disjunctive logic program.
Nedbal Radim
Algebraic optimization of relational queries with various kinds of preferences
In: SOFSEM 2008: Theory and Practice of Computer Science, LNCS 4910, Springer, 2008, pp. 388-399.
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
Neruda Roman, Krušina Pavel
A Framework for Modelling and Estimating Complexity in Multi-Agent Systems
In: Paralel and Distributed Computing and Systems, ACTA Press, 2004, pp. 602-607.
ISBN: 088986-423-3
Presented at: PDCS 2004 IASTED International Conference on Parallel and Distributed Computing Systems (16.), 09.-11.11.2004, Cambridge, MIT,
USA.
Multi-agent systems typically utilize a non-blocking asynchronous communication in order to achieve required flexibility and adaptability. High performance computing techniques exploit the current hardware ability of overlapping asynchronous communication with computation to load the available computer resources efficiently. On the contrary, widely used parallel processes modeling methodologies do not often allow for an asynchronous communication description. At the same time those models do not allow their user to select the granularity level and provide only a fixed set of machine and algorithm description quantities. In this work we addressed this issue and designed a new parallel processes modeling methodology. Its main features include an open set of atomic operations that are calculated and predicted for the algorithm in question, and the computer aided semi-automatic measuring of operation counts and approximation of cost functions. This allows not only for tuning the model granularity as well as accuracy according to user needs, but also to reach a such description complexity that would be very difficult to obtain without any computer aid.
Neruda Roman, Krušina Pavel, Kudová Petra, Rydvan Pavel, Beuster Gerd
Bang3: A Computational Multi-Agent System
In: Intelligent Agent Technology. Piscataway, Piscataway, IEEE, 2004, pp. 563-564.
ISBN: 0-7695-2101-0
Presented at: IEEE/WIC/ACM - Intelligent Agent Technology, 20.-24.09.2004, Peking,
China.
A multi-agent system targeted toward the area of computational intelligence modeling is presented. The purpose of the system is to allow both experiments and high-performance distributed computations employing hybrid computational models. The focus of the system is the interchangeability of computational components, their autonomous behavior, and emergence of new models.
Neruda Roman, Vaculín Roman
Concept nodes architecture within the Bang3 system
Technical Report: V-947, ICS AS CR, 2005
In this paper we present an architecture for decision making of software agents that allows the agent to behave autonomously. Our target area is computational agents — encapsulating various neural networks, genetic algorithms, and similar methods — that are expected to solve problems of different nature within an environment of a hybrid computational multi-agent system. The architecture is based on the vertically-layered and belief-desire-intention architectures. Several experiments with computational agents were conducted to demonstrate the benefits of the architecture.
Neruda Roman, Farský Miroslav, Neruda Martin
Mass and energy flows in consequences of company environmental acconting (abstract)
In: Environmental Accounting - Sustainable Development Indicators, International Conference EA-SDI 2005, Collection of Abstracts, (Ed. Ritschelová I.), Jan Evangelista Purkyně University and Charles University, Ústí nad Labem and Prague, 2005, pp. 51.
ISBN: 80-7044-674-9
Presented at: International Conference EA-SDI 2005, 26.9.-27.9.2005,
Czech Republic.
During the implementation of an environmental accounting system in a company, one of the most important pieces of information to obtain is a detailed understanding about material flows (raw materials, semi-finished products, final products and wastes) and flows of different types of energy inputs (buying, selling and wastage) when thinking about the consequences on the company. The authors, in the article: 1) study the question of the quantification of the flows, and the accuracy of their measurement, 2) provide an environmental accounting statement, with help of standards and indices, statistical trends analysis.
Neruda Roman, Krušina Pavel
Estimating and Measuring Performance of Computational Agents
In: Proceedings of the 2005 IEEE/WIC/ACM International Conference on Intelligent Agent technology IAT 2005, IEEE Computer Society Press, 2005, pp. 615-618.
ISBN: 0-7695-2416-8
Presented at: 2005 IEEE/WIC/ACM International Conference on Intelligent Agent technology IAT 2005, 19.9.-22.9.2005,
France.
We study and design multi-agent systems for computational intelligence modeling. Agents typically reside in a high-performance parallel environment, such as a cluster of computational nodes, and utilize a non-blocking asynchronous communication. The need of accurate predictions of run-time and other characterizations of complex parallel asynchronous processes bring us to design a new parallel model creation methodology. In this article our approach is briefly described and a test case is shown and discussed.
Neruda Roman
Cooperation of Computational Intelligence Agents
In: Collaborative Technologies and Systems, (Ed. Smari W.), IEEE Computer Society, Los Alamitos, 2006, pp. 256-263.
ISBN: 0-9785699-0-3
Presented at: International Sympoium on Collaborative Technologies and Systems, 14.5.-17.5.2006, Las Vegas,
US.
In this paper, a multi agent platform targeted toward the area of computational intelligence modeling is presented. We show the design of various computational agents creating multi agent systems, as well as the infrastructure capabilities. The focus of the system is the cooperation of agents, their interchangeability autonomous behavior, and emergence of new models. Two main areas of cooperation are presented: automated creation of a multi-agent system satisfying given constraints, and decision support for agent partner selection. It is demonstrated that such a system is able to assist in building hybrid artificial intelligence models based on data in a distributed environment.
Neruda Roman, Beuster Gerd
Description and Generation of Computational Agents
In: Proceedings of the First International Conference on Knowledge Science, Engineering and Management (KSEM`06), Lecture Notes in Artificial Intelligence 4092, 2006, pp. 318-329.
ISBN: 3-540-37033-8
Presented at: First International Conference on Knowledge Science, Engineering and Management (KSEM`06), 5.8.-8.8.2006, Guilin,
China.
Neruda Roman, Slušný Stanislav
Evolutionary Learning of Multi-layer Perceptron Neural Networks
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006, pp. 125-130.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Neruda Roman
Hybrid Methods of Computational Intelligence and Software Agents
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 78-90.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
In this paper we present an approach where a hybrid computational model is represented as a set of communicating agents composing
a multi-agent system. A general concept of representation of connected
groups of agents is introduced and utilized for automatic building of
schemes to solve a given computational task. We propose a combination
of an evolutionary algorithm and a formal logic resolution system which
is able to generate and verify new schemes. Furthermore, the adaptive cooperation support of individual computational agents is described, which
improves their efficiency in time. These features are implemented within
a software system and demonstrated on several examples.
Neruda Roman
Implementation of Ontology Mapping for Computational Agents
In: WSEAS Transactions on Computers Research, Volume: 1, 2006, pp. 58-63.
This paper describes ontological description of computational agents, their properties and abilities. The
goal of the work is to allow for autonomous behavior and semi-automatic composition of agents within a multiagent
system. The system has to be create foundation for the interchangeability of computational components, and
emergence of new models. This paper focuses on ways of representing agents and systems in standard formalisms,
such as description logics, OWL, and Jade.
Neruda Roman, Beuster Gerd
Towards Dynamic Generation of Computational Agents by Means of Logical Descriptions
In: International Workshop on Multi-Agent System Challenges for Ubiquitous and Pervasive Computing, UTBM/LST, Paris, 2007, pp. 17-28.
Presented at: MASUPC`07 International Workshop on Multi-Agent System Challenges for Ubiquitous and Pervasive Computing, 02.-04.05.2007, Paris,
France.
Neruda Roman
Hybrid Evolutionary Algorithm for Multilayer Perceptron Networks with Competetive Performance
In: Evolutionary Computation, IEEE, Los Alamitos, 2007, pp. 1620-1627.
Presented at: CEC 2007, Congress on Evolutionary Computation, 25.-28.09.2007, Singapore,
SG.
Neruda Roman, Beuster Gerd
Toward Dynamic Generation of Computational Agents by Means of Logical Descriptions
In: International Transactions on Systems Science and Applications, Volume: 4, No: 1, 2008.
A formalism for the logical description of computational
agents and multi-agent systems is given. It is explained
how it such a formal description can be used to configure
and reason about multi-agent systems realizing computational
intelligence models. A usage within a real software system
Bang 3 is demonstrated. A way to extend the system toward
dynamic environments with migrating agents is discussed.
Neruda Roman
Ontology-based and Evolutionary Search for Computational Agents Schemes
In: Proceedings of The Twentieth International Conference on Software Engineering and Knowledge Engineering SEKE 2008, Knowledge Systems Institute, Skokie, 2008.
Presented at: SEKE 2008, 1.-3.7.2008, Redwood City, California,
USA.
This work deals with a problem of automatic composition of
multi-agent system satisfying given constraints. A general concept
of representation of connected groups of agents (schemes)
within a multi-agent system is introduced and utilized for automatic
building of schemes to solve a given computational intelligence
task. We propose a combination of an evolutionary
algorithm and a formal logic resolution system which is able
to propose and verify new schemes. The approach is illustrated
on simple examples.
Nováček Vít, Smrž Pavel
Ontology Acquisition for Automatic Building of Scientific Portals
In: Proceedings of SOFSEM 2006: Theory and Practice of Computer Science, LNCS 3831, Springer-Verlag, Berlin, 2006, pp. 493-500.
ISBN: 3-540-31198-X
Presented at: SOFSEM 2006: Theory and Practice of Computer Science, 21.1.-27.1.2006, Měřín,
Czech Republic.
Ontologies are commonly considered as one of the essential parts of the Semantic Web vision, providing a theoretical basis and implementation framework for conceptual integration and information sharing among various domains. In this paper, we present the main principles of a new ontology acquisition framework applied for semi-automatic generation of scientific portals. Extracted ontological relations play a crucial role in the structuring of the information at the portal pages, automatic classification of the presented documents as well as for personalisation at the presentation level.
Nováček Vít
Motivations of Extensive Incorporation of Uncertainty in OLE Ontologies
In: Proceedings of SOFSEM 2006, Volume: II, ICS AS CR, Prague, 2006, pp. 145-154.
ISBN: 80-903298-4-5
Presented at: SOFSEM 2006: Theory and Practice of Computer Science, 21.1.-27.1.2006, Měřín,
Czech Republic.
Recently, the significance of uncertain information representation has become obvious in the Semantic Web community. This paper presents an ongoing research of uncertainty handling in automatically created ontologies. Proposal of a specific framework is provided. The research is related to OLE (Ontology LEarning), a project aimed at bottom-up generation a nd merging of domain specific ontologies. Formal systems that underlie the uncertai nty representation are briefly introduced. We will discuss a universal internal form at of uncertain conceptual structures in OLE then. The proposed format serves as a basis for inference tasks performed among an ontology. These topics are outlined as motivations of our future work.
Nováček Vít, Smrž Pavel
BOLE - A New Bio-Ontology Learning Platform
In: Proceedings of ECCB`05 Workshop, Workshop on Biomedical Ontologies and Text Processing, 2005.
Presented at: ECCB`05 Workshop, Workshop on Biomedical Ontologies and Text Processing, 28.9.2005, Madrid,
Spain.
This paper presents BOLE — a new platform for bottomup generation and merging of bio-ontologies. In contrast to other ontology-learning systems that are currently available, BOLE can be characterized by the modular architecture enabling integrating and comparing various methods of the automatic acquisition of semantic relations. We introduce the architecture of the tool and discuss the methodology of the employed synthetic bottom-up approach. OLITE — the central component responsible for the automatic acquisition of semantic relations from texts is described in detail. The presented preliminary results prove the efficiency of the implemented framework. We also provide a brief comparative overview of other relevant approaches and outline the future work on representation of uncertain knowledge for bio-ontology merging.
Nováček Vít, Smrž Pavel
OLE - A New Ontology Learning Platform
In: Proceedings of International Workshop on Text Mining Research, Practice and Opportunities, Incoma Ltd., 2005, pp. 12-16.
ISBN: 954-91743-1-X
Presented at: International Workshop on Text Mining Research, Practice and Opportunities, 24.9.2005, Borovets,
Bulgaria.
This paper presents OLE — a new platform for bottom-up generation and merging of ontologies. In contrast to other ontology-learning systems that are currently available, OLE can be characterized by the modular architecture enabling integrating and comparing various methods of the automatic acquisition of semantic relations. We introduce the architecture of the tool and discuss the methodology of the employed synthetic bottom-up approach. OLITE — the central component responsible for the automatic acquisition of semantic relations from texts is described in detail. The presented preliminary results prove the efficiency of the implemented framework. We also provide a brief comparative overview of other relevant approaches and outline the future work on representation of uncertain knowledge for ontology merging.
Nováček Vít, Smrž Pavel
Empirical Merging of Ontologies - A Proposal of Universal Uncertainty Representation Framework
In: The Semantic Web: Research and Applications - Proceedings of ESWC`06 - 3rd European Semantic Web Conference, LNCS 4011, Springer-Verlag, Berlin, 2006, pp. 65-79.
ISBN: 3-540-34544-2
Presented at: ESWC`06 - 3rd European Semantic Web Conference, 11.6.-14.6.2006, Budva,
Montenegro.
The significance of uncertainty representation has become obvious in the Semantic Web community recently. This paper presents our research on uncertainty handling in automatically created ontologies. A new framework for uncertain information processing is proposed. The research is related to OLE (Ontology LEarning) - a project aimed at bottom-up generation and merging of domain-specific ontologies. Formal systems that underlie the uncertainty representation are briefly introduced. We discuss the universal internal format of uncertain conceptual structures in OLE then and offer a utilisation example then. The proposed format serves as a basis for empirical improvement of initial knowledge acquisition methods as well as for general explicit inference tasks.
Nováček Vít, Smrž Pavel, Pomikálek Jan
Text Mining for Semantic Relations as a Support Base of a Scientific Portal Generator
In: Proceedings of LREC 2006 - 5th International Conference on Language Resources and Evaluation, ELRA, Paris, 2006, pp. 1338-1343.
ISBN: 2-9517408-2-4
Presented at: LREC 2006 - 5th International Conference on Language Resources and Evaluation, 24.5.-26.5.2006, Genoa,
Italy.
Current Semantic Web implementation efforts pose a number of challenges. One of the big ones among them is development and evolution of specific resources—the ontologies—as a base for representation of the meaning of the web. This paper deals with the automatic acquisition of semantic relations from the text of scientific publications (journal articles, conference papers, project descriptions, etc.). We also describe the process of building of corresponding ontological resources and their application for semi–automatic generation of scientific portals. Extracted relations and ontologies are crucial for the structuring of the information at the portal pages, automatic classification of the presented documents as well as for personalisation at the presentation level. Besides a general description of the portal generating system, we give also a detailed overview of extraction of semantic relations in the form of a domain–specific ontology. The overview consists of presentation of an architecture of the ontology extraction system, description of methods used for mining of semantic relations and analysis of selected results and examples.
Nováček Vít
Ontology Learning
In: Diploma Thesis, Faculty of Informatics, Masaryk University, Brno, 2006, pp. 1-65.
Ontology learning is one of the essential topics in the scope of an important area of current computer science and artificial intelligence - the upcoming Semantic Web. As the Semantic Web idea comprises semantically annotated descendant of the current world wide web and related tools and resources, the need of vast and reliable knowledge repositories is obvious. Ontologies present well defined, straightforward and standardised form of these repositories. There are many possible utilisations of ontologies - from automatic annotation of web resources to domain representation and reasoning tasks. However, the ontology creation process is very expensive, time-consuming and unobjective when performed manually. So a framework for automatic acquisition of ontologies would be very advantageous. In this work we present such a framework called OLE (an acronym for Ontology LEarning) and current results of its application. The main relevant topics, state of the art methods and techniques related to ontology acquisition are discussed as a part of theoretical background for the presentation of the OLE framework and respective results. Moreover, we describe also preliminary results of progressive research in the area of uncertain fuzzy ontology representation that will provide us with natural and reasonable instruments for dealing with inconsistencies in empiric data as well as for reasoning. Main future milestones of the ongoing research are debated as well.
Nováček Vít
Ontology Acquisition Supported by Imprecise Conceptual Refinement - New Results and Reasoning Perspectives
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 91-101.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
The significance of uncertainty representation has become
obvious in the Semantic Web community recently. This paper presents
new results of our research on uncertainty handling in ontologies created
automatically by means of Human Language Technologies. The research
is related to OLE (Ontology LEarning) a project aimed at bottom-up generation and merging of domain-specific ontologies. It utilises a
proposal of expressive fuzzy knowledge representation framework called
ANUIC. We discuss current achievements in taxonomy acquisition and
outline some interesting applications of the framework regarding non-traditional reasoning perspectives.
Nováček Vít
Imprecise Empirical Ontology Refinement: Application to Taxonomy Acquisition
In: Proceedings of ICEIS 2007, Kluwer Academic Publishing, Artificial Intelligence and Decision Support Systems, London, 2007, pp. 8.
(in_print)
Enterprise Information Systems (ICEIS 2007, revised selected papers), Springer, 2007, pp. 8.
(in_print)
Presented at: ICEIS 2007, 12.-16.6.2007, Funchal,
Madeira - Portugal.
Nováček Vít, Laera Loredana, Handschuh Siegfried
Dynamic Integration of Medical Ontologies in Large Scale
In: Proceedings of WWW2007/HCLSDI, ACM Press, New York, 2007, pp. 10.
(in_print)
Nováček Vít, Laera Loredana, Handschuh Siegfried
Aiding the Data Integration in Medicinal Settings by Means of Semantic Technologies
In: Making Semantics Work for Business, Semantic Technology Institutes International Workshop at European Semantic Technology Conference, Vienna, Austria, 2007.
(in_print)
Nováček Vít
A Non-traditional Inference Paradigm for Learned Ontologies
In: Proceedings of ESWC 2007 PhD Symposium, CEUR Workshop proceedings Workshop at ESWC 2007, Innsbruck, 2007.
Nováček Vít, Laera Loredana, Handschuh Siegfried
Semi-automatic Integration of Learned Ontologies into a Collaborative Framework
In: Proceedings of IWOD/ESWC 2007, Springer Verlag, Innsbruck, 2007, pp. 14.
(in_print)
Nováček Vít, Dabrowski Maciej, Kruk Sebastian R.
Extending Community Ontology Using Automatically Generated Suggestions
In: Proceedings of FLAIRS 2007, AAAI Press, Menlo Park, CA, 2007, pp. 6.
(in_print)
Nováček Vít, Handschuh Siegfried, Laera Loredana, Maynard Diana, Voelkel Max
Dynamic Ontology Lifecycle Scenario in Translational Medicine
In: Proceedings of the 5th European Conference of Computational Biology (ECCB 2006) - Book of Abstracts, Oxford University Press, Oxford, 2007, pp. 5.
(in print)
Nováček Vít
Automatic Knowledge Acquisition and Integration Technique: Application to Large Scale Taxonomy Extraction and Document Annotation
In: Proceedings of ICEIS 2007, Kluwer Academic Publishing, Artificial Intelligence and Decision Support Systems, London, 2008, pp. 160-172.
Enterprise Information Systems (ICEIS 2007, revised selected papers), Springer, 2008, pp. 160-172.
Presented at: ICEIS 2007, 12.-16.6.2007, Funchal,
Madeira - Portugal.
Novák David, Zezula Pavel
LOBS: Load Balancing for Similarity Peer-to-Peer Structures
Technical Report: FIMU-RS-2007-04, Faculty of Informatics, Masaryk University, Brno, 2007, 22 p.
Novák David, Zezula Pavel
Indexing the Distance Using Chord: A Distributed Similarity Search Structure
Presented at: 8th International DELOS Workshop on Future Digital Library Management Systems, 29.3.-1.4.2005, Schloss Dagstuhl,
Germany.
The need of search mechanisms based on data content rather then attributes values has recently lead to formation of the metric-based similarity retrieval. The computational complexity of such retrieval and the large volume of processed data call for distributed processing. In this paper, we propose chiDistance, a distributed data structure for similarity search in metric spaces. The structure is based on the idea of a vectorbased index method iDistance which enables to transform the issue of similarity search into the one-dimensional range search problem. A Peerto-Peer system based on the Chord protocol is created to distribute the storage space and to parallelize the execution of similarity queries. In the experiments conducted on our prototype implementation we study the system performance concentrating on several aspects of parallelism of the range search algorithm.
Novák David, Zezula Pavel
M-Chord: A Scalable Distributed Similarity Search Structure
In: InfoScale '06: Proceedings of the 1st international conference on Scalable information systems, ACM Press, New York, NY, USA, 2006, pp. 1-10.
ISBN: 1-59593-428-6
The need for a retrieval based not on the attribute values but on the very data content has recently led to rise of
the metric-based similarity search. The computational complexity
of such a retrieval and large volumes of processed
data call for distributed processing which allows to achieve
scalability. In this paper, we propose M-Chord, a distributed
data structure for metric-based similarity search.
The structure takes advantage of the idea of a vector index
method iDistance in order to transform the issue of similarity
searching into the problem of interval search in one
dimension. The proposed peer-to-peer organization, based
on the Chord protocol, distributes the storage space and
parallelizes the execution of similarity queries. Promising
features of the structure are validated by experiments on the
prototype implementation and two real-life datasets.
Novák David
Image Similarity Search: Theory and Practice
In: Third Doctoral Workshop on Mathematical and Engineering Methods in Computer Science MEMICS 2007, Masaryk University and Technical University of Brno, Brno, 2007, pp. 154-160.
Presented at: MEMICS 2007, 26.10.-28.10.2007, Znojmo,
Czech Republic.
Novák David, Zezula Pavel
LOBS: Load Balancing for Similarity Peer-to-Peer Structures
In: Databases Information Systems and Peer-to-Peer Computing 2007, Springer Verlag, Berlin Heidelberg New York, 2007, pp. 1-8.
Presented at: DBISP2P 2007, 24.9.2007, Vienna,
Austria.
Novák David, Batko Michal, Dohnal Vlastislav, Zezula Pavel
Scaling up the Image Content-based Retrieval
In: Second DELOS Conference 2007 - Working Notes, DELOS Network of Excellence, Pisa, Italy, 2007, pp. 1-10.
Presented at: DELOS Conference 2007, 13-14.2.2007, Pisa,
Italy.
Novák David, Batko Michal, Zezula Pavel
Content-based Image Retrieval on the Web
In: Proceedings of the Poster and Demonstration Paper Track of the 1st Future Internet Symposium (FIS 2008), CEUR Workshop Proceedings, Vienna, 2008, pp. 1-3.
Novák David, Batko Michal, Zezula Pavel
Web-scale System for Image Similarity Search: When the Dreams Are Coming True
In: Proceedings of the Sixth International Workshop on Content-Based Multimedia Indexing (CBMI 2008), IEEE, London, 2008, pp. 446-453.
Obdržálek David
Usage of real-world robotics in Semantic Web
In: Proc. of 5th International Conference on Innovations in Information Technology, IEEE Computer Society Press, 2008.
(in_print)
Presented at: Innovations 2008: 5th International Conference on Innovations in Information Technology, 16.-18.12.2008, Al Ain, United Arab Emirates.
Obdržálek David
Daly by se použít robotické metody i v sémantickém webu?
In: Proc. of Information Technologies - Application and Theory, (Ed. P. Vojtáš), PONT, Seňa, 2008, pp. 87-90.
ISBN: 978-80-969184-9-5
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2008, 22.-26.9.2008, High Tatras,
Slovakia.
Obdržálek David, Kulhánek Jiří
Generating and handling of differential data in DataPile-oriented systems
In: Proceedings of the IASTED International Conference on Databases and Applications (DBA 2006), (Ed. Hamza M. H.), 2006.
ISBN: 0-88986-560-4
Presented at: IASTED International Conference on Databases and Applications (DBA 2006) as part of the 24th IASTED International Multi-Conference on Applied Informatics, 13.2.-15.2.2006, Innsbruck,
Austria.
Basics of the DataPile structure for data handling systems have been theoretically designed and published. During implementation of such system, numerous problems which were not addressed during the theoretical design phase arose. In a real production environment, the applications connected to the DataPile core need special treatment and set important requirements on the data synchronization process. This article concerns with generating of differential data being distributed from the central DataPile storage to individual applications. It is shown that the synchronization part of DataPile-structured system can be implemented and run efficiently despite of the restrictions or limitations these individual applications impose.
Obdržálek David, Kulhánek Jiří
Statická typová kontrola XSLT programů
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 393-401.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
Obdržálek David, Benda J.
GFE - Graphical Finite State Machine Editor for Parallel Execution
In: ICEC 2007, (Ed. Ma L., Nakatsu R., Rauterberg M.), LNCS 4740, Springer, IFIP, 2007, pp. 401-406.
Presented at: ICEC 2007 - International Conference on Entertainment Computing, 20.-23.06.2005, Shanghai,
China.
Ondrejička Matúš, Pokorný Jaroslav
Extending Fagin's algorithm for more users based on multidimensional B-tree
In: Proc. of 12th Advances in Databases and Information Systems, LNCS 5207, Springer-Verlag, Berlin, 2008, pp. 199-214.
ISBN: 978-3-540-85712-9
Presented at: ADBIS 2008: 12th Advances in Databases and Information Systems, 5.-9.9.2008, Pori, Finland.
Pala Karel
The Balkanet Experience
In: Proceedings of the GLDV (German Linguistische Daten Vorarbeitung) Conference, Bonn, 2005.
Presented at: GLDV (German Linguistische Daten Vorarbeitung) Conference, 30.3.-1.4.2005, Bonn,
Germany.
This paper describes the exhaustive results obtained within IST 290388 Project Balkanet, which went on 2001-2004. The attention is paid to the restructuring and final shaping the individual Balkan WordNets. In comparison with the EuroWordNet Project some new results have been obtained: The sets of Base Concepts have been extended and a set of the Balkanet
1. Common Synsets has been introduced (8,000 synsets). These were relinked to Princeton WordNet 2.0 (PWN) and converted to XML standard format,
2. The language specific synsets that do not have translation equivalents in PWN 2.0 have been established for Balkanet languages,
3. Valency frames have been developed for Czech, Bulgarian and Romanian,
4. Domains have been added to Balkanet WordNets and implemented in the VisDic browser,
5. Integrating derivational relations into Czech WordNet and adding semantic relations into Turkish WordNet exploiting Turkish derivational morphology,
6. Links to the SUMO/MILO Ontology were added to and implemented in VisDic.
Pala Karel
Word Sketches and Semantic Roles
In: Proceedings of Corpus Linguistic Conference 2000, Saint-Petersburg State University, 2006.
ISBN: 5-288-04181-4
Pala Karel, Horák Aleš, Rambousek Adam, Vetulani Zygmunt, Konieczka Paweł, Marciniak Jacek, Obrębski Tomasz, Rzepecki Przemysław, Walkowska Justyna
DEB Platform tools for effective development of WordNets in application to PolNet
In: Proceedings of 3rd Language & Technology Conference, Fundacja Uniwersytetu im. A. Mickiewicza, Poznań, 2007, pp. 514-518.
Presented at: LTC`07, 5.-7.2007, Poznań,
Poland.
Petricek V., Escher T., Cox I. J., Margetts H.
The Web Structure of E-Government - Developing a Methodology for Quantitative Evaluation
In: Proceedings of the 15th International Conference on World Wide Web WWW 2006, ACM Press, New York, 2006, pp. 669-678.
Presented at: International Conference on World Wide Web WWW 2006, 23.12.-26.12.2006, Edinburgh,
UK.
Petrů Lukáš, Wiedermann Jiří
A Model of an Amorphous Computer and its Communication Protocol
Technical Report: V-970, ICS AS CR, Prague, 2006, 8 p.
We design a formal model of an amorphous computer suitable for theoretical investigation of its computational properties. The model consists of a finite set of nodes created by RAMs with restricted memory,
which are dispersed uniformly in a given area. Within a limited radius the nodes can communicate with their
neighbors via a single-channel radio. The assumptions on low-level communication abilities are among the
weakest possible: the nodes work asynchronously, there is no broadcasting collision detection mechanism
and no network addresses. For the underlying network we design a randomized communication protocol
and analyze its efficiency. The subsequent experiments and combinatorial analysis of random networks
show that the expectations under which our protocol was designed are met by the vast majority of the
instances of our amorphous computer model.
Petrů Lukáš, Wiedermann Jiří
A Model of an Amorphous Computer and its Communication Protocol
In: SOFSEM 2007, LNCS 4362, Springer, Berlin, 2007.
Presented at: SOFSEM 2007, 20.2.-26.2.2007, Harrachov,
Czech Republic.
We design a formal model of an amorphous computer suit-
able for theoretical investigation of its computational properties. The
model consists of a ¯nite set of nodes created by RAMs with restricted
memory, which are dispersed uniformly in a given area. Within a limited
radius the nodes can communicate with their neighbors via a single-
channel radio. The assumptions on low-level communication abilities are
among the weakest possible: the nodes work asynchronously, there is no
broadcasting collision detection mechanism and no network addresses.
For the underlying network we design a randomized communication pro-
tocol and analyze its e±ciency. The subsequent experiments and combi-
natorial analysis of random networks show that the expectations under
which our protocol was designed are met by the vast majority of the
instances of our amorphous computer model.
Petrů Lukáš, Wiedermann Jiří
On the universal computing power of amorphous computing systems
In: Theory of Computing Systems, Springer, 2008.
(in_print)
Amorphous computing differs from the classical ideas about
computations almost in every aspect. The architecture of amorphous
computers is random, since they consist of a plethora of identical computational units spread randomly over a given area. Within a limited radius
the units can communicate wirelessly with their neighbors via a single-channel radio.We consider a model whose assumptions on the underlying
computing and communication abilities are among the weakest possible:
all computational units are finite state probabilistic automata working
asynchronously, there is no broadcasting collision detection mechanism
and no network addresses. We show that under reasonable probabilistic
assumptions such amorphous computing systems can possess universal
computing power with a high probability. The underlying theory makes
use of properties of random graphs and that of probabilistic analysis of
algorithms. To the best of our knowledge this is the first result showing
the universality of such computing systems.
Podzimek Michal, Dokulil Jiří, Yaghob Jakub, Zavoral Filip
Mám hlad: pomůže mi Sémantický web?
In: Proc. of Information Technologies - Application and Theory, (Ed. P. Vojtáš), PONT, Seňa, 2008, pp. 91-94.
ISBN: 978-80-969184-9-5
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2008, 22.-26.9.2008, High Tatras,
Slovakia.
Pokorný Jaroslav, Vávra Jan, Snášel Václav
A Renewed Matrix Model for XML Data
In: Proc. of 8th International Conference on Intelligent Systems Design and Applications, IEEE Computer Society, 2008.
(in_print)
Presented at: ISDA 2008: 8th International Conference on Intelligent Systems Design and Applications, 25.-28.11.2008, Kaohsiung, Taiwan.
Pokorný Jaroslav, Richta Karel, Valenta Michal
Cellstore: Educational and Experimental XML-Native DBMS
In: The Inter-Networked World: ISD Theory, Practice, and Education, (Ed. Barry C., Lang M., Wojtkowski W., Wojtkowski G., Wrycza S., Zupancic J.), Springer-Verlag, New York, 2008.
ISBN: 978-0387304038
Pokorný Jaroslav
Digitální knihovny v prostředí Sémantického webu
In: Sborník z 10. ročníku semináře AKP 2005 (automatizace knihovnických procesů - 10.), (Ed. D. Tkačíková, B. Ramajzlová), VIC ČVUT, 2005, pp. 64-73.
Presented at: AKP 2005 (Automatizace knihovnických procesů) 10. ročník semináře, 3.5.-4.5.2005, Liberec,
Czech Republic.
Digitální knihovny (DK) přispívají k rozvoji Sémantického webu a současně mohou využívat jeho technologické prvky. Lze tak docílit kvalitnějšího řízení dat v DK a snazší integrace více DK, jakož i a zvýšení možnosti interakce s dalšími informačními zdroji. Ideou, stojící za Sémantickým webem, je rozšířit webové stránky značkováním, které podchytí alespoň část významu obsahu stránky. Toto sémantické značkování znamená přidání jistých metadat, která poskytují formální sémantiku obsahu webu. Projekty Sémantického webu vycházejí z technologií, které jsou vyvíjeny jako standardy. Patří sem jazyky XML, XML Schema, RDF a RDF Schema. Tyto jazyky slouží pro zápis metadat, z nichž část se organizuje v ontologiích. Další úroveň Sémantického webu využívá jazyky logiky. Základ zpracování v takto pojatém webu poskytují programy - softwaroví agenti. Cílem článku je uvést do technologií Sémantického webu a ukázat jejich uplatnění při vytváření DK
Pokorný Jaroslav
Směrem k Sémantickému Webu
In: Sborník příspěvků 20. ročníku konference Moderní databáze, KOMIX, Roudnice nad Labem, 2005, pp. 15-24.
Presented at: 20. ročník konference Moderní databáze, 26.5.-27.5.2005, Hotel Amber, Roudnice nad Labem,
Czech Republic.
Současné webové vyhledavače založené na technikách vyhledávání informací v textech nejsou schopny využít sémantické znalosti uvnitř webové stránky a tedy nemohou dát uspokojující odpovědi na uživatelské dotazy. Možným řešením se zdá být tzv. Sémantický web, který koncem 90. let popsal ve své vizi Tim Berners-Lee. Ideou, stojící za Sémantickým webem, je rozšířit webové stránky značkováním, které podchytí alespoň část významu obsahu stránky. Toto sémantické značkování znamená přidání jistých metadat, která poskytují formální sémantiku obsahu webu. Projekty Sémantického webu vycházejí z několika technologií, z nichž ty základní jsou již standardizovány nebo alespoň doporučovány. Patří sem jazyky XML, XML Schema, RDF a RDF Schema. Tyto jazyky slouží pro zápis metadat, z nichž některá se organizují v tzv. ontologiích. Další úroveň Sémantického webu využívá jazyky logiky. Základ zpracování v takto pojatém webu poskytují softwaroví agenti, tj. programy, které pracují autonomně a proaktivně. Cílem článku je uvést do technologií podporujících vytváření Sémantického webu, ukázat jeho architekturu a zmínit některé již rozpracované projekty směřující k vytváření inteligentních webových informačních služeb, personalizovaných webových míst a sémanticky zesílených vyhledávacích strojů.
Pokorný Jaroslav, Smižanský J.
Page Content Rank: an Approach to the Web Content Mining
In: Proceedings of IADIS International Conference Applied Computing, Volume: 1, IADIS Press, 2005, pp. 289-296.
ISBN: 3-540-31198-X
Presented at: IADIS International Conference Applied Computing, 22.2.-25.2. 2005, Algavre,
Portugal.
Methods of web data mining can be divided into several categories according to a kind of mined information and goals that particular categories set: Web structure mining (WSM), Web usage mining (WUM), and Web Content Mining (WCM). The objective of this paper is to propose a new WCM method of a page relevance ranking based on the page content exploration. The method, we call it Page Content Rank (PCR) in the paper, combines a number of heuristics that seem to be important for analysing the content of Web pages. The page importance is determined on the base of the importance of terms which the page contains. The importance of a term is specified with respect to a given query q and it is based on its statistical and linguistic features. As a source set of pages for mining we use a set of pages responded by a search engine to the query q. PCR uses a neural network as its inner classification structure. We describe an implementation of the proposed method and a comparison of its results with the other existing classification system – PageRank algorithm.
Pokorný Jaroslav
Database architectures: current trends and their relationships to environmental data management
In: Proceedings of the 19th Conference EnviroInfo, Masaryk University, Brno, 2005, pp. 24-28.
Presented at: 19th Conference EnviroInfo (Informatics for Environmental Protection, Networking Environmental Information), 7.9.-9.9.2005, Brno,
Czech Republic.
Ever increasing environmental demands from customers, authorities and governmental organizations as well as new business control functions are integrated to environmental management systems (EMSs). With a production of huge data sets and their processing in real-time applications, the needs for environmental data management have grown significantly. Current trends in database development and an associated research meet these challenges. The paper discusses recent advances in database technologies and attempts to highlight them with respect to requirements of EMSs.
Pokorný Jaroslav, Reschke J.
Exporting relational data into a native XML store
Pokorný Jaroslav, Reschke J.
Exporting relational data into a native XML store
In: Advances in Information Systems Development - Bridging the Gap between Academia and Industry, (Ed. A.G. Nilsson et al), Volume: 2, Springer Verlag, 2006, pp. 807-818.
ISBN: 0-387-30834-2
Pokorný Jaroslav
Databázové architektury: současné trendy a jejich vztah k novým požadavkům praxe
In: Sborník příspěvků 20. ročníku konference Moderní databáze, KOMIX, 2006, pp. 5-14.
ISBN: 80-239-7109-3
Presented at: Moderní databáze, 30.5.-31.5.2006, Zvánovice, Czech Republic.
Pokorný Jaroslav
Database architectures: current trends and their relationships to environmental data management
In: Environmental Modelling & Software, Volume: 21, No: 11, Elsevier Science, 2006, pp. 1579-1586.
Pokorný Jaroslav
Database Architectures: Current Trends and Their Relationships to Requirements of Practice
In: Proceedings of Information Systems Development ’06 Conference, Budapest, 2006.
Presented at: ISD’ 06 Conference, 31.8.-2.9.2006, Budapest,
Hungary.
Pokorný Jaroslav
Zpracování proudů dat
In: Proceedings of the Annual Database Conference DATAKON 2006, Masaryk University, Brno, 2006, pp. 61-76.
Presented at: DATAKON 2006, 20.10.-23.10.2006, Brno,
Czech Republic.
Pomikálek Jan, Řehůřek R.
The Influence of Preprocessing Parameters on Text Categorization
In: International Conference on Computer, Information and Systems Science and Engineering, Springer, 2007.
(in_print)
Presented at: XIX International Conference on Computer, Information and Systems Science and Engineering, 29.1.-31.1.2007, Bangkok,
Thailand.
Pomikálek Jan, Rychlý Pavel
Detecting Co-Derivative Documents in Large Text Collections
In: Proceedings of the Sixth International Language Resources and Evaluation (LREC'08), European Language Resources Association (ELRA), 2008, pp. 132-135.
Presented at: International Conference on Language Resources and Evaluation, , Marrakech, Morocco.
Rychlý Pavel, Smrž Pavel
Manatee, Bonito and Word Sketches for Czech
In: Proceedings of the Second International Conference on Corpus Linguistics (Corpora-2004), Saint-Petersburg State University Press, Saint-Petersburg, Russia, 2004, pp. 124-132.
ISBN: 5-288-03531-8
Presented at: Second International Conference on Corpus Linguistics (Corpora-2004), , Saint-Petersburg,
Russia.
This paper deals with a newly designed and developed system Manatee that can be employed to manage corpora, especially extremely large ones with billions of words, and enables the efficient evaluation of complex queries and the computation of advanced statistics. The main functions of the tool are presented here, together with the introduction of its web-based graphical user interface, Bonito. The sophisticated statistical processing is demonstrated in an example of computing of Word Sketches. Special attention is paid to the definition of the word sketches for Czech and problems connected to its free word order.
Rychlý Pavel, Kovář Vojtěch
Displaying Bidirectional Text Concordances in KWIC format
In: Proceedings of 5th Biennial Conference of the Asian Association for Lexicography, University of Madras, Chennai, India, 2007, pp. 96-100.
Presented at: Asialex 2007, 6.-8.12.2007, Chennai,
India.
Rychlý Pavel
Manatee/Bonito - A Modular Corpus Manager
In: RASLAN 2007: Recent Advances in Slavonic Natural Language Processing, Masaryk University, Brno, 2007, pp. 97-102.
Rychlý Pavel, Kilgarriff A.
An Efficient Algorithm for Building a distributed Thesaurus (and other Sketch Engine Development)
In: Association for Computational Linguistics, Proceedings of the ACL 2007 Demo and Poster Sessions, Prague, 2007, pp. 41-44.
Presented at: ACL 2007, 23.-30.6.2007, Prague,
Czech Republic.
Řezanková Hana, Húsek Dušan, Snášel Václav
Clustering as a Tool for Data Mining
In: Applications of Mathematics and Statistics in Economy, Professional Publishing, Prague, 2004, pp. 203-208.
ISBN: 80-86419-77-0
Presented at: 7th International Scientific Conference, 2.-3.9.2004, České Budějovice,
Czech Republic.
Řimnáč Martin, Tyl Pavel
Kombinace metod pro srovnání ontologií
In: Proc. of Information Technologies - Application and Theory, (Ed. P. Vojtáš), PONT, Seňa, 2008, pp. 113-117.
ISBN: 978-80-969184-8-5
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2008, 22.-26.9.2008, High Tatras,
Slovakia.
Zatímco dílčí ontologie pokrývají jeden pohled na úzce vymezenou oblast, mnohé aplikace vyžadují obecnější přístup k popisovaným datům. Z tohoto důvodu se přistupuje ke srovnávání ontologií (Ontology Matching), které, pokud je to možné, transformuje několik různých ontologických popisů do jediného.
Příspěvek popisuje případovou studii takového procesu za využití různých metod, srovnává jejich úspěšnost a diskutuje možnost využití dílčích výsledků k definici výsledné ontologie. Pro experiment byly nezávisle vytvořeny dvě triviální ontologie, které byly různými nástroji a metodami integrovány do jedné.
Řimnáč Martin
Web Integration Tool: Data Structure Modelling
In: Proceedings of the 2005 International Conference on Data Mining, CSREA Press, 2005.
ISBN: 1-932415-79-3
Presented at: DMIN`05 -International Conference on Data Mining, 20.-23.06.2005, Las Vegas,
USA.
The paper describes a method for relational data model estimation from input web data and usage of this method. It includes also its principal limitations and shows the model usage for a more effective storage into a repository. The repository is implemented as the universal relation. The properties of the model are described as well.
Řimnáč Martin
Rekonstrukce databázového modelu na základě dat (studie proveditelnosti)
In: Doktorandský den `04, MATFYZPRESS, 2004, pp. 113-120.
ISBN: 80-86732-30-4
Presented at: Institute of Computer Science Ph.D. Student`s Days 04, 29.09.-01.10.2004, Paseky nad Jizerou,
Czech Republic.
Příspěvek popisuje provedenou studii proveditelnosti databázově orientované části systému zajišťujícím automatickou extrakci dat z webových zdrojů (formáty XHTML, XML, CSV). Úkolem této části je transformace dat do automaticky vygenerovaného relačního modelu, který může být následně užit pro realizaci myšlenek sémantického webu. V úvodní části je uvedena motivace pro implementaci takového nástroje. Součástí příspěvku je i částečné ohlédnutí za již implementovanými metodami, které autor v současné době zpracovává. V poslední části je nastíněna fuzzyfikace problematiky.
Řimnáč Martin
Rekonstrukce databázového modelu na základě nepřesných dat
Presented at: ITAT 2004, Workshop on Information Technologies - Applications and Theory, 15.9.-19.9.2004, High Tatra,
Slovakia.
Příspěvek popisuje provedenou studii proveditelnosti databázově orientované části systému zajišťujícím automatickou extrakci dat z webových zdrojů (formáty XHTML, XML, CSV). Úkolem této části je transformace dat do automaticky vygenerovaného relačního modelu, který může být následně užit pro realizaci myšlenek sémantického webu. V úvodní části je uvedena motivace pro implementaci takového nástroje. Součástí příspěvku je i částečné ohlédnutí za již implementovanými metodami, které autor v současné době zpracovává. V poslední části je nastíněna fuzzyfikace problematiky.
Řimnáč Martin
Transforming Current Web Sources for Semantic Web Usage
In: Proceedings of SOFSEM 2006, Volume: II, ICS AS CR, Prague, 2006, pp. 155-165.
ISBN: 80-903298-4-5
Presented at: SOFSEM 2006: Theory and Practice of Computer Science, 21.1.-27.1.2006, Měřín,
Czech Republic.
The paper proposes a data structure modelling method, which aim is to estimate a structure model from a given input data set. The model can be seen as an estimate of data semantics ֠the obtained relations can be transformed into an RDF or OWL semantic web format documents to be included into the semantic web portfolio. The proposed method makes a connection between current web sources and the semantic web vision to be realized. Finally, the method usage and conversion rules are illustrated on an example.
Řimnáč Martin
Odhadování struktury dat pomocí pravidlových systémů
In: Doktorandský den 05, (Ed. Hakl F.), MATFYZPRESS, Prague, 2005, pp. 124-133.
ISBN: 80-86732-56-8
Presented at: Institute of Computer Science Ph.D. Student`s Days 05, 5.10.-7.10.2005, Nový Dvůr,
Czech Republic.
Metoda odhadování struktury dat spojuje vizi sémantického webu a dnešní webové datové zdroje, které převážně neobsahují žádnou doprovodnou sémantiku prezentovaných informací. Aby bylo možné tyto zdroje použít pokročilými nástroji sémantického webu, je potřeba sémantiku prezentovaných dat alespoň odhadnout. Příspěvek popisuje takovou metodu, ukazuje její použití pro úlohy induktivního logického programování a jmenuje výhody použití pravidlových systémů pro její implementaci.
Řimnáč Martin
Odhad struktury dat a induktivní logické programování
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 124-133.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
Odhadování struktury dat je jednou z možností, jak automatizovaným způsobem interpretovat data. Ta mohou být popsána pomocí modelu funkčních závislostí, vytváření takového modelu lze srovnat s některými technikami strojového učení. Tento příspěvek shrnuje vybrané základní techniky induktivního logického programování a analyzuje je z pohledu metody odhadování struktury dat. Ukazuje se, že techniky induktivního logického programování lze v některých případech převést právě odhadování struktury dat.
Řimnáč Martin
Odhadování struktury a asociativní úložiště dat
In: Doktorandský den 06, (Ed. F. Hakl), MATFYZPRESS, 2006, pp. 135-142.
ISBN: 80-86732-87-8
Presented at: Doktorandský den 06, 20.9.-22.9.2006, Monínec, Sedlec-Prčice,
Czech Republic.
Odhad struktury dat získaných například z webových zdrojů lze využít jednak pro uložení dat, tak
pro netriviální dotazování nad těmito daty. Článek rozšiřuje metodu odhadu struktury dat získávající
odpovídající schéma relačního modelu ze vstupních dat a popisuje metodu uložení dat pomocí
jednoduchého asociativního úložiště dat právě na základě odhadnutého modelu. Článek diskutuje
dvě možné implementace úložiště: první uchovávající data jako instance funkčních závislostí, druhou
uchovávající pouze instance funkčních závislosti mezi jednoduchými atributy rozšířenou o podporu komplexních atributů pomocí metainformace.
Řimnáč Martin
Asociativní úložiště dat v prostředí sémantického webu
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 102-109.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
Použití asociativního úložiště dat je jednou z možností, jak
efektivním způsobem reprezentovat data. Článek se zabývá z převážné
části metodou učení takového úložiště, přičemž využívá myšlenek vize
sémantického webu. Dále ukazuje souvislosti této metody s teoriemi organizace
paměti živých organismů včetně člověka a její učení bez snahy
tyto procesy zpětně modelovat. Jelikož se nabízí možnost využít současných webových stránek jako vstupních dat, je učící algoritmus navržen
inkrementálnì a výhody použití takového adaptivního přístupu jsou detailně popsány. Výsledkem algoritmu je asociativní úložiště navržené na
základě všech dostupných (meta)informací, na které je možné pohlížet
jako na extensionální úrovni odhadnutou sémantiku dat.
Řimnáč Martin
Data Structure Estimation for RDF Oriented Repository Building (Extended Abstract)
In: Frontiers in Mobile and Web Computing, (Ed. Baroli L., Abderazek B.A., Grill T., Nguyen T.M., Tjondronegoro D.), Österreichische Computer Gesselschaft, Wien, 2006, pp. 681-685.
ISBN: 3-85403-216-1
Presented at: The Fourth International Conference on Advances in Mobile Computing & Multimedia (MoMM2006), 4.12.-6.12.2006, Yogyakarta,
Indonesia.
Mechanisms for accessing and training the data repository using a binary matrix formalism
are presented. The repository is designed for a data storage through corresponding instances of
simple attribute functional dependencies, which can be seen as similar to the binary predicate
formalism being used by the RDF semantic web format.
Two mechanisms for querying a repository, the generalisation and the specialisation, are given.
Furthermore, the incremental repository training mechanism with no extra requirements on
the input data form is described: The extensional functional dependency system is used as a
generalised view on the stored data; the algorithm is inspired by the functional dependency
discovery approach.
Řimnáč Martin
Advanced Features of Attribute Annotated Data Sets
In: WETDAP 2007, Proceedings of the 1st Workshop Evolutionary Techniques in Data-processing, In Conjunction with Znalosti (Knowledge) 2007, Faculty of Electrical Engineering and Computer Science, VŠB - Technical University of Ostrava, Ostrava, 2007, pp. 54-59.
Presented at: Workshop Evolutionary Techniques in Data-processing, Associated with ZNALOSTI 2007 conference
, 21.-23.2.2007, Ostrava,
Czech Republic.
The paper compares features of learning and querying process
in the situation, when values in the input data set are annotated by
attributes or this information is not available. The attribute annotation
enables to consider global relationships, which are useful to express the
data semantics in a explicit way. It will be shown data can be accessed
with no semantic interpretation and then, after the evaluation process,
the result can be interpreted.
Řimnáč Martin
Minimalising Binary Predicate Knowledge Base using Transitivity Rule in Incremental Algorithm
Presented as an invited talk: 22nd European Conference on Operational Research EURO 2007
, 8.-11.7.2007, Prague,
Czech Republic.
Machine learning methods can be seen as an optimalisation task reducing differences
between an expected and returned result on a given data set. A corresponding
knowledge base can be expressed in many ways, for example, by a binary predicate
formalism.
The talk deals with a minimalisation of predicate ammount in such a repository,
which is enabled by a transitivity. The transitive reduction algorithm will be
detaily given for an incremental (attribute annotated data driven) building of a
knowledge base; a base model with higher expressiveness will be prefered.
Finally, an effect of the selected model to estimated explicit semantic definitions
of symbols (internal base interpretation) will be mentioned as well.
Řimnáč Martin
Nevyužité možnosti sémantického webu
In: Doktorandské dny '08, (Ed. F. Hakl), MATFYZPRESS, 2008, pp. 106-111.
ISBN: 978-80-7378-054-8
Presented at: Doktorandské dny 2008, 29.9.-1.10.2008, Jizerka,
Czech Republic.
Vize sémantického webu byla představena před skoro již 10 lety, avšak žádná z její aplikací prozatím nedokázala oslovit takové množství lidí, jaké dnes používá web v současné podobě. Příspěvek se věnuje možnostem sémantického webu a přínosům, které může přinést pro koncové uživatele. Nejprve podává přehled o současných technologiích i jejich použití a následně diskutuje možnosti plynoucí z použití odkazů v prostředí sémantického webu tak, jak je známe z webu současného, tedy rozšiřující, zpřesňující či udávající kontext
prezentované informace.
Řimnáč Martin
Redukce datových modelů
In: Doktorandský den 07, (Ed. F. Hakl), MATFYZPRESS, 2007, pp. 80-86.
Presented at: Doktorandské dny 2007, 17.-19.9.2007, Malá Úpa,
Czech Republic.
Přıspěvek se zabývá aspekty optimalizace paměťových nároků binárního úložiště atributově anotovaných dat
na základě transitivní redukce zobecněného systému funkčních závislostí. Tento systém buď může být předem
daný modelem, v tomto případě se ukazuje, že je možné optimalizaci použít jednorázově; a nebo tento model
je inkremetálním způsobem odhadován a pak se ukazuje vhodným pouze již jednou naoptimalizované úložiště
pouze upravovat opět inkrementálním způsobem. V poslední sekci se příspěvek zaobírá rozborem nejednoznačnosti
výsledku včetně detailního rozboru vlastností základních konfigurací částí modelu způsobující tuto nejednoznačnost.
V neposlední řadě je analyzována složitost dílčích operací v úložišti.
Řimnáč Martin, Linková Zdeňka
Automatizovaný návrh pravidel pro integraci dat
Řimnáč Martin, Špánek Roman, Linková Zdeňka
Sémantický web: vize globálního úložiště dat?
In: DATAKON 2007, (Ed. Popelínský L., Výborný O.), Masaryk university, 2007, pp. 176-186.
Presented at: DATAKON 2007, 20.10.-23.10.2007, Brno,
Czech Republic.
Cílem příspěvku je předložit vizi nových přístupů pro sdílení a vyhledávání dat na internetu. Opírá se o prověřené technologie pracující nad textovými webovými dokumenty a propojuje je se sémantickým webem, moderním prostředkem pro výměnu dat a aktuálními trendy ve vývoji internetu jako celku.
Řimnáč Martin, Špánek Roman, Linková Zdeňka
SemanticWeb: Vision of Distributed and Trusted Data Environment?
In: WWM 2007, 2007, pp. 627-634.
Presented at: WWM 2007, 1st International Web X.0 and Web Mining Workshop, held in collocation with ICDIM 2007, 28.10.-31.10.2007, Lyon,
France.
The vision of the semantic web as a distributed and
trusted environment for data sharing together with related
issues are presented. The paper brings a basic binary
matrix formalism for the internal representation of sources
and shows the clasical issues as a data inconsistency and a
data integration. Aspects of these issues lead to the binary
formalism to be generalised into the <0,1> interval one to
enable the consideration of uncertainty at various level.
Finally, the need of a source trust definition is presented
and discussed with respect to a semantic web.
Řimnáč Martin
Data Structure Estimation for RDF Oriented Repository Building
In: Complex, Intelligent and Software Intensive Systems, (Ed. Barolli L., Tjoa A.), IEEE Computer Society, Los Alamitos, 2007, pp. 147-154.
Presented at: CISIS`07 International Conference on Complex, Intelligent and Software Intensive Systems, 10.-13.04.2007, Vienna, Austria.
Sedmidubský Jan, Bartoň Stanislav, Dohnal Vlastislav, Zezula Pavel
Adaptive Approximate Similarity Searching through Metric Social Networks
Technical Report: FIMU-RS-2007-06, Faculty of Informatics, Masaryk University, Brno, 2007, 22 p.
Exploiting the concepts of social networking represents a novel approach to the approximate
similarity query processing. We present an unstructured and dynamic P2P environment in
which a metric social network is built. Social communities of peers giving similar results
to specific queries are established and such ties are exploited for answering future queries.
Based on the universal law of generalization, a new query forwarding algorithmis introduced
and evaluated. The same principle is used to manage query histories of individual peers with
the possibility to tune the tradeoff between the extent of the history and the level of the queryanswer
approximation. All proposed algorithms are tested on real data and medium-sized
P2P networks consisting of tens of computers.
Sedmidubský Jan, Bartoň Stanislav, Dohnal Vlastislav, Zezula Pavel
Querying Similarity in Metric Social Networks
In: Network-Based Information Systems, First International Conference, NBiS 2007, Springer, Berlin, 2007, pp. 278-287.
Presented at: NBiS 2007, 3.-7.9.2007, Regensburg,
Germany.
In this paper we tackle the issues of exploiting the concepts of social networking in processing similarity queries in the environment of a P2P network. The processed similarity queries are laying the base on which the relationships among peers are created. Consequently, the communities encompassing similar data emerge in the network. The architecture of the presented metric social network is formally defined using the acquaintance and friendship relations. Two version of the navigation algorithm are presented and thoroughly experimentally evaluated. Finally, learning ability of the metric social network is presented and discussed.
Sedmidubský Jan, Bartoň Stanislav, Dohnal Vlastislav, Zezula Pavel
Adaptive Approximate Similarity Searching through Metric Social Networks
In: 24th International Conference on Data Engineering (ICDE 2008), 2008, pp. 3.
Presented at: 24th International Conference on Data Engineering, 7.-12.4.2008, Cancún,
Mexico.
Exploiting the concepts of social networking represents a novel
approach to the approximate similarity query processing. We present a metric
social network where relations between peers, giving similar results, are
established on per-query basis. Based on the universal law of
generalization, a new query forwarding algorithm is proposed. The same
principle is used to manage query histories of individual peers with the
possibility to tune the tradeoff between the extent of the history and the
level of the query-answer approximation. All algorithms are tested on real
data and real network of computers.
Sedmidubský Jan, Bartoň Stanislav, Dohnal Vlastislav
mSN: Metric Social Network for Similarity Searching
SW prototype
This prototype implements the idea of social networking in metric
spaces. The metric social network is a peer-to-peer network in which users
can share their data without the need to send them to a centralized node.
Searching in this system is based on the notion of similarity which is
modelled using metric spaces. The architecture of the metric social network
is formally defined by using acquaintance and friendship relations. The
implementation builds on top of the MESSIF framework library.
Sedmidubský Jan, Dohnal Vlastislav, Bartoň Stanislav, Zezula Pavel
A Self-organized System for Content-based Search in Multimedia.
In: IEEE International Symposium on Multimedia (ISM 2008), Patrick Kellenberger, Los Alamitos, California, 2008.
(in_print)
Šesták Radovan, Lánský Jan, Žemlička M.
Suffix Array for Large Alphabet
In: Proc. of 2008 Data Compression Conference (DCC 2008), IEEE Computer Society Press, 2008, pp. 543-543.
Presented at: DCC 2008 Data Compression Conference, 25.-27.3.2008, Snowbird, Utah,
USA.
Šesták Radovan, Lánský Jan
Compression of Concatenated Web Pages Using XBW
In: SOFSEM 2008: Theory and Practice of Computer Science, LNCS 4910, Springer, 2008, pp. 743-754.
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
XBW [10] is modular program for lossless compression that
enables testing various combinations of algorithms. We obtained best
results with XML parser creating dictionary of syllables or words combined
with Burrows-Wheeler transform - hence the name XBW. The
motivation for creating parser that handles non-valid XML and HTML
files, has been system EGOTHOR [5] for full-text searching. On files of
size approximately 20MB, formed by hundreds of web pages, we achieved
twice the compression ratio of bzip2 while running only twice as long. For
smaller files, XBWhas very good results, compared with other programs,
especially for languages with rich morphology such as Slovak or German.
For any big textual files, our program has good balance of compression
and run time.
Program XBW enables use of parser and coder with any implemented
algorithm for compression.We have implemented Burrows-Wheeler transform
which together with MTF and RLE forms block compression, dictionary
methods LZC and LZSS, and finally statistical method PPM. Coder
offers choice of Huffman and arithmetic coding.
Skopal Tomáš, Pokorný Jaroslav, Snášel Václav
Nearest Neighbours Search using the PM-tree
In: Procedings of The 10th International Conference on Database Systems for Advanced Applications, LNCS 3453, Springer-Verlag, 2005, pp. 803-815.
Presented at: DASFAA 2005, 17.4.-20.4.2005, Beijing,
China.
Skopal Tomáš
On Fast Non-Metric Similarity Search by Metric Access Methods
In: Proceedings of 10th International Conference on Extending Database Technology EDBT 2006, (Ed. Y. Ioannidis et al.), 2006, pp. 718-736.
ISBN: 3-540-32960-9
Presented at: EDBT 2006, 26.3.-31.3.2006, Munich,
Germany.
The retrieval of objects from a multimedia database employs a measure which defines a similarity score for every pair of objects. The measure should effectively follow the nature of similarity, hence, it should not be limited by the triangular inequality, regarded as a restriction in similarity modeling. On the other hand, the retrieval should be as efficient (or fast) as possible. The measure is thus often restricted to a metric, because then the search can be handled by metric access methods (MAMs). In this paper we propose a general method of non-metric search by MAMs. We show the triangular inequality can be enforced for any semimetric (reflexive, non-negative and symmetric measure), resulting in a metric that preserves the original similarity orderings (retrieval effectiveness). We propose the TriGen algorithm for turning any blackbox semimetric into (approximated) metric, just by use of distance distribution in a fraction of the database. The algorithm finds such a metric for which the retrieval efficiency is maximized, considering any MAM.
Skopal Tomáš, Snášel Václav
An Application of LSI and M-tree in Image Retrieval
In: GESTS International Transactions on Computer Science and Engineering, Volume: 34, No: 1, GEST Society, 2006, pp. 212-225.
When dealing with image databases, we often need to solve the problem of how to retrieve a desired set of images effectively and efficiently. As a representation of images, there are commonly used some high-dimensional vectors of extracted features, since in such a way the content-based image retrieval is turned into a geometric-search problem. In this article we present a study of feature extraction from raw image data by means of the LSI method (singular-value decomposition, respectively). Simultaneously, we show how such a kind of feature extraction can be used for efficient and effective similarity retrieval using the M-tree index. Because of the application to image retrieval, we also show some interesting effects of LSI, which are not directly obvious in the area of text retrieval (where LSI came from).
Skopal Tomáš, Hoksza D.
Improving the Performance of M-tree Family by Nearest-Neighbor Graphs
In: Advances in Databases and Information Systems, LNCS 4690, Springer, Berlin, 2007, pp. 172-188.
Presented at: ADBIS 2007, 29.9.-3.10.2007, Varna,
Bulgaria.
The M-tree and its variants have been proved to provide an efficient similarity search in database environments. In order to further improve their performance, in this paper we propose an extension of the M-tree family, which makes use of nearest-neighbor (NN) graphs. Each tree node maintains its own NN-graph, a structure that stores for each node entry a reference (and distance) to its nearest neighbor, considering just entries of the node. The NN-graph can be used to improve filtering of non-relevant subtrees when searching (or inserting new data). The filtering is based on using ”sacrifices” selected entries in the node serving as pivots to all entries being their reverse nearest neighbors (RNNs). We propose several heuristics for sacrifice selection; modified insertion; range and kNN query algorithms. The experiments have shown the M-tree (and variants) enhanced by NN-graphs can perform significantly faster, while keeping the construction cheap.
Skopal Tomáš
Unified Framework for Exact and Approximate Search in Dissimilarity Spaces
In: Transactions on Database Systems (TODS), Volume: 32, No: 4, ACM, 2007, pp. 1-47.
In multimedia systems we usually need to retrieve database (DB) objects based on their similarity
to a query object, while the similarity assessment is provided by a measure which defines a
(dis)similarity score for every pair of DB objects. In most existing applications, the similarity measure
is required to be a metric, where the triangle inequality is utilized to speed up the search
for relevant objects by use of metric access methods (MAMs), for example, the M-tree. A recent
research has shown, however, that nonmetric measures are more appropriate for similarity modeling
due to their robustness and ease to model a made-to-measure similarity. Unfortunately, due to
the lack of triangle inequality, the nonmetric measures cannot be directly utilized by MAMs. From
another point of view, some sophisticated similarity measures could be available in a black-box
nonanalytic form (e.g., as an algorithm or even a hardware device), where no information about
their topological properties is provided, so we have to consider them as nonmetric measures as well.
From yet another point of view, the concept of similarity measuring itself is inherently imprecise
and we often prefer fast but approximate retrieval over an exact but slower one.
To date, the mentioned aspects of similarity retrieval have been solved separately, that is, exact
versus approximate search or metric versus nonmetric search. In this article we introduce a similarity
retrieval framework which incorporates both of the aspects into a single unified model. Based
on the framework, we show that for any dissimilarity measure (either a metric or nonmetric) we
are able to change the “amount” of triangle inequality, and so obtain an approximate or full metric
which can be used for MAM-based retrieval. Due to the varying “amount” of triangle inequality,
the measure is modified in a way suitable for either an exact but slower or an approximate but
faster retrieval. Additionally, we introduce the TriGen algorithm aimed at constructing the desired
modification of any black-box distance automatically, using just a small fraction of the database.
Slušný Stanislav, Vidnerová Petra, Neruda Roman
Behavior Emergence in Autonomous Robot control by Means of Feedforward and Recurrent Neural Networks
In: WCECS 2007, (Ed. Ao S., Douglas C., Grundfest W., Schruben L., Wu X.), IA ENG, LNCS, Hong Kong, 2007, pp. 518-523.
Presented at: WCECS 2007. World Congress on Engineering and Computer Science, 24.-26.10.2007, San Francisco,
USA.
Slušný Stanislav, Neruda Roman, Vidnerová Petra
Rule-based Analysis of Behaviour Learned by Evolutionary and Reinforcement Algorithms
In: Advanced Intelligent Computing Technology and Applications-ICIC2008, LNAI, Springer Verlag, 2008.
Presented at: ICIC 2008, 15.-18.09.2008, Shanghai,
China.
We study behavioural patterns learned by a robotic agent by means of
two different control and adaptive approaches - a radial basis function neural
network trained by evolutionary algorithm, and a traditional reinforcement Q-learning
algorithm. In both cases, a set of rules controlling the agent is derived
from the learned controllers, and these sets are compared. It is shown that both
procedures lead to reasonable and compact, albeit rather different, rule sets.
Smrž Pavel, Povolný Martin, Sinopalníková Anna
OASIS - A New Tool for the Transformation of XML Knowledge Resources into OWL
Presented at: ISWC 2004, 7.11.-11.11. 2004, Hiroshima,
Japan.
This paper presents OASIS – a new tool that enables (semi)automatic conversion of existing knowledge bases, semantic networks, terminological databases and various other resources to complex ontologies into OWL. The tool is implemented as a client of DEB (Dictionary Editor and Browser) which is able to store, index and efficiently retrieve lexical data. The architecture is based on XML and related W3C standards (XSLT, XML Schema, XPath, DOM). The main feature which brings the efficiency of the transformation is the extension of a standard XSLT processor with the ability to obtain additional data from the server through the mechanism of nested queries. This technique allows formulation of complex constraints needed in the conversion to OWL
Snášel Václav, Moravec Pavel, Pokorný Jaroslav
WordNet Ontology Based Model for Web Retrieval
In: Proceedings of International Workshop on Challenges in Web Information Retrieval and Integration (WIRI) 2005, IEEE Computer Society Press, 2005, pp. 231-236.
Presented at: International Workshop on Challenges in Web Information Retrieval and Integration, 8.4.-9.4. 2005, Tokyo,
Japan.
It is well known that ontologies will become a key piece, as they allow making the semantics of Semantic Web content explicit. In spite of the big advantages that the Semantic Web promises, there are still several problems to solve. Those concerning ontologies include their availability, development and evolution. In the area of information retrieval, the dimension of document vectors plays an important role. Firstly, with higher index dimensions the indexing structures suffer from the "curse of dimensionality" and their efficiency rapidly decreases. Secondly, we may not use exact words when looking for a document, thus we miss some relevant documents. LSI is a numerical method, which discovers latent semantics in documents by creating concepts from existing terms. In this paper we present a basic method of mapping LSI concepts on given ontology (WordNet), used both for retrieval recall improvement and dimension reduction. We offer experimental results for this method on a subset of TREC collection, consisting of Los Angeles Times articles.
Snášel Václav, Moravec Pavel, Pokorný Jaroslav
Using SDD for Topic Identification
In: Proc. of 8th International Conference on Intelligent Systems Design and Applications, IEEE Computer Society, 2008.
Presented at: ISDA 2008: 8th International Conference on Intelligent Systems Design and Applications, 25.-28.11.2008, Kaohsiung, Taiwan.
Snášel Václav, Dvorský Jiří, Timofieiev Anton, Pokorný Jaroslav
H-Index Analysis of Enron Corpus
In: Proc. of 8th International Conference on Intelligent Systems Design and Applications, IEEE Computer Society, 2008.
(in_print)
Presented at: ISDA 2008: 8th International Conference on Intelligent Systems Design and Applications, 25.-28.11.2008, Kaohsiung, Taiwan.
Snášel Václav, Moravec Pavel, Pokorný Jaroslav
Using BFA with wordnet ontology based model for web retrieval
In: Proceedings of the First IEEE International Conference on Signal-Image Technology & Internet-Based Systems (SITIS`05), 2005, pp. 254-259.
Presented at: First IEEE International Conference on Signal-Image Technology & Internet-Based Systems (SITIS`05), 27.11.-1.12.2005, Yaoundé,
Cameroon.
In the area of information retrieval, the dimension of document vectors plays an important role. We may need to find a few words or concepts, which characterize the document based on its contents, to overcome the problem of the "curse of dimensionality", which makes indexing of highdimensional data problematic. To do so, we earlier proposed a Wordnet and Wordnet+LSI (Latent Semantic Indexing) based model for dimension reduction. While LSI works on the whole collection, another procedure of feature extraction (and thus dimension reduction) exists, using binary factorization. The procedure is based on the search of attractors in Hopfield-like associative memory. Separation of true attractors (factors) and spurious ones is based on calculation of their Lyapunov function. Being applied to textual data the procedure conducted well and even more it showed sensitivity to the context in which the words were used. In this paper, we suggest that the binary factorization may benefit from the Wordnet filtration.
Snášel Václav, Moravec Pavel, Pokorný Jaroslav
Using BFA with WordNet Based Model for Web Retrieval
In: Journal of Digital Information Management, Volume: 4, No: 2, 2006, pp. 107-111.
Snášel Václav, Moravec Pavel, Húsek Dušan, Frolov A., Řezanková Hana, Polyakov P. Y.
Pattern Discovery for High-Dimensional Binary Datasets
In: ICONIP 2007: International Conference on Neural Information Processing, (Ed. M. Ishikawa, K. Doya, H. Miyamoto, T. Yamakawa), LNCS 4984, Springer, Berlin, 2008, pp. 861-872.
ISBN: 978-3-540-69154-9
Presented at: 14th International Conference on Neural Information Processing, 13.-16.11.2007, Kitakyushu, Japan.
Snášel Václav, Řezanková Hana, Húsek Dušan, Kudělka Miloš, Lehečka Ondřej
Semantic Analysis of Web Pages using Cluster Analysis and Nonnegative matrix Factorization
In: Advances in Intelligent Web Mastering, (Ed. Wegrzyn-Wolska K., Szczepaniak P.), Volume: 43, Springer, Berlin, 2007, pp. 328-336.
ISBN: 978-3-540-72574-9
Presented at: AWIC 2007. Atlantic Web Intelligence Conference (5.), 25.6.-27.6.2007, Fontainbleau,
France.
In this paper, the web pages concerning products sale are analyzed with the aim to create clusters of similar web pages and characterize these by GUI patterns. We applied GD-CLS (gradient descent - constrained least squares) method which combines some of the best features of other methods. Both traditional methods for searching clusters and nonnegative matrix factorization are used.
Sojka Petr
Towards Digital Mathematical Library
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 110-113.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
This paper describes a prototype of the OCR math engine
built in the DML-CZ project. Solution stands on the combination of FineReader and InftyReader programmes. The achieved error rate (counting
not only character errors, but also errors in the recognition of structure
of mathematics notation) decreased from an initial 12% to under 1%.
Sojka Petr, Choi Key-Sun, Fellbaum Christiane, Vossen Piek
Proceedings of the Third International WordNet Conference, GWC 2006
In: Proceedings of the Third International Wordnet Conference, 2006.
Sojka Petr, Kopeček Ivan, Pala Karel
Text, Speech and Dialogue - Proceedings of the 9th International Conference, TSD 2006
In: Text, Speech and Dialogue - Proceedings of the 9th International Conference, TSD 2006, LNCS 4188, Springer-Verlag, Berlin / Heidelberg, 2006.
ISBN: 978-3-540-39090-9
Špánek Roman
Security in Mobile Environment
In: Doktorandský den `04, MATFYZPRESS, 2004, pp. 149-155.
ISBN: 80-86732-30-4
Presented at: Institute of Computer Science Ph.D. Student`s Days 04, 29.09.-01.10.2004, Paseky nad Jizerou,
Czech Republic.
Advances in cellular mobile technology have engendered a new paradigm of computing, called mobile computing. New challenges have arisen and solutions are proposed based on various approaches. One of the most important challenges is security and now a day has been found ubiquitous in computing as whole. The paper is intended as a quick survey emphasizing security paradigm and also ad hoc networks are kept in mind and briefly discussed.
Špánek Roman
RollingBall: Energy and QoS Aware Protocol for Wireless Sensor Networks
In: Proceedings of SOFSEM 2006, Volume: II, ICS AS CR, Prague, 2006, pp. 166-173.
ISBN: 80-903298-4-5
Presented at: SOFSEM 2006: Theory and Practice of Computer Science, 21.1.-27.1.2006, Měřín,
Czech Republic.
In the paper, we present a quality of service and energy aware communication protocol, called RollingBall. We do believe that QoS and energy awareness are two of the most important parameters in wireless sensor networks. The protocol is completely distributed with no centralized control. The key idea is to introduce a resistance calculation for every connection in the network. The resistance reflects the distance to the sink together with energy capabilities of particular sensor. While the resistance is continually re-calculated, packets are sent to the sink via an appropriate path. Such a scheme allows to spend minimum messages on network management, whereby sensor network lifetime is extended and throughput remains high.
Špánek Roman
Sharing information in a Large Network of Users
In: Doktorandský den 05, (Ed. Hakl F.), MATFYZPRESS, Prague, 2005, pp. 134-140.
ISBN: 80-86732-56-8
Presented at: Institute of Computer Science Ph.D. Student`s Days 05, 5.10.-7.10.2005, Nový Dvůr,
Czech Republic.
The paper describes a possible treatment of sharing data in a large network of users. The mathematical model is based on weighted hypergraphs whose nodes and edges denote the users and their relations, respectively. Its flexibility guarantees to have basic relations between users robust under frequent changes in the network connections. Approach copes with the communication/computing issues from different point of view based on a structure evolution and its further optimization in sense of keeping the parallel space and time complexities low. Although the idea is aimed to the field of mobile computing, it can be generalized in straightforward way to other similar environment. An experimental application is also proposed and discussed in the paper.
Špánek Roman
Data pozičně závislá a jejich dopad v mobilních databázích
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005, pp. 273-278.
ISBN: 80-7097-609-8
Presented at: ITAT 2005, 20.9. - 25.9.2005, Račkova dolina,
Slovakia.
The paper describes selected problems and possible solutions for the position management in mobile computing. A proposed scheme extends existing approaches. The main idea is to reduce amount of possible solutions given by a movement prediction algorithm by constrains ubiquitously found in the real-life. Existing solutions and possibilities for a future research are also described.
Špánek Roman
Self-organizing and Self-monitoring Security Model for Dynamic Distributed Environments
In: Diploma Thesis, Technical University of Liberec, Faculty of Mechatronics and Interdisciplinary Engineering Studies, Liberec, 2008, pp. 130 p..
The thesis deals with security hazards in distributed environments where
traditional centralized approaches are only of limited serviceability. One of
the very successful model for treating security and access management in distributed systems are so called reputation systems. The main goal of the rep-
utation systems is to provide entities in the environment with mechanisms for
inferring and building trust consequently used for access control. If the trust
between two entities is high enough, transactions are likely to be allowed.
The thesis proposes a new security model with trust management system
for dynamic and distributed environments with huge number of entities. In
dynamic systems new entities or relationships are likely to emerge or existing
entities or relationships may often disappear. Such dynamics pose severe problems even for traditional reputation systems. Therefore our approach differs
from the traditional ones in the way adopted for establishment and management of trust between entities in our point of view trust is not assigned to
particular relationships but the trust is common for a group of entities. In this
way, our proposal significantly enhances ability to infer trust between entities
with no previous personal experiences with each other or in environments with
huge number of entities.
For the proposal differs in understanding of trust, it uses a hypergraph
model for representation of system of entities. The security model proposed
in the thesis contains two algorithms for transformation of a general input
graph structure into hypergraph model, an algorithm treating dynamics of the
distributed environment and a security subsystem.
Our experimental implementation SecGrid utilizes proposed algorithms and
it is used for experimental verification of the security models. The experiments
investigate ability of the transformation algorithms; in details the dynamic
part of our proposal together with the security subsystem proposed specially
for the hypergraph model. Experiments show that our model overcame the
traditional graph model in many ways especially in dynamic environments
with huge amount of entities.
Špánek Roman
Security Model Based on Virtual Organizations for Distributed Environments
In: Doktorandský den 06, (Ed. F. Hakl), MATFYZPRESS, 2006, pp. 164-171.
ISBN: 80-86732-87-8
Presented at: Doktorandský den 06, 20.9.-22.9.2006, Monínec, Sedlec-Prčice,
Czech Republic.
The paper presents a new approach for treating security issues in various environments with special
emphasis on Mobile databases, Semantic web and Grids. A brief overview on possible security models
and a discussion on their advantages and disadvantages is given. Our model based on virtual organization
and is build up on mathematical background based on hypergraphs. We show that hypergraphs are the
way how to reduce space complexity of the model. The complexity is important with respect to target
environments where number of users might be huge. To verify our model an experimental implementation
was programed and some graphical outputs are mentioned.
Špánek Roman, Tůma Miroslav
Sdílení dat v prostředí s nehomogenními skupinami uživatelů
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Špánek Roman
Security, Privacy and Trust in (Semantic)Web
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 114-122.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
This paper gives a short overview on security issues widely
found in the Semantic Web environment. It goes through each level of
the proposed Semantic Web layers and discusses security, privacy and
trust for each. Then, a list of possible solutions is given. In particular
XML security, RDF security, secure information integration and trust on
the Semantic Web are mentioned and short discussion is given. Finally,
an approach for treating security and trust based on Virtual organization
is described and its advantages are provided.
Špánek Roman
Secure Grid-based Computing with Social-Network Based Trust Management in the (Semantic) Web
In: Frontiers in Mobile and Web Computing, (Ed. Baroli L., Abderazek B.A., Grill T., Nguyen T.M., Tjondronegoro D.), Österreichische Computer Gesselschaft, Wien, 2006, pp. 663-667.
ISBN: 3-85403-216-1
Presented at: The Fourth International Conference on Advances in Mobile Computing & Multimedia (MoMM2006), 4.12.-6.12.2006, Yogyakarta,
Indonesia.
The paper describes a new approach for treatment security issues in reconfigurable groups of
users (Virtual Organizations-VO). The proposed strategy combines a convenient mathematical
model, efficient combinatorial algorithms which are robust with respect to changes in the
VO structure, and an efficient implementation. The mathematical model uses properties of
weighted hypergraphs. Model flexibility enables description of basic security relations between
the nodes such that these relations are preserved under frequent changes in connections of the
hypergraph nodes. The proposed implementation makes use of the techniques developed for
time and space-critical applications in numerical linear algebra. The ideas can be generalized
to other concepts describable by weighted hypergraphs. The consistency of the proposed ideas
for security management in the changing VO was verified in a couple of tests with our pilot
implementation SECGRID.
Špánek Roman
Web Search Engines and Linear Algebra
Technical Report: V-974, ICS AS CR, Prague, 2006, 7 p.
The technical report presents a brief overview on web search engines with deeper insight into their linear
algebra background. The linear algebra plays very important role in modern web search algorithms (e.g.
Google). The report presents two algorithms, particularly HITS and PageRank. The algorithms are discussed on their convergence problems and also some improvements to their personalization abilities. The computation complexity is also mentioned and briefly sketched.
Špánek Roman
Maintaining Trust in Large Scale Environments
In: Doktorandský den 07, (Ed. F. Hakl), MATFYZPRESS, 2007, pp. 94-102.
Presented at: Doktorandské dny 2007, 17.-19.9.2007, Malá Úpa,
Czech Republic.
Špánek Roman
Supporting Secure Communication in Distributed Environments
Špánek Roman
Reputation System for Large Scale Environments
In: WWM 2007, 2007, pp. 621-626.
Presented at: WWM 2007, 1st International Web X.0 and Web Mining Workshop, held in collocation with ICDIM 2007, 28.10.-31.10.2007, Lyon,
France.
The paper describes a new approach for treating trust in
reconfigurable groups of users with special accent on trust
in the next generations of the Internet. The proposed model
uses properties of weighted hypergraphs. Model flexibility
enables description of relations between nodes such that
these relations are preserved under frequent changes. The
ideas can be straightforwardly generalized to other concepts
describable by weighted hypergraphs. The consistency
of the proposal was verified in a couple of experiments
with our pilot implementation SecGRID.
Špánek Roman, Pirkl Pavel, Kovář P.
The Blue Game Project: Ad-hoc Multiplayer Mobole Game with Social Dimension
In: CoNEXT 2007, New York, 2007.
Presented at: 3rd Annual CoNEXT Conference, 10.-13.12.2007, New York,
USA.
The paper presents the BlueGame project an ad-hoc multiplayer
mobile game based on the Dungeons&Dragons board
game. The main idea lies in the adoption of Bluetooth Piconet
configuration and direct face to face contact of players
in real environments.
Špánek Roman, Řimnáč Martin, Linková Zdeňka
On creating a trusted and distributed data source environment
In: SOFSEM 2008: Theory and Practice of Computer Science, P. J. Šafárik University, Košice, 2008, pp. 112-123.
Presented at: 34th International Conference on Current Trends in Theory and Practice of Computer Science, 19.-25.1.2008, Nový Smokovec, High Tatras,
Slovakia.
Despite the tremendous research activity in the field of searching engines for
the Internet, current searching engines still face some severe limitations.
The paper presents an idea of a distributed data source environment to be
build on the current state of the art technologies available on the Internet.
The paper combines recent advances in the fields of a data inconsistency, a
data integration and reputations of sources for further refinements of data
searching and sharing processes. The paper generalizes the data binary
formalism narrowly connected with the ideas of the semantic web into the <0,1> interval to enable the consideration of uncertainty at various levels.
Štuller Július, Linková Zdeňka
Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006.
ISBN: 80-903298-7-X
Toman Kamil, Mlýnková Irena
XML Data - The Current State of Affairs
In: Proceedings of XML Prague 2006 conference, ITI Series, MFF UK, 2006, pp. 87-102.
Presented at: XML Prague 2006, 17.6.-18.6.2006, Prague,
Czech Republic.
At present the eXtensible Markup Language (XML) is used almost in all spheres of human activities. Its popularity results especially from the fact that it is a self-descriptive metaformat that allows to define the structure of XML data using other powerful tools such as DTD or XML Schema. Consequently, we can witness a massive boom of techniques for managing, querying, updating, exchanging, or compressing XML data.
On the other hand, for majority of the XML processing techniques we can find various spots which cause worsening of their time or space efficiency. Probably the main reason is that most of them consider XML data too globally, involving all their possible features, though the real data are often much simpler. If they do restrict the input data, the restrictions are often unnatural.
In this contribution we discuss the level of complexity of real XML collections and their schemes, which turns out to be surprisingly low. We involve and compare results and findings of existing papers on similar topics as well as our own analysis and we try to ¯nd the reasons for these tendencies and their consequences.
Toman Kamil, Mlýnková Irena
Statistics on The Real XML Data
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 123-130.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
At present the eXtensible Markup Language (XML) is used
almost in all spheres of human activities. We can witness a massive
boom of techniques for managing, querying, updating, exchanging, or
compressing XML data.
On the other hand, for majority of the XML processing techniques we can
find various spots which cause worsening of their time or space efficiency.
Probably the main reason is that most of them consider XML data too
globally, involving all their possible features, though the real data are
often much simpler. If they do restrict the input data, the restrictions
are often unnatural.
We discuss the level of complexity of real XML collections and their
schemes, which turns out to be surprisingly low. We involve and compare
results and findings of existing papers on similar topics as well as our
own analysis and we try to ¯nd the reasons for these tendencies and their
consequences.
Tyl Pavel
Combination of Methods for Ontology Matching
In: Doktorandské dny '08, (Ed. F. Hakl), MATFYZPRESS, 2008, pp. 125-132.
ISBN: 978-80-7378-054-8
Presented at: Doktorandské dny 2008, 29.9.-1.10.2008, Jizerka,
Czech Republic.
While partial ontologies cover view at one-track area, many applications require much more general approach to describe their data.
On this account it approaches to ontology matching as a headstone of further operations, that can transform several ontological descriptions into one.
This paper describe case study of such process with using different methods, confront their fruitfulness and discuss a possibility of using particular
results to definition of final ontology. Two trivial ontologies were created (independently of any tool) and they were matched using various selected tools.
Tyl Pavel
Problematika integrace ontologií
In: Doktorandský den 07, (Ed. F. Hakl), MATFYZPRESS, 2007, pp. 110-115.
Presented at: Doktorandské dny 2007, 17.-19.9.2007, Malá Úpa,
Czech Republic.
Internet je ohromným zdrojem provázaných, ale většinou neuspořádaných dat. Sémantický web, jako rozšíření
webu současného, se snaží tuto neuspořádanost řešit a to nejen bezprostředně pro lidského uživatele, ale zejména
z hlediska možnosti strojového zpracování informací. Cílem je doplnit data o metadata, která mají být srozumitelná
jak pro člověka, tak pro počítač. Tato metadata jsou nejčastěji vyjádřena pomocí ontologií, které jsou jedním
ze základních stavebních prvků sémantického webu. V příspěvku se snažím nastínit některé z možností integrace
(slučování) ontologií za účelem sdílení informací.
Vaculín Roman, Neruda Roman
Autonomous behavior of computational agents
In: Adaptive and Natural Computing Algorithms, Springer, Wien, 2005, pp. 514-517.
Presented at: ICANNGA 2005, 21.-23.03.2005, Coimbra,
Portugal.
In this paper we present an architecture for decision making of software agents that allows the agent to behave autonomously. Our target area is computational agents—encapsulating various neural networks, genetic algorithms, and similar methods — that are expected to solve problems of different nature within an environment of a hybrid computational multi-agent system. The architecture is based on the vertically-layered and beliefdesire-intention architectures. Several experiments with computational agents were conducted to demonstrate the benefits of the architecture
Van Leeuwen Jan, Wiedermann Jiří
Lazy Autoconfiguration in Mobile Ad Hoc Networks and Dynamic Sets of Mobile Agents
Technical Report: UU-CS-2006-018, Department of Information and Computing Sciences, Utrecht University, Utrecht, 2006
In MANETs and similar link-free networks of communicating objects there is no central authority for naming and connection management. Autoconfiguration of network nodes is therefore desirable and, building on approaches in IPv6, a number of ‘zero configuration’ networking protocols have been proposed for this case. Typically these protocols do not easily scale and have difficulty with network partitioning and merging. In this paper we propose a number of novel, decentralized techniques for name resolution in zero configuration protocols that are more flexible and yet lead to name extensions of smallest possible length, assuming that objects sufficiently mix within their ranges. Finally, the techniques are converted into a fully decentralized, scalable autoconfiguration protocol for use in ad hoc networks with directional antennas. The protocol is ‘lazy’ in the sense that name resolution is postponed until the moment that conflicts prevent the correct functioning of the communication structure.
Vaneková Veronika, Vojtáš Peter
A Description Logic with Concept Ordering and top-k Restriction
In: Proc. of 18th European Japanese Conference on Information Modelling and Knowledge Bases, EJC2008 Program Comitee and EJC2008 Program Coordination team, 2008, pp. 139-149.
ISBN: 978-3-540-85712-9
(in_print)
Presented at: EJC 2008: 18th European Japanese Conference on Information Modelling and Knowledge Bases, 2.-6.6.2008, Tsukuba, Japan.
Vlčková Z., Galamboš Leo
Dynamizace gridu
In: Proceedings of ITAT 2007, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), PONT s.r.o., Seňa, 2007, pp. 115-121.
Presented at: Konferencia o informačných (inteligentných) technológiách - aplikácie a teória 2007, 21.-27.9.2007, Polana,
Slovakia.
Vojtáš Peter, Gurský Peter
On top-k search with no random access using small memory
In: Proc. of 12th Advances in Databases and Information Systems, LNCS 5207, Springer-Verlag, Berlin, 2008, pp. 97-111.
ISBN: 978-3-540-85712-9
Presented at: ADBIS 2008: 12th Advances in Databases and Information Systems, 5.-9.9.2008, Pori, Finland.
Vojtáš Peter
Decathlon, Conflicting Objectives and User Preference Querying
In: Proc. of DATESO 2008, (Ed. J. Pokorný, V. Snášel, K. Richta), CEUR Workshop Proc., 2008, pp. 76-78.
ISBN: 978-80-248-1746-0
Presented at: Dateso 2008: Annual International Workshop on DAtabases, TExts, Specifications and Objects, 16.4.-18.4.2008, Desná - Černá Říčka,
Czech Republic.
Vojtáš Peter
Proceedings of ITAT 2005, Information Technologies - Applications and Theory
In: Proceedings of ITAT 2005, Information Technologies - Applications and Theory, (Ed. Vojtáš P.), Prírodovedecká fakulta Univerzity Pavla Jozefa Šafárika, Košice, 2005.
ISBN: 80-7097-609-8
Vojtáš Peter
Fuzzy Logic as an Optimization Task
In: Fuzzy Logic and Technology, (Ed. Sobrevilla P., Montseny E.), Barcelona, 2005, pp. 781-786.
ISBN: 84-7683-872-3
Presented at: EUSFLAT - LFA 2005. Conference of the European Society for Fuzzy Logic and Technology /13./, Recontres Francophones sur la Logique Floue et ses Applications /11./, 7.9.-9.9.2005, Barcelona, Spain.
Vojtáš Peter
Model Theoretic and Fixpoint Semantics for Preference Queries over Imperfect Data
In: Proceedings of Inconsistency and Incompleteness in Databases, (Ed. Chomicki J., Wijsen J.), Munich, 2006, pp. 87-91.
Presented at: Inconsistency and Incompleteness in Databases, International Workshop Collocated with the 10 th International Conference on Extending Database Technology, 26.3.2006, Munich,
Germany.
We present an overview of our results on model theoretic and fixpoint semantics for a relational algebra using a model of many valued Datalog with similarity. Using our previous results on equivalence of our model and certain variant of generalized annotated programs, we base our querying on fuzzy aggregation operators (also called annotation terms, combining functions, utility functions). Using of fuzzy aggregation operators (distinct from database aggregations) enables us to reduce tuning of various linguistic variables. In practice we can learn fuzzy aggregator operators by an ILP procedure for every user profile. Our approach enables also integration of data from different sources via aggregation and similarity. Extending domains we discuss difference between fuzzy elements and fuzzy subsets. We also discuss an alternative, when all extensional data are stored crisp and fuzziness is in rules interpreting data, context and in user query.
Vojtáš Peter
A Fuzzy EL Description logic with Crisp Roles and Fuzzy Aggregation for Web Consulting
In: Proceedings of Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), Edition EDK, 2006, pp. 1834-1841.
ISBN: 2-84254-112-X
Presented at: Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), 2.7.-7.7.2006, Paris,
France.
Vojtáš Peter
Fuzzy Logic Aggregation for Semantic Web Search for the Best Answer
In: Fuzzy Logic and the Semantic Web, (Ed. Sanchez E.), Elsevier, 2006.
ISBN: 0-444-51948-3
Vojtáš Peter
Information Technologies - Applications and Theory
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Vojtáš Peter
EL Description Logic Modeling Querying Web and Learning Imperfect User Preferences
In: Uncertainty Reasoning for the Semantic Web - Volume 2, Proceedings of the Second ISWC Workshop on Uncertainty Reasoning for the Semantic Web, (Ed. P.C.G. da Costa, K.B. Laskey, K.J. Laskey, F. Fung, M. Pool), 2006, pp. 2.
Presented at: Workshop on Uncertainty Reasoning for the Semantic Web URSW 2006, 5.11.2006, Athens, Georgia,
USA.
In this position paper we share ideas on modeling querying
web resources by (imperfect) combination of particular user preferences
based on description logic. Our basic assumption is, that web resources
are modeled crisp. Imperfection (uncertainty, vagueness,...) comes from
user context and preferences. We offer a model based on connection between
three EL-description logic systems: classical, annotated(fuzzy) and
a new variant of Bayesian description logic. The Bayesian part enables
learning each single user`s combination function and concepts.
Vojtáš Peter, Vomlelová M.
Learning fuzzy logic aggregation for multicriterial querying with user preferences
In: Proceedings of 27th Linz Seminar on Fuzzy Set Theory - Preferences, Games and Decisions, (Ed. J. Fodor, E.P. Klement, M. Roubens), Linz, 2006, pp. 128-129.
Presented at: 27th Linz Seminar on Fuzzy Set Theory - Preferences, Games and Decisions, 7.2.-11.2.2006, Linz,
Austria.
Vojtáš Peter
EL description logic with aggregation of user preference concepts
In: Frontiers in Artificial Intelligence and applications 154, Information modelling and Knowledge Bases XVIII, IOS Press, Amsterdam, 2007, pp. 154-165.
Wiedermann Jiří
HUGO: A Cognitive Architecture with an Incorporated World Model
Technical Report: V-966, ICS AS CR, Prague, 2006, 15 p.
We present a design of cognitive system architecture with an internal world model. The internal world model is realized with the help of artificial mirror neurons. We consider generalized artificial mirror neurons acting both as a mechanism for assembling and learning multimodal sensorimotor information and as associative memory for invoking multimodal information given only some of its components. We show that within an artificial cognitive system a network of generalized mirror neurons can simultaneously serve as an internal world model recognized by the agent and as that of the agent`s position within this world. We also specify a self-organizing control mechanism, which is based on the basic operations over concepts that were essentially identified by the British 18th century philosopher David Hume. This control mechanism makes use of the internal world model constructed in agent`s interaction with real world and straightforwardly supports imitation learning. Building heavily on the properties of the generalized mirror net and on automatic abstract concept creation, we offer an algorithmic explanation of computational language acquisition, thinking and consciousness in our model. Rather than describing an implementation of the respective mechanisms, the aim of the paper is to establish a proof of the principle of algorithmic nature of higher cognitive functions.
Wiedermann Jiří
Chtěli byste být mozkem v baňce?
Technical Report: V-967, ICS AS CR, Prague, 2006, 7 p.
Moderní teorie kognitivních systémů pohlíží na tyto systémy jako na autonomní vtělené výpočetní systémy, které se situují v okolí prostřednictvím svých senzomotorických jednotek. Přesto zejména v kruzích počítačových teoretikù je opakovaně slyšet názory, že na kognici lze pořád možné pohlížet i "klasicky", jako na problém specifického zpracování dat a že tudíž vtělení není nezbytné pro zachycení podstaty kognice. Ukážeme, že takto zjednodušený pohled opomíjí podstatnou vlastnost kognitivních systémů - a sice jejich aktivní vliv na výběr či dokonce vznik vstupních dat. Bez této zpětné vazby si systém nemůže vytvořit svůj vnitřní model světa poznaný prostřednictvím svých akcí. Pro vysvětlení povahy zmíněného problému použijeme výpočetní model kognitivních systémů zavedený autorem v předchozích pracích. Tento model umožní na principielní úrovni přemýšlet o fungování algoritmických mechanizmů imitace, komunikace, vzniku řeči, myšlení a vědomí a tím přispět i k jejich pochopení v živých systémech.
Wiedermann Jiří
Self-Reproducing Self-Assembling Evolutionary Automata
In: Tillings and Cellular Automata, CDMTCS, Auckland, 2004, pp. 1-15.
Presented at: International Workshop on Tillings and Cellular Automata, 13.-17.12.2004, Auckland,
New Zeland.
We introduce a computational model of a so-called globular universe which represents generalization of both classical cellular automata and contemporary models of self-assembly. Similarly as the latter mentioned model our model utilizes a multiset of globules which are endowed by self-organizing ability controlled by a finite state mechanism; these computational units are not fixed in a predetermined structure. The environment abounds in these units which are available at places where needed for a self-assembly of various objects. Within a globular universe we define the notion of self-reproducing evolutionary automaton. This notion refers to an automaton being at the beginning of a lineage of self-reproducing automata which leads to self-reproducing automata with arbitrary complex finite state control mechanisms via a series of mutations of intermediate automata. The ideas presented in this paper complement von Neumann`s results on self-reproducing automata in a static universe by offering a precise definition of what is meant by "evolutionary self-reproduction" and by designing a dynamic nondeterministic universe with a self-reproducing self-assembling evolutionary automaton.
Wiedermann Jiří
Self-Reproducing by Self-Assembly and Fission
In: Developments in Language Theory. Supplemental Papers., CDMTCS, Auckland, 2004, pp. 1-10.
Presented at: DLT`04. International Conference on Developments in Language Theory /8./, 13.-17.12.2004, Auckland,
New Zeland.
We introduce so-called biomata which represent a novel approach to the construction of self-reproducing automata within the automata theory. The design of our automata has been motivated by the ideas of cellular biology on the origin of life. Unlike the von Neumann`s model our model replicates by fission and need not give much attention to the exact guiding of its own assemblage; rather, this process relies on self-assembly abilities of the respective parts produced by the biomaton from input objects not possessing such quality. The model represents an interesting fusion of computational and self-organizational processes.We believe that by capturing the basic aspects of the assumed origin of real life our modelling leads to a conceptually simpler and hence more plausible scenario of natural self-reproduction than the previous attempts did.
Wiedermann Jiří
Chtěli byste být mozkem v baňce, aneb o důležitosti vtělenosti a situovanosti při rozvoji mentálních schopností kognitivního agenta
In: Kognice a umělý život, (Ed. Kelemen J.), Sleská univerzita, Ostrava, 2006, pp. 421-427.
Presented at: Kognice a umělý život VI, 28.5.-1.6.2006, Třešť,
Czech Republic.
Moderní teorie kognitivních systémů pohlíží na tyto systémy jako na autonomní vtělené výpočetní systémy, které se situují v okolí prostřednictvím svých senzomotorických jednotek. Přesto zejména v kruzích počítačových teoretiků je opakovaně slyšet názory, že na kognici lze pořád možné pohlížet i „klasicky“, jako na problém specifického zpracování dat a že tudíž vtělení není nezbytné pro zachycení podstaty kognice. Ukážeme, že takto zjednodušený pohled opomíjí podstatnou vlastnost kognitivních systémů − a sice jejich aktivní vliv na výběr či dokonce vznik vstupních dat. Bez této zpětné vazby si systém nemůže vytvořit svůj vnitřní model světa poznaný prostřednictvím svých akcí. Pro vysvětlení povahy zmíněného problému použijeme výpočetní model kognitivních systémů zavedený autorem v předchozích pracích. Tento model umožní na principielní úrovni přemýšlet o fungování algoritmických mechanizmů imitace, komunikace, vzniku řeči, myšlení a vědomí a tím přispět i k jejich pochopení v živých systémech.
Wiedermann Jiří
Problém softwarových agentů a sémantický web
Wiedermann Jiří
Grandiózne výzvy pre dlhodobý výskum v informatike
Presented as an invited talk: SOFTEC 2006, Odborná konferencia o víziách a trendoch v moderných informačných technológiách, 2.3.2006, Bratislava,
Slovakia.
Existujú v súčasnej informatike výzvy zrovnateľné s veľkými problémami, s ktorými je konfrontované ľudstvo v oblasti fyziky, biológie, či iných vedných a inžinierskych oblastí? Má súčasná informatika výskumné zámery svojim významom zrovnateľné s odhalením podstaty gravitácie? S cestou na Mars? S rozlúštením genetického kódu? V prednáške sa zamyslíme, ktoré problémy v informatike majú podobný charakter, prečo je dobré sústrediť sa na ich riešenie a aké riešenia sa rysujú.
Wiedermann Jiří
One Computer Theorist`s View of Cognitive Systems
Wiedermann Jiří, Tel Gerard, Pokorný Jaroslav, Bieliková Mária, Štuller Július
Proceedings of SOFSEM 2006
In: Proceedings of SOFSEM 2006, Volume: II, ICS AS CR, Prague, 2006.
ISBN: 80-903298-4-5
Wiedermann Jiří, Tel Gerard, Pokorný Jaroslav, Bieliková Mária, Štuller Július
Proceedings of SOFSEM 2006: Theory and Practice of Computer Science
In: Proceedings of SOFSEM 2006: Theory and Practice of Computer Science, LNCS 3831, Springer-Verlag, Berlin, 2006.
ISBN: 3-540-31198-X
Wiedermann Jiří
Globural Universe and Autopoietic Automata: A Framework for Artificial Life
In: Advances in Artificial Life, (Ed. Bentley P. J., Capcarrere M., Freitas A. A., Johnson C. G.), Springer Verlag, Berlin, 2005, pp. 21-30.
Presented at: ECAL 2005, European Conference on Artificial Life, 5.9.-9.9.2005, Cantebury,
UK.
We present two original computational models - globular universe and autopoietic automata - capturing the basic aspects of an evolution: a construction of self-reproducing automata by self-assembly and a transfer of algorithmically modified genetic information over generations. Within this framework we show implementation of autopoietic automata in a globular universe. Further, we characterize the computational power of lineages of autopoietic automata via interactive Turing machines and show an unbounded complexity growth of a computational power of automata during the evolution. Finally, we define the problem of sustainable evolution and show its undecidability.
Wiedermann Jiří
Can Cognitive and Intelligent Systems Outperform Turing Machines?
In: Proceedings of Czech-Argentinian Workshop `e-Golems` (Interdisciplinary Aspects of Human-Machine Co-existence and Co-operation), (Ed. Marik et al.), CTU, Prague, 2005, pp. 82-86.
Presented at: Czech-Argentinian Workshop `e-Golems` (Interdisciplinary Aspects of Human-Machine Co-existence and Co-operation), 2.7.-5.7.2005, Prague,
Czech Republic.
We look for computational limits of artificial, natural and hybrid cognitive and intelligent systems. The common basis for such studies is offered by computationalism, i.e., the belief that cognitive or intelligent processes, respectively, are in essence computational processes. We show that in principle cognitive systems might exist whose computational power outperforms that of Turing machines and that even in practice we observe the rudiments of such systems. These results point to the fact that the so - called Church - Turing Thesis, dealing with the central position of Turing machines in the world of computations and algorithms, must be seen in the context of physical principles exploited by the cognitive systems, and in that of the communication scenario between the system and its environment.
Wiedermann Jiří
Neomezený evoluční růst výpočetní síly sebereprodukčních automatů v globulárním vesmíru a jiné výsledky
In: Kognice a umělý život, (Ed. Kelemen J., Kvasnička V., Pospíchal J.), Sleská univerzita, Ostrava, 2005, pp. 613-623.
ISBN: 80-7248-310-2
Presented at: Kognícia a umelý život V, 30.5.-2.6.2005, Smolenice,
Slovakia.
Popíšeme původní výpočetní modely – globulární vesmír a autopoietické automaty – které zachycují podstatné výpočetní aspekty evoluce: konstrukci sebereprodukčních evolučních automatů pomocí sebesestavování a přenos algoritmicky modifikovatelné genetické informace na potomka. V tomto rámci ukážeme neomezený růst výpočetní síly automatů během evoluce a pomocí interaktivního Turingova stroje chrakterizujeme výpočetní sílu rodových linií automatů.
Wiedermann Jiří
HUGO: A Cognitive Architecture with an Incorporated World Model
In: Towards a Science of Complex Systems, (Ed. Jost J., Reed-Tsochas F., Schuster P.), ECCS, Paris, 2006, pp. 15.
ISBN: 0-9554123-0-7
Presented at: European Conference on Complex Systems 2006 (ECCS '06), 25.9.-29.9.2006, Oxford,
Great Britain.
Wiedermann Jiří
Lesk a bída nestandardních výpočetních systémů
In: SOFTECON 2007, Softec, Bratislava, 2007, pp. 1-32.
Presented at: SOFTECON 2007. Odborná konferencia o víziách a trendoch v moderných informačných technologiách, 1.3.2007, Bratislava,
Slovakia.
Wiedermann Jiří
Nástin architektury vědomého kognitivního agenta se dvěma vnitřními modely světa
In: Kognice a umělý život, (Ed. Kelemen J., Kvasnička V., Pospíchal J.), Sleská univerzita, Opava, 2007, pp. 377-383.
Presented at: Kognice a umělý život VII, 28.5.-31.5.2007, Smolenice,
Slovakia.
Nastíníme jednoduchou, ale přesto kognitivně účinnou
architekturu kognitivního agenta. Náš model se liší od
jiných podobných modelů především využíváním dvou
komplementárních vnitřních modelů světa, které mají jiný
úkol než v podobných modelech známých z odborné
literatury. První z nich zachycuje senzorimotorickou
„syntaxi“ agentova chování a je využíván pro situování
agenta v jeho prostředí. Druhý model popisuje
senzorimotorickou dynamiku světa agenta a je využíván
pro řízení agentova chování. Informace v obou vnitřních
modelech závisí od agentova ztělesnění a jeho zkušeností.
Ukážeme, že kognitivní potenciál našeho modelu
podstatně překračuje možnosti dřívějších modelů tím, že
podporuje algoritmické procesy podobající se ve svých
důsledcích vyšším kognitivním funkcím, jakými jsou
imitační učení a rozvoj komunikace, řeči, myšlení a
vědomí.
Wiedermann Jiří
Spojení samoorganizace s výpočty: minimální život v moři umělých molekul
In: Myseľ, inteligencia a život, (Ed. Kvasnička V., Trebanický P., Pospíchal J., Kelemen J.), Slovenská technická univerzita, Bratislava, 2007, pp. 497-512.
Bakteriod je formální abstraktní hybridní systém, který ve své činnosti kombinuje
výpočetní a nevýpočetní mechanizmy. Ukážeme, že v prostředí umělých molekul, nadanými
jistými samoorganizačními schopnostmi, některé bakteroidy vykazují znaky minimálního života:
jsou autonomní, replikují se a mají schopnost darwinovské evoluce. Návrh bakteroidů je
inspirován představami současné molekulární biologie o dnes již neexistujících (či zatím
neobjevených) formách protoživota.
Wiedermann Jiří
Výpočetní meze kognitivních a inteligentních systémů
In: Umělá inteligence, (Ed. Mařík V., Štěpánková O., Lažanský J.), Academia, 2007, pp. 75-90.
ISBN: 978-80-200-1470-2
V příspěvku budeme hledat výpočetní meze kognitivních a inteligentních systémů, a
to jak biologických, tak i umělých a hybridních, které jsou kombinacích obou předchozích
druhů. Společnou platformu poskytne komputacionalimus, tj. víra, že kognitivní resp.
inteligentní procesy jsou v konečném důsledku výpočetními procesy. Ukážeme, že v principu
mohou existovat kognitivní systémy, a dokonce i v praxi existují „zárodky“ takových systémů,
které předčí svou výpočetní sílou výpočetní sílu Turingových strojů. Tyto výsledky naznačují,
že tzv. Church-Turingovu tezi, hovořící o centrálním postavení Turingových strojů ve světě
výpočtů a algoritmů, je třeba vidět v souvislosti s fyzikálními principy, které kognitivní systém
při své činnosti využívá, a se způsobem, kterým systém komunikuje s okolím.
Wiedermann Jiří, Petrů Lukáš
On the Universal Computing Power of Amorphous Computing Systems
Technical Report: V-1009, ICS AS CR, Prague, 2007, 11 p.
Amorphous computing differs from the classical ideas about computations almost in every aspect. The
architecture of amorphous computers is random, since they consist of a plethora of identical computational
units spread randomly over a given area. Within a limited radius the units can communicate wirelessly
with their neighbors via a single-channel radio. We consider a model whose assumptions on the underlying
computing and communication abilities are among the weakest possible: all computational units are finite
state probabilistic automata working asynchronously, there is no broadcasting collision detection mechanism
and no network addresses. We show that under reasonable probabilistic assumptions such amorphous
computing systems can possess universal computing power with a high probability. The underlying theory
makes use of properties of random graphs and that of probabilistic analysis of algorithms. To the best of
our knowledge this is the first result showing the universality of such computing systems.
Wiedermann Jiří, Petrů Lukáš
Communicating Mobile Nano-Machines and Their Computational Power
Technical Report: V-1024, ICS AS CR, Prague, 2008, 9 p.
A computational model of molecularly communicating mobile nanomachines is de¯ned. Nanomachines are modelled by timed probabilistic automata augmented by a severely restricted communication mechanism. We show that for molecular communication among such machines an asynchronous stochastic protocol originally designed for wireless communication in so-called amorphous computers with static computational units can also be used. We design an algorithm that using randomness and timing delays selects with a high probability a leader from among sets of anonymous candidates. This enables a simulation of counter automata proving that networks of mobile nanomachines possess universal computing power.
Wiedermann Jiří
An Architecture of an Embodied Cognitive Agent with Two Cooperating Internal World Models
In: Cybernetics and Systems Research, (Ed. R. Trappl), Austrian Society of Cybernetics Studies, Vienna, 2008, pp. 561-566.
Presented at: EMCSR 2008, 25.-28.03.2008, Vienna,
Austria.
We propose a simple yet cognitively powerful architecture of an embodied conscious
agent whose functioning is in a good agreement with neurophysiological evidence. Our
model differs from other proposals by exploiting two complementary internal world
models having a different purpose than that
in similar models known from the literature.
The first model captures the sensorimotor
"syntax" of the agent's behavior and is used
for situating the agent in its environment.
The second model describes the sensorimotor dynamics of the world and is used for
controlling the agent's behavior. Both internal world models are built automatically,
autonomously in the course of agent's interaction with the environment and their coop-
eration takes care of symbol grounding. We
show that the cognitive potential of the proposed model goes substantially beyond that
of earlier models since it supports algorithmic processes underlying phenomena similar
to higher cognitive functions such as imitation learning and the development of commu-
nication, language, thinking and consciousness.
Wiedermann Jiří
Specifikace výpočetního modelu mysli vědomého vtěleného agenta
In: Modely mysle, (Ed. V. Kvasnicka, J. Kelemen, J. Pospichal a E. Gal), Europa, Bratislava, 2008.
Nastíníme jednoduchou, ale přesto kognitivně účinnou architekturu inteligentního agenta. Model využívá dvou komplementárních vnitřních modelů světa: jeden pro „syntax“ poznaného světa a druhý pro jeho sémantiku. Tyto modely řeší problém porozumění konceptům a podporují algoritmické procesy, jejichž efekty se pro pozorovatele jeví jako projevy vyšších kognitivních funkcí, jakými jsou imitační učení, rozvoj komunikace, řeči, myšlení a vědomí.
Wiedermann Jiří
How we think of computing today
In: Logic and Theory of Algorithms, LNCS 5028, Springer, 2008, pp. 579-593.
Presented as an invited talk: Conference on Computability in Europe, CiE 2008, 15.-20.06.2008, Athens,
Greece.
Classical models of computation no longer fully correspond
to the current notions of computing in modern systems. Even in the
sciences, many natural systems are now viewed as systems that compute.
Can one devise models of computation that capture the notion of
computing as seen today and that could play the same role as Turing
machines did for the classical case? We propose two models inspired from
key mechanisms of current systems in both artificial and natural environments:
evolving automata and interactive Turing machines with advice.
The two models represent relevant adjustments in our apprehension of
computing: the shift to potentially non-terminating interactive computations,
the shift towards systems whose hardware and/or software can
change over time, and the shift to computing systems that evolve in an
unpredictable, non-uniform way. The two models are shown to be equivalent
and both are provably computationally more powerful than the
models covered by the old computing paradigm. The models also motivate
the extension of classical complexity theory by non-uniform classes,
using the computational resources that are natural to these models. Of
course, the additional computational power of the models cannot in general
be meaningfully exploited in concrete goal-oriented computations.
Yaghob Jakub, Zavoral Filip
Budování infrastruktury sémantického webu
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Yaghob Jakub, Zavoral Filip
Semantic Web Infrastructure using DataPile
In: Proceeding of the International Workshop on Technologies and Applications of Knowledge Computing on the Web (IEEE/WIC/ACM International Conference on Web Intelligence), (Ed. C.J. Butz, N.T. Nguyen, Y. Takama, W. Cheung), IEEE Computer Society, 2006, pp. 630-633.
Presented at: International Workshop on Technologies and Applications of Knowledge Computing on the Web (IEEE/WIC/ACM International Conference on Web Intelligence), 18.12.-22.12.2006, Hong-Kong
.
Zezula Pavel, Giuseppe Amato, Dohnal Vlastislav, Batko Michal
Similarity Search - The Metric Space Approach.
In: Advances in Database Systems, Volume: 32, Springer, 2006, pp. 220.
ISBN: 0-387-29146-6
Zezula Pavel
P2P Similarity Search Structures
In: Proceedings of the 14th Italian Symposium on Advanced Database Systems, peQuod, Ancona, Italy, 2006, pp. 1-12.
Presented as an invited talk: SEBD 2006 Fourteenth Italian Symposium on Advanced Database Systems, 18.6.-21.6.2006, Ancona,
Italy.
Zezula Pavel
Scalable Similarity Search in Computer Networks
In: Advances in Databases and Information Systems, LNCS 4152, Springer-Verlag, Berlin, 2006, pp. 3-3.
ISBN: 3-540-37899-5
Presented as an invited talk: Tenth East-European Conference on Advances in Databases and Information Systems, 3.9.-7.9.2006, Thessaloniki, Hellas,
Greece.
Zezula Pavel, Dohnal Vlastislav, Novák David
Towards Scalability of Similarity Searching
In: Global Data Management, (Ed. Baldoni R., Cortese G., Davide F., Melpignano A.), Volume: 8 of Emerging Communication: Studies in New Technologies and Practices in Communication, IOS Press, Amsterdam, The Netherlands, 2006.
ISBN: 1-58603-629-7
With the increasing number of applications that base searching on similarity rather than on exact matching, novel index structures are needed to speedup execution of similarity queries. An important stream of research in this direction uses the metric space as a model of similarity. We explain the principles and survey the most important representatives of index structures. We put most emphasis on distributed similarity search architectures which try to solve the difficult problem of scalability of similarity searching. The actual achievements are demonstrated by practical experiments. Future research directions are outlined in the conclusions.
Zezula Pavel, Giuseppe Amato, Dohnal Vlastislav
Similarity Search: The Metric Space Approach
In: ACM SAC 2007 Conference. ACM SAC 2007 Conference Tutorial, ACM, Seoul, Korea, 2007.
Presented at: ACM SAC 2007, , Seoul,
Korea.
Similarity searching has become afundamental computational task in a variety of application areas, including multimedia information retrieval, data mining, pattern recognition, machine learning, computer vision, biomedical databases, data compression and statistical data analysis. In such environments, an exact match has little meaning, and proximity/distance (similarity/dissimilarity) concepts are typically much more fruitful for searching. In this tutorial, we review the state of the art in developing similarity search mechanisms that accept the metric space paradigm. We explain the high extensibility of the metric space approach and demonstrate its capability with examples of distance functions. The efforts to further speed up retrieval are demonstrated by a class of approximated techniques and the very recent proposals of scalable and distributed structures based on the P2P communication paradigm.
Similarity searching has become afundamental computational task in a variety of application areas, including multimedia information retrieval, data mining, pattern recognition, machine learning, computer vision, biomedical databases, data compression and statistical data analysis. In such environments, an exact match has little meaning, and proximity/distance (similarity/dissimilarity) concepts are typically much more fruitful for searching. In this tutorial, we review the state of the art in developing similarity search mechanisms that accept the metric space paradigm. We explain the high extensibility of the metric space approach and demonstrate its capability with examples of distance functions. The efforts to further speed up retrieval are demonstrated by a class of approximated techniques and the very recent proposals of scalable and distributed structures based on the P2P communication paradigm.
Zezula Pavel, Dohnal Vlastislav, Batko Michal
File Organizations
In: Wiley Encyclopedia of Computer Science and Engineering, Wiley-Interscience, San Francisco, CA, USA, 2008, pp. 1-11.
Zezula Pavel, Batko Michal, Dohnal Vlastislav
Indexing Metric Spaces
In: Database Management and Information Retrieval, Springer-Verlag, New York, 2008, pp. 1-4.