Ali K., Pokorný Jaroslav
XML-based Temporal Models
Technical Report: DC-2006-02, Dep. of Comp. Sc. and Engineering, FEE TU, Prague, 2006, 39 p.
Much research work has recently focused on the problem of representing historical information in XML. This report describes a number of temporal XML data models and provides their comparison according to the following properties: time dimension (valid time, transaction time), support of temporal elements and attributes, querying possibilities, association to XML Schema/DTD, and influence on XML syntax.
Bednárek David
Turingovské vzory v XSLT programech
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Benda J., Obdržálek David
GFE - Graphical Finite State Machine Editor for Parallel Execution
In: Workshop on Educational Robotics, DIEES, 2006, pp. 41-47.
Presented at: Workshop on Educational Robotics 2006, 1.6.2006, Acireale, Italy.
Bustos B., Skopal Tomáš
Dynamic Similarity Search in Multi-Metric Spaces
In: Proceedings of ACM MIR 2006 (a workshop at ACM Multimedia 2006), ACM Press, Santa Barbara, CA, USA, 2006.
Presented at: ACM MIR 2006, 26.10.-27.10.2006, Santa Barbara,
CA, USA.
Dokulil Jiří, Yaghob Jakub, Zavoral Filip
Infrastruktura pro dotazování nad semantickými daty
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 10-26.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
Idea sémantického webu je široce diskutována mezi odbornou
veřejností již mnoho let. Přestože je vyvinuta řada technologií, jazyků,
prostředků a dokonce i softwarových nástrojú, málokdo někdy nějaký
reálný sémantický web viděl. Za jeden z hlavních dùvodù tohoto stavu
považujeme neexistenci potřebné infrastruktury pro provoz sémantického
webu. V našem článku popisujeme návrh takové infrastruktury, která je
založena na využití a rozšíření technologie datového stohu a nástrojích
pro něj vyvinutých a jejich kombinaci s webovými vyhledávači a dalšími
nástroji a prostředky.
Dokulil Jiří
Použití relačních databází pro vyhodnocování SPARQL dotazů
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Dokulil Jiří
Evaluation of SPARQL queries using relational databases
In: Proceedings of 5th International Semantic Web Conference, ISWC, 2006, (Ed. Cruz I.), LNCS 4273, Springer Verlag, Athens, FA, USA, 2006, pp. 972-973.
Basic storage and querying of RDF data using a relational
database can be done in a very simple manner. Such approach can run
into trouble when used on large and complex data. This paper presents
such data and several sample queries together with analysis of their performance.
It also describes two possible ways of improving the performance
based on this analysis.
Eckhardt Alan, Vojtáš Peter
Towards ontology language handling imperfection
In: Proceeding of the 1st Workshop on Intelligent and Knowledge oriented Technologies, 2006, pp. 124-125.
Presented at: 1st Workshop on Intelligent and Knowledge oriented Technologies, 28.11.-29.11.2006, Bratislava,
Slovakia.
Galamboš Leo
Dynamic Inverted Index Maintenance
In: International Journal of Computer Science, Volume: 1, No: 2, 2006, pp. 157-162.
Galamboš Leo
Inverted Index Maintenance
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 27-38.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
This paper presents a method for dynamization which may
be used for fast and effective inverted index maintenance. Experimental
results show that the dynamization process is possible and that it guarantees the response time for the query operation and index actualization.
Galamboš Leo, Lánský Jan, Chernik K.
Compression of Semistructured Documents
In: International Enformatika Conference IEC 2006, Enformatika, Transactions on Engieering, Computing and Technology, 2006, pp. 222-227.
Gurský Peter, Horváth T., Novotný R., Vaneková Veronika, Vojtáš Peter
UPRE: User preference based search system
In: Proceeding of the IEEE/WIC/ACM International Conference on Web Intelligence, ACM IEEE WIC, 2006, pp. 4.
Presented at: IEEE/WIC/ACM International Conference on Web Intelligence WI-06, 18.12.-22.12.2006, Hong-Kong
.
Lánský Jan, Galamboš Leo, Chernik K.
Komprese webového uložiště
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Mlýnková Irena, Toman Kamil, Pokorný Jaroslav
Statistical Analysis of Real XML Data Collections
Technical Report: 2006/5, MFF UK, Prague, 2006, 39 p.
Recently XML has achieved the leading role among languages for data representation and thus we can witness a massive boom of corresponding techniques for managing XML data. Most of the processing techniques however suffer from various bottlenecks worsening their time and/or space efficiency.We assume that the main reason is they consider XML collections too globally, involving all their possible features, although real data are often much simpler. Even though some techniques do restrict the input data, the restrictions are often unnatural. In this paper we analyze existing XML data, their structure and real complexity in particular.We have gathered more than 20GB of real XML collections and implemented a robust automatic analyzer. The analysis considers existing papers on similar topics, trying to confirm or confute their observations as well as to bring new findings. It focuses on frequent but often ignored XML items (such as mixed content or recursion) and relationship between schemes and their instances.
Mlýnková Irena, Toman Kamil, Pokorný Jaroslav
Statistical Analysis of Real XML Data Collections
In: Proceeding of the 13th International Conference on Management of Data - COMAD 2006, (Ed. Lakshmanan, L.L., Roy, P., Tung, A.), Tata McGraw Hill Publ. Comp., Delhi, 2006, pp. 20-31.
Presented at: 13th International Conference on Management of Data - COMAD 2006, 14.12.-16.12.2006, Delhi,
India.
Nečaský Martin
Conceptual Modeling for XML: A Survey
Technical Report: 2006-3, Dep. of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague, 2006, 54 p.
Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. However, XML as a logical database model has some special characteristics which makes existing conceptual models as E-R or UML unsuitable. In this paper, the current approaches to the conceptual modeling of XML data are described in an uniform style. A list of requirements for XML conceptual models is presented and described approaches are compared on the base of the requirements.
Nečaský Martin
Conceptual Modeling for XML: A Survey
In: Proceedings of the Dateso 2006, CEUR-WS, 2006, pp. 40-53.
Presented at: Dateso 2006 Annual International Workshop on DAtabases, TExts, Specifications and Objects, 26.4.-28.4.2006, Desná - Černá Říčka,
Czech Republic.
Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. However, XML as a logical database model has some special characteristics which makes existing conceptual models as E-R or UML unsuitable. In this paper, the current approaches to the conceptual modeling of XML data are described in an uniform style. A list of requirements for XML conceptual models is presented and described approaches are compared on the base of the requirements.
Nečaský Martin
XSEM – A Conceptual model for XML Data
In: Proceedings of Communications and Doctoral Consortium, 7th International Baltic Conference on Databases and Information Systems, Vilnius, 2006, pp. 328-331.
Recently XML is the standard format used for the exchange of data between information systems and is also frequently applied as a logical database model. If we use XML as a logical database model we need a conceptual model for the description of its semantics. In this paper, we describe our work on a new conceptual model for XML called XSEM created as a combination of several approaches applied in the area of conceptual modeling for XML.
Nečaský Martin
XSEM - A Conceptual Model for XML Data
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 60-69.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
In this paper we briefly describe a new conceptual model
for XML data called XSEM. The model is a combination of several approaches in the area of conceptual modeling of XML data. The model
divides the process of conceptual modeling of XML data to two levels.
On the first level, a designer designs an overall non-hierarchical conceptual schema of a domain. On the second level, he or she derives different
hierarchical representations of parts of the overall conceptual schema using transformation operators. These hierarchical representations describe
how the data is organized in an XML form.
Obdržálek David, Kulhánek Jiří
Generating and handling of differential data in DataPile-oriented systems
In: Proceedings of the IASTED International Conference on Databases and Applications (DBA 2006), (Ed. Hamza M. H.), 2006.
ISBN: 0-88986-560-4
Presented at: IASTED International Conference on Databases and Applications (DBA 2006) as part of the 24th IASTED International Multi-Conference on Applied Informatics, 13.2.-15.2.2006, Innsbruck,
Austria.
Basics of the DataPile structure for data handling systems have been theoretically designed and published. During implementation of such system, numerous problems which were not addressed during the theoretical design phase arose. In a real production environment, the applications connected to the DataPile core need special treatment and set important requirements on the data synchronization process. This article concerns with generating of differential data being distributed from the central DataPile storage to individual applications. It is shown that the synchronization part of DataPile-structured system can be implemented and run efficiently despite of the restrictions or limitations these individual applications impose.
Petricek V., Escher T., Cox I. J., Margetts H.
The Web Structure of E-Government - Developing a Methodology for Quantitative Evaluation
In: Proceedings of the 15th International Conference on World Wide Web WWW 2006, ACM Press, New York, 2006, pp. 669-678.
Presented at: International Conference on World Wide Web WWW 2006, 23.12.-26.12.2006, Edinburgh,
UK.
Pokorný Jaroslav, Reschke J.
Exporting relational data into a native XML store
In: Advances in Information Systems Development - Bridging the Gap between Academia and Industry, (Ed. A.G. Nilsson et al), Volume: 2, Springer Verlag, 2006, pp. 807-818.
ISBN: 0-387-30834-2
Pokorný Jaroslav
Databázové architektury: současné trendy a jejich vztah k novým požadavkům praxe
In: Sborník příspěvků 20. ročníku konference Moderní databáze, KOMIX, 2006, pp. 5-14.
ISBN: 80-239-7109-3
Presented at: Moderní databáze, 30.5.-31.5.2006, Zvánovice, Czech Republic.
Pokorný Jaroslav
Database architectures: current trends and their relationships to environmental data management
In: Environmental Modelling & Software, Volume: 21, No: 11, Elsevier Science, 2006, pp. 1579-1586.
Pokorný Jaroslav
Database Architectures: Current Trends and Their Relationships to Requirements of Practice
In: Proceedings of Information Systems Development ’06 Conference, Budapest, 2006.
Presented at: ISD’ 06 Conference, 31.8.-2.9.2006, Budapest,
Hungary.
Pokorný Jaroslav
Zpracování proudů dat
In: Proceedings of the Annual Database Conference DATAKON 2006, Masaryk University, Brno, 2006, pp. 61-76.
Presented at: DATAKON 2006, 20.10.-23.10.2006, Brno,
Czech Republic.
Skopal Tomáš
On Fast Non-Metric Similarity Search by Metric Access Methods
In: Proceedings of 10th International Conference on Extending Database Technology EDBT 2006, (Ed. Y. Ioannidis et al.), 2006, pp. 718-736.
ISBN: 3-540-32960-9
Presented at: EDBT 2006, 26.3.-31.3.2006, Munich,
Germany.
The retrieval of objects from a multimedia database employs a measure which defines a similarity score for every pair of objects. The measure should effectively follow the nature of similarity, hence, it should not be limited by the triangular inequality, regarded as a restriction in similarity modeling. On the other hand, the retrieval should be as efficient (or fast) as possible. The measure is thus often restricted to a metric, because then the search can be handled by metric access methods (MAMs). In this paper we propose a general method of non-metric search by MAMs. We show the triangular inequality can be enforced for any semimetric (reflexive, non-negative and symmetric measure), resulting in a metric that preserves the original similarity orderings (retrieval effectiveness). We propose the TriGen algorithm for turning any blackbox semimetric into (approximated) metric, just by use of distance distribution in a fraction of the database. The algorithm finds such a metric for which the retrieval efficiency is maximized, considering any MAM.
Skopal Tomáš, Snášel Václav
An Application of LSI and M-tree in Image Retrieval
In: GESTS International Transactions on Computer Science and Engineering, Volume: 34, No: 1, GEST Society, 2006, pp. 212-225.
When dealing with image databases, we often need to solve the problem of how to retrieve a desired set of images effectively and efficiently. As a representation of images, there are commonly used some high-dimensional vectors of extracted features, since in such a way the content-based image retrieval is turned into a geometric-search problem. In this article we present a study of feature extraction from raw image data by means of the LSI method (singular-value decomposition, respectively). Simultaneously, we show how such a kind of feature extraction can be used for efficient and effective similarity retrieval using the M-tree index. Because of the application to image retrieval, we also show some interesting effects of LSI, which are not directly obvious in the area of text retrieval (where LSI came from).
Snášel Václav, Moravec Pavel, Pokorný Jaroslav
Using BFA with WordNet Based Model for Web Retrieval
In: Journal of Digital Information Management, Volume: 4, No: 2, 2006, pp. 107-111.
Toman Kamil, Mlýnková Irena
XML Data - The Current State of Affairs
In: Proceedings of XML Prague 2006 conference, ITI Series, MFF UK, 2006, pp. 87-102.
Presented at: XML Prague 2006, 17.6.-18.6.2006, Prague,
Czech Republic.
At present the eXtensible Markup Language (XML) is used almost in all spheres of human activities. Its popularity results especially from the fact that it is a self-descriptive metaformat that allows to define the structure of XML data using other powerful tools such as DTD or XML Schema. Consequently, we can witness a massive boom of techniques for managing, querying, updating, exchanging, or compressing XML data.
On the other hand, for majority of the XML processing techniques we can find various spots which cause worsening of their time or space efficiency. Probably the main reason is that most of them consider XML data too globally, involving all their possible features, though the real data are often much simpler. If they do restrict the input data, the restrictions are often unnatural.
In this contribution we discuss the level of complexity of real XML collections and their schemes, which turns out to be surprisingly low. We involve and compare results and findings of existing papers on similar topics as well as our own analysis and we try to ¯nd the reasons for these tendencies and their consequences.
Toman Kamil, Mlýnková Irena
Statistics on The Real XML Data
In: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu, Ústav informatiky AV ČR, Prague, 2006, pp. 123-130.
ISBN: 80-903298-7-X
Presented at: Inteligentní modely, algoritmy, metody a nástroje pro vytváření sémantického webu - Seminář projektu programu Informační společnost, 5.10.-7.10.2006, Zadov,
Czech Republic.
At present the eXtensible Markup Language (XML) is used
almost in all spheres of human activities. We can witness a massive
boom of techniques for managing, querying, updating, exchanging, or
compressing XML data.
On the other hand, for majority of the XML processing techniques we can
find various spots which cause worsening of their time or space efficiency.
Probably the main reason is that most of them consider XML data too
globally, involving all their possible features, though the real data are
often much simpler. If they do restrict the input data, the restrictions
are often unnatural.
We discuss the level of complexity of real XML collections and their
schemes, which turns out to be surprisingly low. We involve and compare
results and findings of existing papers on similar topics as well as our
own analysis and we try to ¯nd the reasons for these tendencies and their
consequences.
Vojtáš Peter
Model Theoretic and Fixpoint Semantics for Preference Queries over Imperfect Data
In: Proceedings of Inconsistency and Incompleteness in Databases, (Ed. Chomicki J., Wijsen J.), Munich, 2006, pp. 87-91.
Presented at: Inconsistency and Incompleteness in Databases, International Workshop Collocated with the 10 th International Conference on Extending Database Technology, 26.3.2006, Munich,
Germany.
We present an overview of our results on model theoretic and fixpoint semantics for a relational algebra using a model of many valued Datalog with similarity. Using our previous results on equivalence of our model and certain variant of generalized annotated programs, we base our querying on fuzzy aggregation operators (also called annotation terms, combining functions, utility functions). Using of fuzzy aggregation operators (distinct from database aggregations) enables us to reduce tuning of various linguistic variables. In practice we can learn fuzzy aggregator operators by an ILP procedure for every user profile. Our approach enables also integration of data from different sources via aggregation and similarity. Extending domains we discuss difference between fuzzy elements and fuzzy subsets. We also discuss an alternative, when all extensional data are stored crisp and fuzziness is in rules interpreting data, context and in user query.
Vojtáš Peter
A Fuzzy EL Description logic with Crisp Roles and Fuzzy Aggregation for Web Consulting
In: Proceedings of Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), Edition EDK, 2006, pp. 1834-1841.
ISBN: 2-84254-112-X
Presented at: Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU 2006), 2.7.-7.7.2006, Paris,
France.
Vojtáš Peter
Fuzzy Logic Aggregation for Semantic Web Search for the Best Answer
In: Fuzzy Logic and the Semantic Web, (Ed. Sanchez E.), Elsevier, 2006.
ISBN: 0-444-51948-3
Vojtáš Peter
Information Technologies - Applications and Theory
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Vojtáš Peter
EL Description Logic Modeling Querying Web and Learning Imperfect User Preferences
In: Uncertainty Reasoning for the Semantic Web - Volume 2, Proceedings of the Second ISWC Workshop on Uncertainty Reasoning for the Semantic Web, (Ed. P.C.G. da Costa, K.B. Laskey, K.J. Laskey, F. Fung, M. Pool), 2006, pp. 2.
Presented at: Workshop on Uncertainty Reasoning for the Semantic Web URSW 2006, 5.11.2006, Athens, Georgia,
USA.
In this position paper we share ideas on modeling querying
web resources by (imperfect) combination of particular user preferences
based on description logic. Our basic assumption is, that web resources
are modeled crisp. Imperfection (uncertainty, vagueness,...) comes from
user context and preferences. We offer a model based on connection between
three EL-description logic systems: classical, annotated(fuzzy) and
a new variant of Bayesian description logic. The Bayesian part enables
learning each single user`s combination function and concepts.
Vojtáš Peter, Vomlelová M.
Learning fuzzy logic aggregation for multicriterial querying with user preferences
In: Proceedings of 27th Linz Seminar on Fuzzy Set Theory - Preferences, Games and Decisions, (Ed. J. Fodor, E.P. Klement, M. Roubens), Linz, 2006, pp. 128-129.
Presented at: 27th Linz Seminar on Fuzzy Set Theory - Preferences, Games and Decisions, 7.2.-11.2.2006, Linz,
Austria.
Wiedermann Jiří, Tel Gerard, Pokorný Jaroslav, Bieliková Mária, Štuller Július
Proceedings of SOFSEM 2006
In: Proceedings of SOFSEM 2006, Volume: II, ICS AS CR, Prague, 2006.
ISBN: 80-903298-4-5
Wiedermann Jiří, Tel Gerard, Pokorný Jaroslav, Bieliková Mária, Štuller Július
Proceedings of SOFSEM 2006: Theory and Practice of Computer Science
In: Proceedings of SOFSEM 2006: Theory and Practice of Computer Science, LNCS 3831, Springer-Verlag, Berlin, 2006.
ISBN: 3-540-31198-X
Yaghob Jakub, Zavoral Filip
Budování infrastruktury sémantického webu
In: Proceedings of ITAT 2006, Information Technologies - Applications and Theory, 2006.
ISBN: 80-969184-4-3
Presented at: ITAT 2006, 26.9.-1.10.2006, Chata Kosodrevina, Bystrá dolina, Nízke Tatry,
Slovakia.
Yaghob Jakub, Zavoral Filip
Semantic Web Infrastructure using DataPile
In: Proceeding of the International Workshop on Technologies and Applications of Knowledge Computing on the Web (IEEE/WIC/ACM International Conference on Web Intelligence), (Ed. C.J. Butz, N.T. Nguyen, Y. Takama, W. Cheung), IEEE Computer Society, 2006, pp. 630-633.
Presented at: International Workshop on Technologies and Applications of Knowledge Computing on the Web (IEEE/WIC/ACM International Conference on Web Intelligence), 18.12.-22.12.2006, Hong-Kong
.