Path Expression

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 31551 Experts worldwide ranked by ideXlab platform

Yip Chi Lap - One of the best experts on this subject based on the ideXlab platform.

  • SF-Tree: An efficient and flexible structure for estimating selectivity of simple Path Expressions with statistical accuracy guarantee
    Lecture Notes in Computer Science, 2004
    Co-Authors: Ben Kao, David W. Cheung, Yip Chi Lap
    Abstract:

    Estimating the selectivity of a simple Path Expression (SPE) is essential for selecting the most efficient evaluation plans for XML queries. To estimate selectivity, we need an efficient and flexible structure to store a summary of the Path Expressions that are present in an XML document collection. In this paper we propose a new structure called SF-Tree to address the selectivity estimation problem. SF-Tree provides a flexible way for the users to choose among accuracy, space requirement and selectivity retrieval speed. It makes use of signature files to store the SPEs in a tree form to increase the selectivity retrieval speed and the accuracy of the retrieved selectivity. Our analysis shows that the probability that a selectivity estimation error occurs decreases exponentially with respect to the error size.

  • DASFAA - SF-Tree: An efficient and flexible structure for estimating selectivity of simple Path Expressions with statistical accuracy guarantee
    Database Systems for Advanced Applications, 2004
    Co-Authors: Ben Kao, David W. Cheung, Yip Chi Lap
    Abstract:

    Estimating the selectivity of a simple Path Expression (SPE) is essential for selecting the most efficient evaluation plans for XML queries. To estimate selectivity, we need an efficient and flexible structure to store a summary of the Path Expressions that are present in an XML document collection. In this paper we propose a new structure called SF-Treeto address the selectivity estimation problem. SF-Tree provides a flexible way for the users to choose among accuracy, space requirement and selectivity retrieval speed. It makes use of signature files to store the SPEs in a tree form to increase the selectivity retrieval speed and the accuracy of the retrieved selectivity. Our analysis shows that the probability that a selectivity estimation error occurs decreases exponentially with respect to the error size.

Guoren Wang - One of the best experts on this subject based on the ideXlab platform.

  • CAiSE - Query processing and optimization for regular Path Expressions
    Notes on Numerical Fluid Mechanics and Multidisciplinary Design, 2003
    Co-Authors: Guoren Wang, Mengchi Liu
    Abstract:

    Regular Path Expression is one of the core components of XML query languages, and several approaches to evaluating regular Path Expressions have been proposed. In this paper, a new Path Expression evaluation approach, extent join, is proposed to compute both parent-children ('/') and ancestor-descendent ('//') connectors between Path steps. Furthermore, two Path Expression optimization rules, Pathshortening and Path-complementing, are proposed. The former reduces the number of joins by shortening the Path while the latter optimizes the execution of a Path by using an equivalent complementary Path Expression to compute the original Path. Experimental results show that the algorithms proposed in this paper are much more efficient than conventional ones.

  • A New Path Expression Computing Approach for XML Data
    Efficiency and Effectiveness of XML Tools and Techniques and Data Integration over the Web, 2003
    Co-Authors: Guoren Wang, Bing Sun
    Abstract:

    Most query languages in XML database systems use Regular Path Expressions (RPE) to query or extract data from databases and some query processing and optimization techniques have been proposed for RPEs. Conceptually XML documents are collections of Path instances. Each Path instance should conform to an XML element tag sequence, called Path schema. A RPE query can be written as an automaton that can represent a language, while Path schemas can be seen as sentences. In this paper, a novel RPE computing approach, automaton match (AM), is proposed. AM queries the RPEs by matching the automatons with Path schemas. The experimental results show AM is quite efficient for computing RPE queries.

  • comparison of parallel algorithms for Path Expression query in object database systems
    Database Systems for Advanced Applications, 2001
    Co-Authors: Guoren Wang, K Kaneko, Akitake Makinouchi
    Abstract:

    Proposes a new parallel algorithm for computing Path Expressions, named the "parallel cascade semi-join" (PCSJ) algorithm. Moreover, a new scheduling strategy called the "right-deep zigzag tree" is designed to further improve the performance of the PCSJ algorithm. The experiments have been implemented in a distributed and parallel NOW (network of workstations) environment. The results show that the PCSJ algorithm outperforms two other parallel algorithms [the parallel forward pointer chasing (PFPC) algorithm and the index-splitting parallel algorithm (IndexSplit)] when computing Path Expressions with restrictive predicates, and that the right-deep zigzag tree scheduling strategy has a better performance than the right-deep tree scheduling strategy.

  • DASFAA - Comparison of parallel algorithms for Path Expression query in object database systems
    Proceedings Seventh International Conference on Database Systems for Advanced Applications DASFAA 2001 DASFAA-01, 2001
    Co-Authors: Guoren Wang, K Kaneko, Akitake Makinouchi
    Abstract:

    Proposes a new parallel algorithm for computing Path Expressions, named the "parallel cascade semi-join" (PCSJ) algorithm. Moreover, a new scheduling strategy called the "right-deep zigzag tree" is designed to further improve the performance of the PCSJ algorithm. The experiments have been implemented in a distributed and parallel NOW (network of workstations) environment. The results show that the PCSJ algorithm outperforms two other parallel algorithms [the parallel forward pointer chasing (PFPC) algorithm and the index-splitting parallel algorithm (IndexSplit)] when computing Path Expressions with restrictive predicates, and that the right-deep zigzag tree scheduling strategy has a better performance than the right-deep tree scheduling strategy.

  • Analysis and evaluation of resources utility strategies for parallel Path Expression algorithms on DSVM
    Proceedings Fourth International Conference Exhibition on High Performance Computing in the Asia-Pacific Region, 2000
    Co-Authors: Qiang Fang, Guoren Wang
    Abstract:

    DSVM is a valuable technique developed in shared-nothing parallel architecture. Some research is given to the parallel algorithms on DSVM in the object database, but the essential topic is how to make full use of all kinds of resources to avoid the bottlenecks of CPUs, disks, memory and networks. In order to study this issue deeply, the paper takes parallel algorithms for Path Expressions as examples and analyzes how to use resources reasonably based on DSVM in NOW environments.

Aoying Zhou - One of the best experts on this subject based on the ideXlab platform.

  • index selection for efficient xml Path Expression processing
    International Conference on Conceptual Modeling, 2003
    Co-Authors: Zhimao Guo, Shuigeng Zhou, Aoying Zhou
    Abstract:

    One approach to building an efficient XML query processor is to use RDBMSs to store and query XML documents. XML queries contain a number of features that are either hard to translate into SQLs or for which the resulting SQL is complex and inefficient. Among them, Path Expressions pose a new challenge for efficient XML query processing in RDBMSs. Building index structures for Path Expressions is necessary. Meanwhile, indexes occupy much disk space. There is a tradeoff between the consumption of disk space and the efficiency of query evaluation. In this paper, we present a cost model for the space consumption of indexes and their benefit to XML queries. Making use of the statistics of XML data and the characteristics of the target application, we adopt greedy algorithm to select some map indexes to be built. Our experimental study demonstrates that query performance get comparatively significant improvement over the case without indexes while only consuming disk space of modest size.

  • ER (Workshops) - Index Selection for Efficient XML Path Expression Processing
    Conceptual Modeling for Novel Application Domains, 2003
    Co-Authors: Zhimao Guo, Shuigeng Zhou, Aoying Zhou
    Abstract:

    One approach to building an efficient XML query processor is to use RDBMSs to store and query XML documents. XML queries contain a number of features that are either hard to translate into SQLs or for which the resulting SQL is complex and inefficient. Among them, Path Expressions pose a new challenge for efficient XML query processing in RDBMSs. Building index structures for Path Expressions is necessary. Meanwhile, indexes occupy much disk space. There is a tradeoff between the consumption of disk space and the efficiency of query evaluation. In this paper, we present a cost model for the space consumption of indexes and their benefit to XML queries. Making use of the statistics of XML data and the characteristics of the target application, we adopt greedy algorithm to select some map indexes to be built. Our experimental study demonstrates that query performance get comparatively significant improvement over the case without indexes while only consuming disk space of modest size.

  • WAIM - Structural Map: A New Index for Efficient XML Path Expression Processing
    Advances in Web-Age Information Management, 2002
    Co-Authors: Shihui Zheng, Aoying Zhou, Long Zhang, Han Tao
    Abstract:

    XML is merging as the main standard of presentation and exchange on the Internet. Various XML query languages are proposed. As a ubiquitous part of those XML query languages, Path Expression poses a new challenge for efficient XML query processing in database systems. In this paper, we present a new index structure, Structural Map, for efficient evaluation of Path Expression queries. Structural Map can be used to support Path-indices as well as valueindices in an integrated manner. It is powerful to handle not only simple Path Expressions but also the branching Path Expressions and Path Expressions with predicates efficiently. Performance study shows the Structural Map can improve the performance of XML Path Expression processing significantly.

  • structural map a new index for efficient xml Path Expression processing
    Web-Age Information Management, 2002
    Co-Authors: Shihui Zheng, Aoying Zhou, Long Zhang, Han Tao
    Abstract:

    XML is merging as the main standard of presentation and exchange on the Internet. Various XML query languages are proposed. As a ubiquitous part of those XML query languages, Path Expression poses a new challenge for efficient XML query processing in database systems. In this paper, we present a new index structure, Structural Map, for efficient evaluation of Path Expression queries. Structural Map can be used to support Path-indices as well as valueindices in an integrated manner. It is powerful to handle not only simple Path Expressions but also the branching Path Expressions and Path Expressions with predicates efficiently. Performance study shows the Structural Map can improve the performance of XML Path Expression processing significantly.

Chunxia Tang - One of the best experts on this subject based on the ideXlab platform.

  • indexing xml data for Path Expression queries
    Lecture Notes in Computer Science, 2004
    Co-Authors: Chunxia Tang
    Abstract:

    Many XML query languages use regular Path Expressions to query XML data. Retrieval of XML data is like retrieval of relational databases, but the difference is that relational data are stored in 2-D flat tables whereas XML data are organized in a tree-like structure. Hence, fist tree traversal is a key to XML query processing. This is commonly accomplished using indexing. In this paper, we present a design and implementation of an efficient and quick solution to indexing XML data. Our approach is based on a numbering scheme that encodes the XML elements for not only indexing to the elements but also for quick determination of the ancestor-descendant relationship between elements in the tree hierarchy. This approach also allows efficiently inserting and updating the index system, which is an improvement over several existing XML tree node numbering methods.

  • SERA - Indexing XML Data for Path Expression Queries
    Software Engineering Research and Applications, 2004
    Co-Authors: Chunxia Tang
    Abstract:

    Many XML query languages use regular Path Expressions to query XML data. Retrieval of XML data is like retrieval of relational databases, but the difference is that relational data are stored in 2-D flat tables whereas XML data are organized in a tree-like structure. Hence, fast tree traversal is a key to XML query processing. This is commonly accomplished using indexing. In this paper, we present a design and implementation of an efficient and quick solution to indexing XML data. Our approach is based on a numbering scheme that encodes the XML elements for not only indexing to the elements but also for quick determination of the ancestor-descendant relationship between elements in the tree hierarchy. This approach also allows efficiently inserting and updating the index system, which is an improvement over several existing XML tree node numbering methods.

J.f. Naughton - One of the best experts on this subject based on the ideXlab platform.

  • ICDE - Recursive XML schemas, recursive XML queries, and relational storage: XML-to-SQL query translation
    Proceedings. 20th International Conference on Data Engineering, 2004
    Co-Authors: Rajasekar Krishnamurthy, Venkatesan T. Chakaravarthy, Raghav Kaushik, J.f. Naughton
    Abstract:

    We consider the problem of translating XML queries into SQL when XML documents have been stored in an RDBMS using a schema-based relational decomposition. Surprisingly, there is no published XML-to-SQL query translation algorithm for this scenario that handles recursive XML schemas. We present a generic algorithm to translate Path Expression queries into SQL in the presence of recursion in the schema and queries. This algorithm handles a general class of XML-to-relational mappings, which includes all techniques proposed in literature. Some of the salient features of this algorithm are: (i) It translates a Path Expression query into a single SQL query, irrespective of how complex the XML schema is, (ii) It uses the "with" clause in SQL99 to handle recursive queries even over nonrecursive schemas, (iii) It reconstructs recursive XML subtrees with a single SQL query and (iv) It shows that the support for linear recursion in SQL99 is sufficient for handling Path Expression queries over arbitrarily complex recursive XML schema.