Terminal Symbol

The Experts below are selected from a list of 1737 Experts worldwide ranked by ideXlab platform

Stephen Kwek - One of the best experts on this subject based on the ideXlab platform.

on learning unions of pattern languages and tree patterns

Algorithmic Learning Theory, 1999

Co-Authors: Sally A Goldman, Stephen Kwek

Abstract:

We present efficient on-line algorithms for learning unions of a constant number of tree patterns, unions of a constant number of one-variable pattern languages, and unions of a constant number of pattern languages with fixed length substitutions. By fixed length substitutions we mean that each occurence of variable xi must be substituted by Terminal strings of fixed length l(xi). We prove that if an arbitrary unions of pattern languages with fixed length substitutions can be learned efficiently then DNFs are efficiently learnable in the mistake bound model. Since we use a reduction to Winnow, our algorithms are robust against attribute noise. Furthermore, they can be modified to handle concept drift. Also, our approach is quite general and may be applicable to learning other pattern related classes. For example, we could learn a more general pattern language class in which a penalty (i.e. weight) is assigned to each violation of the rule that a Terminal Symbol cannot be changed or that a pair of variable Symbols, of the same variable, must be substituted by the same Terminal string. An instance is positive iff the penalty incurred for violating these rules is below a given tolerable threshold.

15 days free trial to Access Article
ALT - On Learning Unions of Pattern Languages and Tree Patterns

Lecture Notes in Computer Science, 1999

Co-Authors: Sally A Goldman, Stephen Kwek

Abstract:

We present efficient on-line algorithms for learning unions of a constant number of tree patterns, unions of a constant number of one-variable pattern languages, and unions of a constant number of pattern languages with fixed length substitutions. By fixed length substitutions we mean that each occurence of variable xi must be substituted by Terminal strings of fixed length l(xi). We prove that if an arbitrary unions of pattern languages with fixed length substitutions can be learned efficiently then DNFs are efficiently learnable in the mistake bound model. Since we use a reduction to Winnow, our algorithms are robust against attribute noise. Furthermore, they can be modified to handle concept drift. Also, our approach is quite general and may be applicable to learning other pattern related classes. For example, we could learn a more general pattern language class in which a penalty (i.e. weight) is assigned to each violation of the rule that a Terminal Symbol cannot be changed or that a pair of variable Symbols, of the same variable, must be substituted by the same Terminal string. An instance is positive iff the penalty incurred for violating these rules is below a given tolerable threshold.

15 days free trial to Access Article

Sally A Goldman - One of the best experts on this subject based on the ideXlab platform.

on learning unions of pattern languages and tree patterns

Algorithmic Learning Theory, 1999

Co-Authors: Sally A Goldman, Stephen Kwek

Abstract:

We present efficient on-line algorithms for learning unions of a constant number of tree patterns, unions of a constant number of one-variable pattern languages, and unions of a constant number of pattern languages with fixed length substitutions. By fixed length substitutions we mean that each occurence of variable xi must be substituted by Terminal strings of fixed length l(xi). We prove that if an arbitrary unions of pattern languages with fixed length substitutions can be learned efficiently then DNFs are efficiently learnable in the mistake bound model. Since we use a reduction to Winnow, our algorithms are robust against attribute noise. Furthermore, they can be modified to handle concept drift. Also, our approach is quite general and may be applicable to learning other pattern related classes. For example, we could learn a more general pattern language class in which a penalty (i.e. weight) is assigned to each violation of the rule that a Terminal Symbol cannot be changed or that a pair of variable Symbols, of the same variable, must be substituted by the same Terminal string. An instance is positive iff the penalty incurred for violating these rules is below a given tolerable threshold.

15 days free trial to Access Article
ALT - On Learning Unions of Pattern Languages and Tree Patterns

Lecture Notes in Computer Science, 1999

Co-Authors: Sally A Goldman, Stephen Kwek

Abstract:

We present efficient on-line algorithms for learning unions of a constant number of tree patterns, unions of a constant number of one-variable pattern languages, and unions of a constant number of pattern languages with fixed length substitutions. By fixed length substitutions we mean that each occurence of variable xi must be substituted by Terminal strings of fixed length l(xi). We prove that if an arbitrary unions of pattern languages with fixed length substitutions can be learned efficiently then DNFs are efficiently learnable in the mistake bound model. Since we use a reduction to Winnow, our algorithms are robust against attribute noise. Furthermore, they can be modified to handle concept drift. Also, our approach is quite general and may be applicable to learning other pattern related classes. For example, we could learn a more general pattern language class in which a penalty (i.e. weight) is assigned to each violation of the rule that a Terminal Symbol cannot be changed or that a pair of variable Symbols, of the same variable, must be substituted by the same Terminal string. An instance is positive iff the penalty incurred for violating these rules is below a given tolerable threshold.

15 days free trial to Access Article

Masayuki Takeda - One of the best experts on this subject based on the ideXlab platform.

Linear-Time text compression by longest-first substitution

Algorithms, 2009

Co-Authors: Ryosuke Nakamura, Shunsuke Inenaga, Hideo Bannai, Takashi Funamoto, Masayuki Takeda, Ayumi Shinohara

Abstract:

We consider grammar-based text compression with longest first substitution (LFS), where non-overlapping occurrences of a longest repeating factor of the input text are replaced by a new non-Terminal Symbol. We present the first linear-time algorithm for LFS. Our algorithm employs a new data structure called sparse lazy suffix trees. We also deal with a more sophisticated version of LFS, called LFS2, that allows better compression. The first linear-time algorithm for LFS2 is also presented.

15 days free trial to Access Article
DCC - Simple Linear-Time Off-Line Text Compression by Longest-First Substitution

2007 Data Compression Conference (DCC'07), 2007

Co-Authors: Ryosuke Nakamura, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda

Abstract:

We consider grammar based text compression with longest-first substitution, where non-overlapping occurrences of a longest repeating substring of the input text are replaced by a new non-Terminal Symbol. We present a new text compression algorithm by simplifying the algorithm presented in S. Inenaga et al., (2003). We give a new formulation of the correctness proof introducing the sparse lazy suffix tree data structure. We also present another type of longest-first substitution strategy that allows better compression. We show results of preliminary experiments comparing grammar sizes of the two versions of the longest-first strategy and the most frequent strategy

15 days free trial to Access Article

Ryosuke Nakamura - One of the best experts on this subject based on the ideXlab platform.

Linear-Time text compression by longest-first substitution

Algorithms, 2009

Co-Authors: Ryosuke Nakamura, Shunsuke Inenaga, Hideo Bannai, Takashi Funamoto, Masayuki Takeda, Ayumi Shinohara

Abstract:

We consider grammar-based text compression with longest first substitution (LFS), where non-overlapping occurrences of a longest repeating factor of the input text are replaced by a new non-Terminal Symbol. We present the first linear-time algorithm for LFS. Our algorithm employs a new data structure called sparse lazy suffix trees. We also deal with a more sophisticated version of LFS, called LFS2, that allows better compression. The first linear-time algorithm for LFS2 is also presented.

15 days free trial to Access Article
DCC - Simple Linear-Time Off-Line Text Compression by Longest-First Substitution

2007 Data Compression Conference (DCC'07), 2007

Co-Authors: Ryosuke Nakamura, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda

Abstract:

We consider grammar based text compression with longest-first substitution, where non-overlapping occurrences of a longest repeating substring of the input text are replaced by a new non-Terminal Symbol. We present a new text compression algorithm by simplifying the algorithm presented in S. Inenaga et al., (2003). We give a new formulation of the correctness proof introducing the sparse lazy suffix tree data structure. We also present another type of longest-first substitution strategy that allows better compression. We show results of preliminary experiments comparing grammar sizes of the two versions of the longest-first strategy and the most frequent strategy

15 days free trial to Access Article

Hideo Bannai - One of the best experts on this subject based on the ideXlab platform.

Linear-Time text compression by longest-first substitution

Algorithms, 2009

Co-Authors: Ryosuke Nakamura, Shunsuke Inenaga, Hideo Bannai, Takashi Funamoto, Masayuki Takeda, Ayumi Shinohara

Abstract:

We consider grammar-based text compression with longest first substitution (LFS), where non-overlapping occurrences of a longest repeating factor of the input text are replaced by a new non-Terminal Symbol. We present the first linear-time algorithm for LFS. Our algorithm employs a new data structure called sparse lazy suffix trees. We also deal with a more sophisticated version of LFS, called LFS2, that allows better compression. The first linear-time algorithm for LFS2 is also presented.

15 days free trial to Access Article
DCC - Simple Linear-Time Off-Line Text Compression by Longest-First Substitution

2007 Data Compression Conference (DCC'07), 2007

Co-Authors: Ryosuke Nakamura, Shunsuke Inenaga, Hideo Bannai, Masayuki Takeda

Abstract:

We consider grammar based text compression with longest-first substitution, where non-overlapping occurrences of a longest repeating substring of the input text are replaced by a new non-Terminal Symbol. We present a new text compression algorithm by simplifying the algorithm presented in S. Inenaga et al., (2003). We give a new formulation of the correctness proof introducing the sparse lazy suffix tree data structure. We also present another type of longest-first substitution strategy that allows better compression. We show results of preliminary experiments comparing grammar sizes of the two versions of the longest-first strategy and the most frequent strategy

15 days free trial to Access Article

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

Stephen Kwek - One of the best experts on this subject based on the ideXlab platform.

on learning unions of pattern languages and tree patterns

ALT - On Learning Unions of Pattern Languages and Tree Patterns

Sally A Goldman - One of the best experts on this subject based on the ideXlab platform.

on learning unions of pattern languages and tree patterns

ALT - On Learning Unions of Pattern Languages and Tree Patterns

Masayuki Takeda - One of the best experts on this subject based on the ideXlab platform.

Linear-Time text compression by longest-first substitution

DCC - Simple Linear-Time Off-Line Text Compression by Longest-First Substitution

Ryosuke Nakamura - One of the best experts on this subject based on the ideXlab platform.

Linear-Time text compression by longest-first substitution

DCC - Simple Linear-Time Off-Line Text Compression by Longest-First Substitution

Hideo Bannai - One of the best experts on this subject based on the ideXlab platform.

Linear-Time text compression by longest-first substitution

DCC - Simple Linear-Time Off-Line Text Compression by Longest-First Substitution

Terminal Symbol

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

Related terms

Stephen Kwek - One of the best experts on this subject based on the ideXlab platform.

Sally A Goldman - One of the best experts on this subject based on the ideXlab platform.

Masayuki Takeda - One of the best experts on this subject based on the ideXlab platform.

Ryosuke Nakamura - One of the best experts on this subject based on the ideXlab platform.

Hideo Bannai - One of the best experts on this subject based on the ideXlab platform.

Related terms