Sanskrit

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 17064 Experts worldwide ranked by ideXlab platform

Pawan Goyal - One of the best experts on this subject based on the ideXlab platform.

  • analysis of Sanskrit text parsing and semantic relations
    Sanskrit Computational Linguistics, 2009
    Co-Authors: Pawan Goyal, Vipul Arora, Laxmidhar Behera
    Abstract:

    In this paper, we are presenting our work towards building a dependency parser for Sanskrit language that uses deterministic finite automata(DFA) for morphological analysis and 'utsarga apavaada' approach for relation analysis. A computational grammar based on the framework of Panini is being developed. A linguistic generalization for Verbal and Nominal database has been made and declensions are given the form of DFA. Verbal database for all the class of verbs have been completed for this part. Given a Sanskrit text, the parser identifies the root words and gives the dependency relations based on semantic constraints. The proposed Sanskrit parser is able to create semantic nets for many classes of Sanskrit paragraphs(*********************). The parser is taking care of both external and internal sandhi in the Sanskrit words.

  • Sanskrit Computational Linguistics - Analysis of Sanskrit Text: Parsing and Semantic Relations
    Lecture Notes in Computer Science, 2009
    Co-Authors: Pawan Goyal, Vipul Arora, Laxmidhar Behera
    Abstract:

    In this paper, we are presenting our work towards building a dependency parser for Sanskrit language that uses deterministic finite automata(DFA) for morphological analysis and 'utsarga apavaada' approach for relation analysis. A computational grammar based on the framework of Panini is being developed. A linguistic generalization for Verbal and Nominal database has been made and declensions are given the form of DFA. Verbal database for all the class of verbs have been completed for this part. Given a Sanskrit text, the parser identifies the root words and gives the dependency relations based on semantic constraints. The proposed Sanskrit parser is able to create semantic nets for many classes of Sanskrit paragraphs(*********************). The parser is taking care of both external and internal sandhi in the Sanskrit words.

  • Sanskrit Computational Linguistics - A Study towards Design of an English to Sanskrit Machine Translation System
    Lecture Notes in Computer Science, 2009
    Co-Authors: Pawan Goyal, R. Mahesh K. Sinha
    Abstract:

    We are experimenting to examine how AnglaBharati system designed to translate English to Indian languages could be adapted for translation to Sanskrit. The main contribution of our work is demonstration of machine translation of English to Sanskrit for simple sentences based on PLIL generated by AnglaBharati and A****** ***dhy*** yī rules. Presently our translation system caters to affirmative, negative, interrogative, imperative, active and passive voice sentences. In our study, we have selected a set of nouns and verbs that represent different semantic categories besides a few adverbs and adjectives. We anticipate using a number of Sanskrit resources on A****** ***dhy*** yī and morphological synthesis [2].

Brendan S. Gillon - One of the best experts on this subject based on the ideXlab platform.

Oliver Hellwig - One of the best experts on this subject based on the ideXlab platform.

  • dating Sanskrit texts using linguistic features and neural networks
    Indogermanische Forschungen, 2019
    Co-Authors: Oliver Hellwig
    Abstract:

    Deriving historical dates or datable stratifications for texts in Classical Sanskrit, such as the epics Mahābhārata and Rāmāyaṇa, is a considerable challenge for text-historical research. This paper provides empirical evidence for subtle but noticeable diachronic changes in the fundamental linguistic structures of Classical Sanskrit, and argues that Classical Sanskrit shows enough diachronic variation for dating texts on the basis of linguistic developments. Building on this evidence, it evaluates machine learning algorithms that predict approximate dates of composition for Sanskrit texts. The paper introduces the required background, discusses the relevance of linguistic features for temporal classification, and presents a text-historical evaluation of Book 6 of the Mahābhārata, whose historical stratification is disputed in Indological research.

  • Sanskrit Computational Linguistics - Performance of a Lexical and POS Tagger for Sanskrit
    Lecture Notes in Computer Science, 2010
    Co-Authors: Oliver Hellwig
    Abstract:

    Due to the phonetic, morphological, and lexical complexity of Sanskrit, the automatic analysis of this language is a real challenge in the area of natural language processing. The paper describes a series of tests that were performed to assess the accuracy of the tagging program SanskritTagger. To our knowlegde, it offers the first reliable benchmark data for evaluating the quality of taggers for Sanskrit using an unrestricted dictionary and texts from different domains. Based on a detailed analysis of the test results, the paper points out possible directions for future improvements of statistical tagging procedures for Sanskrit.

  • Etymological trends in the Sanskrit vocabulary
    Literary and Linguistic Computing, 2009
    Co-Authors: Oliver Hellwig
    Abstract:

    The article examines how the etymological composition of the Sanskrit lexicon is influenced by time and whether this composition can be used to date Sanskrit texts automatically. For this purpose, statistical tests are applied to a corpus of lexically analyzed texts. Results reported in the article may contribute to the diachronic lexicography of Sanskrit and help to develop computational methods for analyzing anonymous and undated Sanskrit texts.

  • SanskritTagger : a stochastic lexical and pos tagger for Sanskrit
    2007
    Co-Authors: Oliver Hellwig
    Abstract:

    SanskritTagger is a stochastic tagger for unpreprocessed Sanskrit text. The tagger tokenises text with a Markov model and performs part-of-speech tagging with a Hidden Markov model. Parameters for these processes are estimated from a manually annotated corpus of currently about 1.500.000 words. The article sketches the tagging process, reports the results of tagging a few short passages of Sanskrit text and describes further improvements of the program. The article describes design and function of SanskritTagger, a tokeniser and part-of-speech (POS) tagger, which analyses ”natural”, i.e. unannotated Sanskrit text by repeated application of stochastic models. This tagger has been developped during the last few years as part of a larger project for digitalisation of Sanskrit texts (cmp. (Hellwig, 2002)) and is still in the state of steady improvement. The article is organised as follows: Section 1 gives a short overview about linguistic problems found in Sanskrit texts which influenced the design of the tagger. Section 2 describes the actual implementation of the tagger. In section 3, the performance of the tagger is evaluated on short passages of text from different thematic areas. In addition, this section describes possible improvements in future versions.

R. Mahesh K. Sinha - One of the best experts on this subject based on the ideXlab platform.

  • Sanskrit Computational Linguistics - A Study towards Design of an English to Sanskrit Machine Translation System
    Lecture Notes in Computer Science, 2009
    Co-Authors: Pawan Goyal, R. Mahesh K. Sinha
    Abstract:

    We are experimenting to examine how AnglaBharati system designed to translate English to Indian languages could be adapted for translation to Sanskrit. The main contribution of our work is demonstration of machine translation of English to Sanskrit for simple sentences based on PLIL generated by AnglaBharati and A****** ***dhy*** yī rules. Presently our translation system caters to affirmative, negative, interrogative, imperative, active and passive voice sentences. In our study, we have selected a set of nouns and verbs that represent different semantic categories besides a few adverbs and adjectives. We anticipate using a number of Sanskrit resources on A****** ***dhy*** yī and morphological synthesis [2].

Laxmidhar Behera - One of the best experts on this subject based on the ideXlab platform.

  • analysis of Sanskrit text parsing and semantic relations
    Sanskrit Computational Linguistics, 2009
    Co-Authors: Pawan Goyal, Vipul Arora, Laxmidhar Behera
    Abstract:

    In this paper, we are presenting our work towards building a dependency parser for Sanskrit language that uses deterministic finite automata(DFA) for morphological analysis and 'utsarga apavaada' approach for relation analysis. A computational grammar based on the framework of Panini is being developed. A linguistic generalization for Verbal and Nominal database has been made and declensions are given the form of DFA. Verbal database for all the class of verbs have been completed for this part. Given a Sanskrit text, the parser identifies the root words and gives the dependency relations based on semantic constraints. The proposed Sanskrit parser is able to create semantic nets for many classes of Sanskrit paragraphs(*********************). The parser is taking care of both external and internal sandhi in the Sanskrit words.

  • Sanskrit Computational Linguistics - Analysis of Sanskrit Text: Parsing and Semantic Relations
    Lecture Notes in Computer Science, 2009
    Co-Authors: Pawan Goyal, Vipul Arora, Laxmidhar Behera
    Abstract:

    In this paper, we are presenting our work towards building a dependency parser for Sanskrit language that uses deterministic finite automata(DFA) for morphological analysis and 'utsarga apavaada' approach for relation analysis. A computational grammar based on the framework of Panini is being developed. A linguistic generalization for Verbal and Nominal database has been made and declensions are given the form of DFA. Verbal database for all the class of verbs have been completed for this part. Given a Sanskrit text, the parser identifies the root words and gives the dependency relations based on semantic constraints. The proposed Sanskrit parser is able to create semantic nets for many classes of Sanskrit paragraphs(*********************). The parser is taking care of both external and internal sandhi in the Sanskrit words.