Synthetic Transaction

14,000,000 Leading Edge Experts on the ideXlab platform

Scan Science and Technology

Contact Leading Edge Experts & Companies

Scan Science and Technology

Contact Leading Edge Experts & Companies

The Experts below are selected from a list of 1512 Experts worldwide ranked by ideXlab platform

Versteeg Steve - One of the best experts on this subject based on the ideXlab platform.

  • GHTraffic: A Dataset for Reproducible Research in Service-Oriented Computing
    2018
    Co-Authors: Bhagya Thilini, Dietrich Jens, Guesgen Hans, Versteeg Steve
    Abstract:

    We present GHTraffic, a dataset of significant size comprising HTTP Transactions extracted from GitHub data and augmented with Synthetic Transaction data. The dataset facilitates reproducible research on many aspects of service-oriented computing. This paper discusses use cases for such a dataset and extracts a set of requirements from these use cases. We then discuss the design of GHTraffic, and the methods and tool used to construct it. We conclude our contribution with some selective metrics that characterise GHTraffic.Comment: 8 pages, 5 figure

Steve Versteeg - One of the best experts on this subject based on the ideXlab platform.

  • GHTraffic: A Dataset for Reproducible Research in Service-Oriented Computing
    2018
    Co-Authors: Thilini Bhagya, Jens Dietrich, Hans Guesgen, Steve Versteeg
    Abstract:

    We present GHTraffic, a dataset of significant size comprising HTTP Transactions extracted from GitHub data (i.e., from 04 August 2015 GHTorrent issues snapshot) and augmented with Synthetic Transaction data. This dataset facilitates reproducible research on many aspects of service-oriented computing. The GHTraffic dataset comprises three different editions: Small (S), Medium (M) and Large (L). The S dataset includes HTTP Transaction records created from google/guava repository. Guava is a popular Java library containing utilities and data structures. The M dataset includes records from the npm/npm project. It is the popular de-facto standard package manager for JavaScript. The L dataset contains data that were created by selecting eight repositories containing large and very active projects, including twbs/bootstrap, symfony/symfony, docker/docker, Homebrew/homebrew, rust-lang/rust, kubernetes/kubernetes, rails/rails, and angular/angular.js. We also provide access to the scripts used to generate GHTraffic. Using these scripts, users can modify the configuration properties in the config.properties file in order to create a customised version of GHTraffic datasets for their own use. The readme.md file included in the distribution provides further information on how to build the code and run the scripts. The GHTraffic scripts can be accessed by downloading the pre-configured VirtualBox image or by cloning the repository

Bhagya Thilini - One of the best experts on this subject based on the ideXlab platform.

  • GHTraffic: A Dataset for Reproducible Research in Service-Oriented Computing
    2018
    Co-Authors: Bhagya Thilini, Dietrich Jens, Guesgen Hans, Versteeg Steve
    Abstract:

    We present GHTraffic, a dataset of significant size comprising HTTP Transactions extracted from GitHub data and augmented with Synthetic Transaction data. The dataset facilitates reproducible research on many aspects of service-oriented computing. This paper discusses use cases for such a dataset and extracts a set of requirements from these use cases. We then discuss the design of GHTraffic, and the methods and tool used to construct it. We conclude our contribution with some selective metrics that characterise GHTraffic.Comment: 8 pages, 5 figure

Axelsson Stefan - One of the best experts on this subject based on the ideXlab platform.

  • Social simulation of commercial and financial behaviour for fraud detection research
    2021
    Co-Authors: Lopez-rojas, Edgar Alonso, Axelsson Stefan
    Abstract:

    We present a social simulation model that covers three main financial services: Banks, Retail Stores, and Payments systems. Our aim is to address the problem of a lack of public data sets for fraud detection research in each of these domains, and provide a variety of fraud scenarios such as money laundering, sales fraud (based on refunds and discounts), and credit card fraud. Currently, there is a general lack of public research concerning fraud detection in the financial domains in general and these three in particular. One reason for this is the secrecy and sensitivity of the customers data that is needed to perform research. We present PaySim, RetSim, and BankSim as three case studies of social simulations for financial Transactions using agent-based modelling. These simulators enable us to generate Synthetic Transaction data of normal behaviour of customers, and also known fraudulent behaviour. This Synthetic data can be used to further advance fraud detection research, without leaking sensitive information about the underlying data. Using statistics and social network analysis (SNA) on real data we can calibrate the relations between staff and customers, and generate realistic Synthetic data sets. The generated data represents real world scenarios that are found in the original data with the added benefit that this data can be shared with other researchers for testing similar detection methods without concerns for privacy and other restrictions present when using the original data

Thilini Bhagya - One of the best experts on this subject based on the ideXlab platform.

  • GHTraffic: A Dataset for Reproducible Research in Service-Oriented Computing
    2018
    Co-Authors: Thilini Bhagya, Jens Dietrich, Hans Guesgen, Steve Versteeg
    Abstract:

    We present GHTraffic, a dataset of significant size comprising HTTP Transactions extracted from GitHub data (i.e., from 04 August 2015 GHTorrent issues snapshot) and augmented with Synthetic Transaction data. This dataset facilitates reproducible research on many aspects of service-oriented computing. The GHTraffic dataset comprises three different editions: Small (S), Medium (M) and Large (L). The S dataset includes HTTP Transaction records created from google/guava repository. Guava is a popular Java library containing utilities and data structures. The M dataset includes records from the npm/npm project. It is the popular de-facto standard package manager for JavaScript. The L dataset contains data that were created by selecting eight repositories containing large and very active projects, including twbs/bootstrap, symfony/symfony, docker/docker, Homebrew/homebrew, rust-lang/rust, kubernetes/kubernetes, rails/rails, and angular/angular.js. We also provide access to the scripts used to generate GHTraffic. Using these scripts, users can modify the configuration properties in the config.properties file in order to create a customised version of GHTraffic datasets for their own use. The readme.md file included in the distribution provides further information on how to build the code and run the scripts. The GHTraffic scripts can be accessed by downloading the pre-configured VirtualBox image or by cloning the repository