Singapore Institute of Technology
Browse
3610543.3626168.pdf (38.11 MB)

SynthDa: Exploiting Existing Real-World Data for Usable and Accessible Synthetic Data Generation

Download (38.11 MB)
conference contribution
posted on 2023-12-05, 02:18 authored by Megani Rajendran, Chek Tien Tan, Indriyati AtmosukartoIndriyati Atmosukarto, Aik Beng Ng, Zhihua Zhou, Andrew GrantAndrew Grant, Simon See

Acquiring real-world data for computer vision presents challenges such as data scarcity, high costs, and privacy concerns. We introduce SynthDa, an automated approach for usable synthetic data generation (SDG) that empowers users with varying expertise to create diverse synthetic data from existing real-world datasets. It combines pose estimation, synthetic scene creation, and domain randomization to offer data variants. Ease of SDG through SynthDa enables different permutations and combinations of synthetic data that allow users to explore efficacy of various data configurations in relation to their specific AI tasks. Our experiments across multiple existing datasets and models demonstrate the utility of SynthDa in challenging nuances such as the “more data, the better” paradigm; revealing that excessive synthetic data may degrade performance and vice versa. In a pilot user study with 24 participants, we show the perceived usefulness of SynthDa as a promising SDG tool for overcoming challenges related to real-world data acquisition.

History

Journal/Conference/Book title

SA '23: SIGGRAPH Asia 2023 Technical Communications

Publication date

2023-11-28

Version

  • Published

Usage metrics

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC