Interpretable tabular data generation

نویسندگان

چکیده

Abstract Generative adversarial network () models have been successfully utilized in a wide range of machine learning applications, and tabular data generation domain is not an exception. Notably, some state-of-the-art generation, such as , etc. are based on models. Even though these resulted superior performance generating artificial when trained datasets, there lot room (and desire) for improvement. Not to mention that existing methods do weaknesses other than performance. For example, the current focus only model, limited emphasis given interpretation model. Secondly, operate raw features only, hence they fail exploit any prior knowledge explicit feature interactions can be during process. To alleviate two above-mentioned limitations, this work, we propose novel model— G enerative A dversarial Network modelling inspired from N aive B ayes L ogistic R egression’s relationship ( $${ { \texttt {GANBLR} } }$$ GANBLR ), which address limitation -based but provides capability handle well. Through extensive evaluations demonstrate ’s well better interpretable (explanation importance synthetic process) compared

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Tabular Data Cleaning and Linked Data Generation with Grafterizer

Over the past several years the amount of published open data has increased significantly. The majority of this is tabular data, that requires powerful and flexible approaches for data cleaning and preparation in order to convert it into Linked Data. This paper introduces Grafterizer – a software framework developed to support data workers and data developers in the process of converting raw ta...

متن کامل

Statistical Natural Language Generation from Tabular Non-textual Data

Most of the existing natural language generation (NLG) techniques employing statistical methods are typically resource and time intensive. On the other hand, handcrafted rulebased and template-based NLG systems typically require significant human/designer efforts. In this paper, we proposed a statistical NLG technique which does not require any semantic relational knowledge and takes much less ...

متن کامل

Automatic ontology generation from Web tabular structures

Turning the current Web into a Semantic Web requires automatic approaches for document annotation, since manual approaches will not scale in general. The focus of the thesis is on automatic transformation of arbitrary table-like structures into knowledge models, i.e. ontologies. The presented work is based on Hurst’s table model and consists of a methodology, an accompanying implementation name...

متن کامل

Tabular Code Generation: Write Once, Generate Many

PressPot is a system that adds annotations to Java .class files. [3] These annotations are used by an “annotation-aware” Java Virtual Machine (JVM) which uses these annotations to generate high-quality machine code quickly. If these annotated .class files are sent to a normal JVM, they are ignored and the program runs normally. One of the annotations, used for assigning machine registers, is ca...

متن کامل

Software for tabular data protection.

In order for national statistical offices to maintain the trust of the public to collect data and publish statistics of importance to society and decision-making, it is imperative that respondents (persons or establishments) be guaranteed privacy and confidentiality in return for providing requested confidential data. Consequently, for most survey and census data, disclosure limitation techniqu...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Knowledge and Information Systems

سال: 2023

ISSN: ['0219-3116', '0219-1377']

DOI: https://doi.org/10.1007/s10115-023-01834-5