Interpretable tabular data generation
نویسندگان
چکیده
Abstract Generative adversarial network () models have been successfully utilized in a wide range of machine learning applications, and tabular data generation domain is not an exception. Notably, some state-of-the-art generation, such as , etc. are based on models. Even though these resulted superior performance generating artificial when trained datasets, there lot room (and desire) for improvement. Not to mention that existing methods do weaknesses other than performance. For example, the current focus only model, limited emphasis given interpretation model. Secondly, operate raw features only, hence they fail exploit any prior knowledge explicit feature interactions can be during process. To alleviate two above-mentioned limitations, this work, we propose novel model— G enerative A dversarial Network modelling inspired from N aive B ayes L ogistic R egression’s relationship ( $${ { \texttt {GANBLR} } }$$ GANBLR ), which address limitation -based but provides capability handle well. Through extensive evaluations demonstrate ’s well better interpretable (explanation importance synthetic process) compared
منابع مشابه
Tabular Data Cleaning and Linked Data Generation with Grafterizer
Over the past several years the amount of published open data has increased significantly. The majority of this is tabular data, that requires powerful and flexible approaches for data cleaning and preparation in order to convert it into Linked Data. This paper introduces Grafterizer – a software framework developed to support data workers and data developers in the process of converting raw ta...
متن کاملStatistical Natural Language Generation from Tabular Non-textual Data
Most of the existing natural language generation (NLG) techniques employing statistical methods are typically resource and time intensive. On the other hand, handcrafted rulebased and template-based NLG systems typically require significant human/designer efforts. In this paper, we proposed a statistical NLG technique which does not require any semantic relational knowledge and takes much less ...
متن کاملAutomatic ontology generation from Web tabular structures
Turning the current Web into a Semantic Web requires automatic approaches for document annotation, since manual approaches will not scale in general. The focus of the thesis is on automatic transformation of arbitrary table-like structures into knowledge models, i.e. ontologies. The presented work is based on Hurst’s table model and consists of a methodology, an accompanying implementation name...
متن کاملTabular Code Generation: Write Once, Generate Many
PressPot is a system that adds annotations to Java .class files. [3] These annotations are used by an “annotation-aware” Java Virtual Machine (JVM) which uses these annotations to generate high-quality machine code quickly. If these annotated .class files are sent to a normal JVM, they are ignored and the program runs normally. One of the annotations, used for assigning machine registers, is ca...
متن کاملSoftware for tabular data protection.
In order for national statistical offices to maintain the trust of the public to collect data and publish statistics of importance to society and decision-making, it is imperative that respondents (persons or establishments) be guaranteed privacy and confidentiality in return for providing requested confidential data. Consequently, for most survey and census data, disclosure limitation techniqu...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Knowledge and Information Systems
سال: 2023
ISSN: ['0219-3116', '0219-1377']
DOI: https://doi.org/10.1007/s10115-023-01834-5