couplet

class Couplet(lazy=None, name=None, **config)[source]

Bases: paddlenlp.datasets.dataset.DatasetBuilder

Couplet dataset. The data comes from the GitHub repository https://github.com/v-zich/couplet-clean-dataset, which filters dirty data out of the original repository, https://github.com/wb14123/couplet-dataset.

class META_INFO(src_file, tgt_file, src_md5, tgt_md5)

Bases: tuple

property src_file

Alias for field number 0

property src_md5

Alias for field number 2

property tgt_file

Alias for field number 1

property tgt_md5

Alias for field number 3
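
Because META_INFO derives from tuple, each property above is a named alias for a positional field. The following is a minimal sketch of that behavior using the standard library's `collections.namedtuple`. It assumes the field order given in the signature; the file names and checksum strings are hypothetical placeholders, not the values used by the dataset.

```python
from collections import namedtuple

# Field order follows the documented signature:
# META_INFO(src_file, tgt_file, src_md5, tgt_md5)
META_INFO = namedtuple("META_INFO", ("src_file", "tgt_file", "src_md5", "tgt_md5"))

# Hypothetical entry: placeholder file names and checksums, for illustration only.
info = META_INFO("train_src.tsv", "train_tgt.tsv", "<src md5>", "<tgt md5>")

# Each property is an alias for a field number.
assert info.src_file == info[0]
assert info.tgt_file == info[1]
assert info.src_md5 == info[2]
assert info.tgt_md5 == info[3]

# Like any tuple, an entry can be unpacked positionally.
src_file, tgt_file, src_md5, tgt_md5 = info
```

Access by name keeps calling code readable, while positional unpacking remains available where the whole entry is consumed at once.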

get_vocab()[source]

Return the vocab file path of the dataset, if one is specified.