【華】Youtube節目華語字幕語料庫(Youtube Clip Chinese Subtitle Corpus)
This action may take several minutes for large corpora, please wait.

【華】Youtube節目華語字幕語料庫(Youtube Clip Chinese Subtitle Corpus)

youtube_corpus

Counts
Tokens3673337
Words3574577
Sentences623778
General info
Corpus description Document
LanguageChinese
EncodingUTF-8
Compiled03/17/2022 12:46:33
Tagset Description
Lexicon sizes
word74361
tag60
Tags legend
adjectiveA|VH
adverbD.*
conjunctionC.*
determinerNe.*
nounNa|Nb|Nc|Ncd|Nd|Nf|Nh|Nv
prepositionP
pronounNhaa|Nhab|Nhb|Nhc
verbV.*
Lempos suffixes
adjective-j
adverb-a
conjunction-c
noun-n
preposition-p
pronoun-d
verb-v

Structures and attributes

Subcorpora statistics

Subcorpus Tokens Words %
a 15289 ~ 14877 0.416215555502