degeneration
2022-08-17 17:39:29 0 举报
AI智能生成
a map illustrating the main idea about NLG degeneration
作者其他创作
大纲/内容
learning-based
CTRL-baseline
mentioned fine-grained control
train on seq of raw text prepended with control codes
famous for story generation
unlikelihood training-baseline
new loss
seq-level & token-level
simCTG
new loss
CTLoss
new loss
decoding-based
a.k.a. guided decoding
a.k.a. guided decoding
basic-baseline
random sample
topp
topk
temperature
beam search
create forbid set
heuristic
heuristic random sample
heuristic beam search - beast first beam search
heuristic guided decoding strategy
gamma sampling
raise diversity
fine-grained control including topic, end, repeatness
composition decoding
random sample+beam search
typical decoding
select based on conditional entropy
direct beam search
conbine similarity and beam search
future
prompt?
other guided decoding
previous question answering
modeling decoding strategy selection
discussion
"n best search"
intrinsic uncertainty could be solved by increase num_beam
harder to be trapped in local optimum
harder to be trapped in local optimum
decoding limitation
different task would have different optimal decoding strategy
high quality and high prob positive/negative correlation upon different task
high quality!=high prob
appropriate information entropy is decisive upon quality (encourage model to generate tokens based on information)
information theory related
information theory related
new idea
gamma-enhanced sample
idea : gamma sampling treat equally every homoionyms
this new sample method treat homoionyms differently sorted by their cosine similarity towards keywords appeared in source text
this new sample method treat homoionyms differently sorted by their cosine similarity towards keywords appeared in source text
change topic_tokens to key_words directly : works
add key_gamma : doesn't work
change probs before gamma sampling
delete the extreme condition dealing : works
answer towards previous question
收藏
0 条评论
下一页
为你推荐
查看更多
抱歉,暂无相关内容