An Ion Exchange Mechanism Inspired Story Ending Generator for Different Characters

Xinyu Jiang¹ The first two authors contribute equally to this work. Qi Zhang

^{⋆}

2University of Technology Sydney 23Deepblue Academy of Sciences 3 Chongyang Shi \Letter 1Beijing Institute of Technology 1 Kaiying Jiang 4University of Science and Technology Beijing 4 Liang Hu 3Deepblue Academy of Sciences 35Tongji University 5 Shoujin Wang 6Macquarie University
6jiangxy_08@163.com,{zhangqi_cs,cy_shi}@bit.edu.cn

Abstract

Story ending generation aims at generating reasonable endings for a given story context. Most existing studies in this area focus on generating coherent or diversified story endings, while they ignore that different characters may lead to different endings for a given story. In this paper, we propose a Character-oriented Story Ending Generator (CoSEG) to customize an ending for each character in a story. Specifically, we first propose a character modeling module to learn the personalities of characters from their descriptive experiences extracted from the story context. Then, inspired by the ion exchange mechanism in chemical reactions, we design a novel vector breaking/forming module to learn the intrinsic interactions between each character and the corresponding context through an analogical information exchange procedure. Finally, we leverage the attention mechanism to learn effective character-specific interactions and feed each interaction into a decoder to generate character-orient endings. Extensive experimental results and case studies demonstrate that CoSEG achieves significant improvements in the quality of generated endings compared with state-of-the-art methods, and it effectively customizes the endings for different characters.

Keywords:

Story Ending Generation Character-Oriented Neural Network

1 Introduction

Story ending generation aims to deliver a comprehensive understanding of the context and predict the next plot for a given story [10, 16, 31, 29]. Some studies in this field generate coherent stories by modeling the sequence of events or verbs [5, 19], or diversify story generation by introducing common senses or vocabulary information [8, 18]. Others focus on controlling the sentiment of story endings [16, 27] or generating the missing plot for an incomplete story [1, 29]. These methods generally ignore the relation and interaction between story plots and characters and simplify the influence of character personality on story generation, leading to desirable but character-irrelevant story endings.

Figure 1: An example of the story context in the ROCStories corpus, and the endings generated by our model for different characters.

Intuitively, stories are derived from characters, and character personality directly determines the plot and direction of the story. Figure 1 shows an example of a typical story in the ROCStories corpus [21] and the endings generated for different characters. From the figure, we can observe that: 1) each character has its unique personality depicted by its character token and character experience, i.e., the character-related descriptions in a story. For example, the character son is depicted by the token “son” and the description “lying in the pea gravel”; 2) naturally, different characters with different personalities interact with the story context and thus affect the story plot, leading to different story endings (see the different endings for ’son’, ’driver’ and ’I’ in the example).

Customizing the endings for different characters in a story is a novel but challenging task since there is no one-to-many dataset (i.e., one story corresponds to many ground-truth endings). To the best of our knowledge, most previous methods for the story ending generation aim to generate a single ending or missing plot rather than diverse coherent endings of different characters, for a given story context [10, 29, 31]. The main challenges in customizing character-oriented story endings are 1) to model the personality of each character, and 2) to learn the diverse interactions between different characters and the story context. Intuitively, a story context contains a character’s experiences, i.e., the multiple descriptions of the character, which depict the personalities of the character. It would be helpful to extract the related descriptions of each character from the story content and build its experience sequence via organizing the descriptions in chronological order for modeling each character’s personalities.

Inspired by recent studies using deep learning to plan and predict chemical reactions [26, 25], we model the personalities of characters by analogizing the interactions between characters and context to chemical reactions. Specifically, we believe that the information exchange between different characters and context in generating new situational (character-specific) semantics during the interaction is similar to the ion exchange [2] to form new products in chemical reactions (cf. Figure 2). Derived from this observation, it is promising to learn the interaction between a character and the corresponding context following an information exchange procedure. As depicted in Figure 2, by exchanging related descriptions of the character and context, a new informative and character-related description “invite so tired me to sing” is formed by putting “so tired” (from character) and “invite me to sing” (from context) together. Consequently, the newly formed character-related description leads to an ending “refused his invitation” customized for the character “I” with a high probability.

Figure 2: A character-context information exchange mechanism to learn the interaction between character and context, which is inspired by the ion exchange mechanism in chemical reactions.

Accordingly, we propose a Character-oriented Story Ending Generator (CoSEG) to customize an ending for each character in a story. Specifically, the proposed model first learns a representation of each character’s personality by modeling its experiences with a Character Modeling module (CMM) and a context representation by modeling the story content. Then, a novel Vector Breaking/Forming module (VBF) is introduced to effectively learn the interaction between each character and the context through multiple information exchanges. Finally, a character-specific interaction representation is generated by adaptively picking out the most effective interaction via a Character-Context Attention module (C-CA), and each interaction representation is further utilized to customize the ending for the corresponding character. Note that CoSEG adopts LSTM-based encoder-decoder architecture, and the proposed key modules are network-agnostics and are also suitable for other prevailing networks, e.g., CNN [14] and Transformer [32]. The main contributions of our paper are summarized below:

We propose a Character-oriented Story Ending Generator (CoSEG) model to tackle the challenging task of customizing story endings for different characters.
We introduce a character modeling module to effectively model the personality of each character and learn a personalized and informative character representation.
Inspired by the ion exchange process in chemical reactions, we propose a novel VBF module to learn the interaction between the character and context based on the information exchange mechanism.

Extensive experimental results on the ROCStories dataset show that our proposed CoSEG not only generates more coherent and diversified story endings compared with state-of-the-art and/or representative baseline methods, but also customizes effective endings for each character in a story. The superiority of CoSEG also demonstrates the effectiveness of the proposed CMM and VBF module in customizing story endings.

2 Related Work

Neural network-based models are the current mainstream in story generation methods owing to their impressive generation performance [22, 6, 31, 30, 18]. In recent years, there have been many innovations that utilize the encoder and decoder framework to generate coherent and diversified story endings. [33] applies a hierarchical attention architecture to encode text information to generate the context representation. [19] predicts the next event by extracting the event represented from the sentence, thereby ensuring the coherence of the story. [5] uses one head of the decoder’s self-attention to attend only to previously generated verbs in order to generate a coherent story. [4] learns a second seq2seq model, which is led by the first model to focus on what the first model failed to learn. [10] introduces external knowledge and utilizes an incremental encoding scheme to ensure the diversity of stories. In addition, recent work proposes a character-centric neural storytelling model to generate stories for a given character [15]. Excited with the excellent performance of attention-based models [34, 7] like Transformer [28] and BERT [3] in recent years, many story-ending generation and completion models leverage self-attention mechanism and Transformer architecture to enhance the quality of generated story endings [9, 32]. In this work, we adopt LSTM as the backbone in the model design and experiments. Other architectures, e.g., attention networks and Transformer, can easily be incorporated into the proposed CoSEG.

However, most of the aforementioned generation methods cannot generate multiple coherent and diverse endings for a single story context. Moreover, only a few works focus on generating multiple endings or responses given a single context. [8] uses several unobserved latent variables $z$ to generate different responses. This method, however, relies on a one-to-many dataset. [16] applies an additional sentiment analyzer to first predict the sentiment intensity $s$ of the ground truth ending $y$ , then constructs paired data $(x, s; y)$ for training, where $x$ is the story context. In the generation stage, the model receives the sentiment variable $s$ from users to generate a sentiment-specific ending. Recent work [29, 20] introduce prevalent Transformers to learn story representation for generating a missing plot or a story ending. All these methods assume that the plot has little relation or interaction with the personality of the characters in the story. Unlike these models, our proposed model can customize an ending for each character in the story context without relying on a one-to-many dataset.

3 Character-oriented Story Ending Generator (CoSEG)

3.1 Problem Definition and Architecture

In this section, we formulate the task of customizing the ending for each character in a story. Given a story content $x = (x_{1}, . . ., x_{l})$ , which contains $l$ sentences, and $m$ characters $(c_{1}, . . ., c_{m})$ . The task is to predict customized endings $y = (y_{1}, . . ., y_{m})$ for all the $m$ characters.

A story generally corresponds to only one ground truth ending, which may consist of the actions or opinions of a particular character; the endings of other characters are unavailable. To tackle the issue, in the training stage, we extract the experience sequence of the character in the ground-truth ending, then train our proposed model to generate an ending related to the extracted character (ground-truth ending). In the generating stage, we extract the experience sequences of all characters who appear in the story and then apply the proposed model to generate an ending for the characters.

The architecture of our proposed CoSEG model is depicted in Figure 3. Our model consists of three modules—a CMM module, a VBF module and a C-CA module—as well as an encoder to encode the story context and a decoder to generate the story ending. As shown in Figure 3, the CMM module generates the character representation $c c$ for each character $c_{i}$ by modeling the character’s experiences; the VBF module learns the interaction between the character representation and the story context through multiple information exchanges and generates multiple interaction results, namely product candidates $(p_{0}^{c}, . . ., p_{n}^{c})$ ; the C-CA module generates a character-related story context representation by picking out the most effective product candidates. The context representation is further used as the initial state of the decoder to predict an ending for the character. The following sections present the details of each module.

Figure 3: Character-oriented Story Ending Generator (CoSEG).

3.2 Character Experience Sequences

We construct the experience sequence for each character in the story; the experience sequence is further fed into CMM to generate the character representation.

We take the sentence She knew a discount store near her sold socks as an example to illustrate how to extract a character experience with the following four steps:

1) Construct a dependency tree for the given sentence, and get the headword knew of the character She.

2) Extract the context words, namely knew, discount, store, sold, socks, as the first part of the character experience. This part is the background of the story.

3) Extract the entity words, namely the character She, the headword knew and the corresponding object store, as the second part of the character experience.

4) Connect the above two parts of character experience with a token OBJ to obtain the final character experience $[$ knew, discount, store, sold, socks, OBJ, knew, store, She $]$ . The token OBJ is utilized to separate the two parts, explicitly telling the model which words are context information and which are directly related to the character.

In this way, we can extract each character’s experience from each sentence. Subsequently, we build the experience sequence $(e_{1}^{i}, . . ., e_{s}^{i})$ of the character $c_{i}$ via organizing the character’s experience in chronological order, where $s$ is the number of the sentences that contain $c_{i}$ .

3.3 Character Modeling Module (CMM)

We generate the character representation by modeling the character experience sequence. Formally, given a character $c_{i}$ and a corresponding character experience sequence $(e_{1}^{i}, . . ., e_{S}^{i})$ , the module encodes the token sequence $(w_{1}^{s}, . . ., w_{T_{s}}^{s})$ inside each experience $e_{s}^{i}$ to obtain the hidden states $(h_{1}^{s}, . . ., h_{T_{s}}^{s})$ , where $T_{s}$ is the length of $e_{s}^{i}$ , and the superscript $s$ of h or $w$ represents it is for the $s^{t h}$ character experience.

We choose the final hidden state $h_{T_{s}}^{s}$ as the representation of the character $c_{i}$ who went through the character experience $e_{s}^{i}$ . The character representation $h_{T_{s}}^{s}$ is then used as the initial state of the next encoder to generate further enriched character representation $h_{T_{s + 1}}^{s + 1}$ as follows:

h_{t}^{s + 1} = LSTM (h_{t - 1}^{s + 1}, w_{t}^{s + 1}, h_{T_{s}}^{s}),

(1)

where $T_{s + 1}$ is the length of experience $e_{s + 1}^{i}$ , $w_{t}^{s + 1}$ is the $t^{t h}$ token in the sequence $(w_{1}^{s + 1}, . . ., w_{T_{s + 1}}^{s + 1})$ inside the experience $e_{s + 1}^{i}$ , $w_{t}^{s + 1}$ denotes the embedding of $w_{t}^{s + 1}$ and the superscript $s + 1$ of h or $w$ represents it is for the $(s + 1)^{t h}$ character experience.

Finally, the CMM Module will generate the $S^{t h}$ character representation $h_{T_{S}}$ , which has gone through all the character experiences $(e_{1}^{i}, . . ., e_{S}^{i})$ .

3.4 Vector Breaking and Forming (VBF)

Since both character and context are represented with high-dimensional vectors, we propose a novel VBF module to learn the interaction between the character and the story context representation based on the information exchange procedure (cf. Figure 2) and then generate multiple product candidates.

Vector Breaking: Assume that there are invisible bonds between the adjacent elements of a vector, and VBF breaks the bonds between adjacent elements. For a vector of size $n$ , there are $n + 1$ potential bond-breaking positions. For example, given a vector $v_{1} = [0.1, 0.2]$ , the size of $v_{1}$ is 2 and we have 3 bond-breaking positions, as follows:

	$v_{1}^{l}, v_{1}^{r} = [], [0.1, 0.2] = V e c B_{0} (v_{1}),$		(2)
	$v_{1}^{l^{'}}, v_{1}^{r^{'}} = [0.1], [0.2] = V e c B_{1} (v_{1}),$
	$v_{1}^{l^{''}}, v_{1}^{r^{''}} = [0.1, 0.2], [] = V e c B_{2} (v_{1}),$

where $V e c B_{k}$ represents breaking the bond in the position $k$ , and the superscripts $l$ and $r$ of the breaking results represent the left and right parts of $v_{1}$ , respectively.

Vector Forming: The two interaction vectors break at each position respectively. To keep the size of interaction results constant, for each bond-breaking position, VBF integrates the left part of the first vector and the right part of the second vector to generate a product candidate¹¹1There is no order between the two interaction vectors, which vector as the first one has little influence on the experimental results.. In this way, two vectors of size $n$ can interact to obtain a total of $n + 1$ product candidates. For example, let $v_{1}$ interact with $v_{2} = [0.3, 0.4]$ , and we can obtain such three product candidates $p_{0}^{c} = [0.3, 0.4], p_{1}^{c} = [0.1, 0.4], p_{2}^{c} = [0.1, 0.2]$ . As shown in Figure 3 part (3), the character representation and the encoder final state $h_{t}$ (i.e., story context representation) interact in the VBF module to generate the product candidates $(p_{0}^{c}, . . ., p_{n}^{c})$ .

3.5 Character-Context Attention (C-CA)

As shown in Figure 3 part (4), the C-CA Module aims to pick out the most effective product candidates. Specifically, we utilize the $s^{t h}$ character representation $h_{T_{s}}$ and the encoder final state $h_{t}$ to obtain the attention weight of each product candidate $p_{k}^{c}$ :

	$a^{s}$	$= σ (W_{a} [h_{T_{s}}; h_{t}] + % b_{a}),$		(3)
	$r^{s}$	$= n \sum k = 0 a_{k}^{s} p_{k}^{c},$		(3)

where $σ$ is the softmax function, $W_{a}$ is the weight matrix, $b_{a}$ is the bias, $a t t$ is the attention weight, and $a^{s}$ stands for the character-related story context representation of the $s^{t h}$ character. As shown in Figure 3 part (5), the $r^{s}$ is further used as the initial state of the decoder (note that we omit the superscript $s$ in the following for simplicity):

h_{t}^{y} = LSTM (h_{t - 1}^{y}, w_{t - 1}^{y}, r),

(4)

where $h_{t}^{y}$ is the $t^{t h}$ hidden state of the decoder, which is further utilized to generate the $t^{t h}$ word $w_{t}^{y}$ , $w_{t - 1}^{y}$ is the embedding of word $w_{t - 1}^{y}$ .

4 Experiment

In this section, we conduct extensive experiments to investigate the quality of CoSEG and the comparative baselines.

4.1 Dataset

We evaluated our model on the ROCStories corpus [21]. This corpus contains 98,162 five-sentence stories. Our task is to generate an ending for each character that appears in a given four-sentence story²²2We identify characters in a macro way. We extract the subject of each sentence in the story. We regard name entity or noun as the character of the sentence. In principle, in this way, we can generate an ending for any noun. Since there are few endings regarding nouns as the characters in the training data (such as the ending with "car" as the character), the proposed model is difficult to generate high-quality endings for those characters.. For each story, we extract the experience sequence for the character who appears in the ground truth ending. We select 66,881 stories in which the length of the ground-truth character’s experience sequence is no less than 2 and treat these stories as the training set. We elaborately design two test sets, each with 3073 stories. Specifically, the two sets are called sufficient test set and inadequate test set. In the sufficient test set, the length of the ground-truth character’s experience sequence is no less than 2 for all stories, while the length is less than 2 for all stories in the inadequate test set. The two test sets are applied to evaluate the performance of our proposed model when the character information is sufficient and inadequate, respectively.

4.2 Experimental Settings

We use the GloVe.6B [24] pre-trained word embedding, and the number of dimensions is 200. The hidden size of the LSTM cell is 512. Since the size of the character representation and the encoder’s final state both are 512, the number of product candidates will be $512 + 1 = 513$ . A larger dimension size brings large computation costs for the model and the device. In summary, the detailed experimental settings are provided as follows:

We use 66,881 stories for training.
We use 3073 stories for validation, which is shared by the two test sets.
We have two test sets with 3073 stories each. We refer to the two test sets as the sufficient test set and the inadequate test set respectively. In the sufficient test set, the character in the ground truth ending also appears multiple times in the story context, and in the inadequate test set is the opposite. These two test sets evaluate the performance of our proposed model when the character information is sufficient and inadequate, respectively.
We use Momentum Optimizer to update parameters when training and empirically set the momentum to be 0.9.
The size of the character representation and the encoder hidden state are 512.
We select the product candidates generated using $V e c B_{0}$ , $V e c B_{128}$ , $V e c B_{256}$ , $V e c B_{384}$ and $V e c B_{512}$ five breaking operation.
The number of product candidates will be $512 + 1 = 513$ .

Note that the reason for the selection of the product candidates is analyzed in Combination Analysis on Product Candidates in Section 4.5.

4.3 Baselines

We compared our model with the following state-of-the-art baseline methods:

Seq2Seq [17]: A vanilla encoder-decoder model with an attention mechanism. The model treats the story context as a single sentence.

HAN [33]: A hierarchical attention architecture is applied to encode text information so as to generate the context representation.

IE [10]: It adopts an incremental encoding scheme to represent context clues and applies commonsense knowledge by multi-source attention.

T-CVAE [29]: It proposes a conditional variational autoencoder based on Transformer for missing plot generation.

MGCN-DP [11]: It leverages multi-level graph convolutional networks over dependency parse trees to capture dependency relations and context clues.

In addition, we introduce two variations of the proposed CoSEG model:

CoADD: We replace the VBF Module in CoSEG with an element-wise summation.

CoCAT: We concatenate the character representation and the encoder final state, and pass the concatenated vector through a linear layer to obtain the character-related story context representation.

4.4 Evaluation Metrics

We evaluate our model from two perspectives: the quality of the generated endings and the ability to customize endings.

4.4.1 Quality Evaluation

We adopt two kinds of evaluations to investigate the ability of the proposed method and the baselines in generating high-quality story endings.

Automatic Evaluation: We use perplexity (PPL) and BLEU (BLEU-1, BLEU-2 and BLEU-3) [23] to evaluate the quality of the generated endings. A smaller PPL and a higher BLEU indicate a better ending.

Manual Evaluation: We hire three evaluators, who are experts in English, to evaluate the generated story endings. We randomly sampled 200 stories from the two test sets and obtained 1400 endings from the seven models for each test set. Evaluators need to score the generated endings in terms of two criteria: coherency and grammar. The coherency score measures whether the endings are coherent with the story context; specifically, the score of 3 denotes coherency, the score of 1 denotes coherency to some extent, and the score of 0 denotes no coherency at all. In addition, the grammar score measures whether there are grammatical errors in generated endings; a grammar score is 0 if endings have errors, and 1 otherwise.

4.4.2 Ability to Customize Endings

We randomly sample 200 stories from the two test sets and generate ending for one random character in each story. To evaluate the ability to customize endings for different characters, we propose three evaluation metrics:

Success Rate (SucR): SucR measures whether the subject of the generated ending is the selected character.

Rationality: We adopt three levels to evaluate whether the generated ending matches the selected character given the story context: level 3 denotes perfect matching, level 1 denotes partial matching, and level 0 for mismatching.

Discrimination Degree (DiscD): Given an ending generated by our proposed model, we further hire three evaluators to choose which character is the ending generated for. If the character chosen by the evaluator is consistent with the selected character, it scores 1; and 0 otherwise.

4.5 Evaluation Results

Sufficient
Model	PPL	BLEU-1 (%)	BLEU-2 (%)	BLEU-3 (%)
Seq2Seq	13.26	22.46	6.88	4.21
HAN	13.43	22.43	6.96	4.47
IE	12.08	23.08	7.43	4.67
T-CVAE	11.21	23.72	8.05	5.11
MGCN-DP	11.01	23.90	8.11	5.34
CoADD	12.14	23.92	7.74	4.68
CoCAT	11.45	24.26	8.53	5.41
CoSEG	9.99	25.28	9.10	5.93
Inadequate
Model	PPL	BLEU-1 (%)	BLEU-2 (%)	BLEU-3 (%)
Seq2Seq	21.81	17.13	3.76	1.76
HAN	24.26	17.15	4.08	2.32
IE	16.90	18.40	4.89	2.78
T-CVAE	17.08	22.10	7.05	4.22
MGCN-DP	18.16	20.89	5.90	3.68
CoADD	14.53	21.83	6.99	4.03
CoCAT	15.08	24.50	9.09	5.26
CoSEG	11.45	26.06	9.80	5.70

Table 1: Automatic evaluation results of the sufficient and the inadequate test set.

Automatic Evaluation The automatic evaluation results for the sufficient and inadequate test sets are shown in Table 1. From the table, we can observe the following:

1) In both the sufficient and inadequate test sets, our model has lower perplexity and higher BLEU scores than the baselines. Specifically, in terms of perplexity, CoSEG outperforms MGCN-DP, T-CVAE, IE, HAN and Seq2Seq by 1.02/ 1.22/ 2.09/ 3.44/ 3.27 respectively in the sufficient test set, and by 6.01/ 5.63/ 5.45/ 12.81/ 10.36 respectively in the inadequate test set. In addition, in terms of BLEU-1, CoSEG outperforms MGCN-DP, T-CVAE, IE, HAN and Seq2Seq by 1.38%/ 1.56%/ 2.2%/ 2.85%/ 2.82% respectively in the sufficient test set, and by 5.17%/ 3.96%/ 7.66%/ 8.91%/ 8.93% respectively in the inadequate test set.

2) Our CoSEG model has the smallest performance gap between the two test sets, which illustrates the performance of our model is not easily affected by the amount of information. Specifically, the perplexity increased by 1.55 in the inadequate test set based on the sufficient test set, and the BLEU-1 increased by 0.78%.

3) In both the sufficient and inadequate test set, the CoSEG model outperforms the CoADD and the CoCAT a lot, which illustrates the interaction ability of the VBF Module is much stronger than the addition and concatenation.

	Sufficient		Inadequate
Model	Coherency	Grammar	Coherency	Grammar
Seq2Seq	1.395	0.655	0.905	0.780
HAN	1.160	0.685	0.600	0.785
IE	1.360	0.760	1.210	0.820
T-CVAE	1.750	0.785	1.440	0.815
MGCN-DP	1.760	0.780	1.315	0.795
CoADD	1.690	0.775	1.220	0.760
CoCAT	1.855	0.605	0.965	0.705
CoSEG	1.880	0.805	1.620	0.835

Table 2: Manual evaluation results of the sufficient and the inadequate test set.

Manual Evaluation The manual evaluation results for the sufficient and inadequate test sets are shown in Table 2, where we can observe:

In both the sufficient and inadequate test set, the CoSEG model obtains the best coherency score and the best grammar score. Specifically, in terms of Coherency, CoSEG outperforms CoCAT, CoADD, MGCN-DP,T-CVAE, IE, HAN and Seq2Seq by 0.025/ 0.19/ 0.12/ 0.13/ 0.52/ 0.72/ 0.485 respectively in sufficient test set, and by 0.655/ 0.4/ 0.305/ 0.18/ 0.41/ 1.02/ 0.715 respectively in inadequate test set. Moreover, in terms of Grammar, CoSEG outperforms CoCAT, CoADD, MGCN-DP, T-CVAE, IE, HAN and Seq2Seq by 0.2/ 0.03/ 0.025/ 0.02/ 0.045/ 0.12/ 0.15 respectively in sufficient test set, and by 0.13/ 0.075/ 0.04/ 0.02/ 0.015/ 0.05/ 0.055 respectively in inadequate test set.

Testset	SucR	Rationality	DiscD
Sufficient	0.855	1.965	0.755
Inadequate	0.605	1.980	0.555

Table 3: Ability to Customize Endings.

Ability to Customize Endings The ability to customize endings for different characters of our model is shown in Table 3, where we can observe:

1) In the sufficient test set, the success rate (SucR) and the discrimination degree (DiscD) are 85.5% and 75.5% respectively, which indicates that our model is able to identify the differences between characters. The SucR and DiscD in the inadequate test set are lower than there in the sufficient test set, which illustrates that the amount of character information has a certain influence on distinguishing different characters.

2) In both the sufficient and inadequate test sets, the rationality of the customized endings is close to 2.0 on the premise that the maximum score is 3.0. It indicates that our model has a high probability of 66% to predict a reasonable ending for each character.

Model	PPL
CoSEG (0)	12.08
CoSEG (128)	14.74
CoSEG (256)	13.06
CoSEG (0-256-512)	10.93
CoSEG (0-128-256-384-512)	9.99

Table 4: Experimental results of several different combinations.

Combination Analysis on Product Candidates We conduct experiments on several different combinations of product candidates. Specifically, the CoSEG (n) selects the product candidate generated using the $V e c B_{n}$ breaking operation; the CoSEG (0-256-512) selects the product candidates generated using $V e c B_{0}$ , $V e c B_{256}$ and $V e c B_{512}$ three breaking operations; the CoSEG (0-128-256-384-512) selects the product candidates generated using $V e c B_{0}$ , $V e c B_{128}$ , $V e c B_{256}$ , $V e c B_{384}$ and $V e c B_{512}$ five breaking operations.

The experimental results are shown in Table 4. We can observe that CoSEG (0-128-256-384-512) achieves the best performance. The result explains that we finally selected the product candidates generated using $V e c B_{0}$ , $V e c B_{128}$ , $V e c B_{256}$ , $V e c B_{384}$ and $V e c B_{512}$ five breaking operations and utilize the selected five product candidates as inputs to C-CA module. In addition, the result is attributed to the fact that CoSEG (0-128-256-384-512) involves more and smaller candidate which facilitate generating fine-grained semantic elements and providing more semantic combinations.

Context:	Ned was walking in the park one day. He noticed
	the sky started to turn gray. Ned turned back toward
	his house. He didn’t quite get home before it started
	raining.
Gold:	Ned ran inside, a little wet, but happy to be home.
Seq2Seq:	He went back to his car and bought with his warm.
HAN:	He had a clean mess.
IE:	He had to go home and go home.
T-CVAE:	Ned looked around and saw the sky.
MGCN-DP:	he was very happy with his new car!
CoADD:	He decided to go to the store to buy more umbrella.
CoCAT:	He went outside to find his car.
CoSEG:	He decided to go to the store to get an umbrella.
Context:	We removed a lamp last week during carpet cleaning.
	After the cleaners left my wife put the lamp back.
	The lighting seemed much dimmer.
	Then she realized she had brought out the wrong lamp.
Ground truth:	I said I had not even noticed.
Seq2Seq:	She was so relieved.
HAN:	She had to clean it up.
IE:	She had to clean it up and get a new one.
T-CVAE:	I bought a new lamp .
MGCN-DP:	We had to clean it up.
CoADD:	I was so mad that she was able to get it fixed.
CoCAT:	She was so happy to have a new lamp.
CoSEG:	I was so upset , I decided to get her a new one.
Context:	Larry needed a job.
	He was let go from a job a month ago and hadn’t found another one.
	His bills were starting to mount up.
	Larry was determined and applied to many jobs every day.
Ground truth:	He was finally hired.
Seq2Seq:	He was able to get a job in time and help for his job.
HAN:	He was able to get a job in a new city.
IE:	He was hired for a new job.
T-CVAE:	He have a job.
MGCN-DP:	He said he was going to be more careful.
CoADD:	He was able to get to work and get a job.
CoCAT:	He was hired at the store and bought a brand new car.
CoSEG:	He finally got a job offer and paid his bills.

Table 5: Case Study: Endings Generated by Different Models.

5 Case Study

5.1 Ground-Truth Endings

We present several examples of ground-truth story endings generated by baselines and our model in Table 5 to demonstrate that our model is able to generate more natural and more character-related endings than the baselines. Specifically, in the first story in Table 5, the ending generated by baseline T-CVAE makes the reader feel that the character is not smart enough, it has begun to rain, and it should not be very useful to look at the sky at this time; the baseline IE only describes He had to go home, without other details. And other baselines predicted wrong endings. By contrast, our model CoSEG allows the character to take an appropriate action, get an umbrella, according to the given story context, started raining.

In addition, the second story in Table 5 is derived from the inadequate test set. In this example, the context is a story about a wrong lamp. The baselines HAN, IE and MGCN-DP describe She/We had to clean it up, and IE further describes get a new one. Our model CoSEG not only allows the character to have an appropriate emotion, I was so upset, but also let the character take a reasonable action, decided to get her a new one. Obviously, the ending generated by CoSEG takes the character’s emotions (upset), behaviors (decided…get…), and relationships (get her a…) into account, which illustrates the ability of our model to obtain character’s personality.

The third example in Table 5 is sampled from the sufficient test set. In this example, the context is a story about a man’s bills mount up and he needs a job. Most of the baselines describe He get a job, as well as the ground truth and our proposed model. In addition, different from all baselines and the ground truth endings, our model further describes the man’s purpose of looking for a job, paid bills, which also demonstrates that our model is able to generate a more character-related ending.

Endings for each character:
Context:	I ran and climbed over the fence.
	My son was lying in the pea gravel on the road.
	The car had swerved just in time.
	I raged at the driver for not even stopping.
Ground truth:	I called 911 to come get my child.
For I:	I ended up falling and I had to go to the hospital.
For driver:	The driver pulled the car and I was able to get it back.
For son:	My son was upset.
Context:	I had a dental appointment I had to go to today.
	While getting my teeth checked , my dentist told I had a cavity.
	He said it’s probably because I’ve been using subpar toothpaste.
	I’ve been using the same toothpaste he recommended six months ago.
Ground truth:	Thanks a lot for the recommendation, doc.
Endings for each character:
For I:	I am glad I have a new toothpaste.
For dentist:	The dentist told me that he had to get a new toothpaste.
Context:	John was awakened by a phone call.
	Answering , John realized it was his buddy, Rich.
	Rich said he was stranded on a highway just outside of town.
	John drove out to pick up Rich.
Ground truth:	John drove Rich home , where they both fell asleep on the couch.
Endings for each character:
For Rich:	He drove to the mall and bought a new car.
For John:	John and his friends went to the park and had a great time.

Table 6: Customizing endings for each character using our proposed CoSEG model.

5.2 Character-Orient Endings

In this section, we present three examples with character-orient endings (including the ground-truth endings) generated by our method in Table 6, to illustrate the ability of our model to customize endings for different characters.

In the first example in Table 6, our model customizes endings for the characters I, driver and son. The context is a story about a car accident that happened to a father and son. The endings generated by our model are that the father had to go to the hospital, the driver ran away, and the son was upset. These three endings generated by our model describe the behavior of the father and the driver and the mood of the son, which demonstrate the effectiveness of our proposed model to customize endings for different characters. The second example in Table 6 is a story in that a man has a cavity because he uses the toothpaste recommended by the dentist, and our model customizes endings for the characters I and dentist. The third example in Table 6 is a story in that Rich is stranded on a highway and he calls John to pick him up. Our model identifies the differences between Rich and John, generating endings, He drove to the mall and bought a new car, for Rich, and John and his friends went to the park and had a great time, for John. Rich need a new car, because his car is stranded on the highway.

6 Conclusion

To tackle the challenging task of customizing story endings for different characters, we propose a Character-oriented Story Ending Generator (CoSEG). Experimental results demonstrate that our proposed model can not only generate more coherent and diversified story endings compared with state-of-the-art methods but also effectively customize the ending for each character in a story.

7 Acknowledgements

This work is supported by the National Key Research and Development Program of China(No. 2019YFB1406300), National Natural Science Foundation of China (No. 61502033) and the Fundamental Research Funds for the Central Universities.

References

[1] J. Chen, J. Chen, and Z. Yu (2019) Incorporating structured commonsense knowledge in story completion. In AAAI, pp. 6244–6251. Cited by: §1.
[2] J. C. Crittenden, R. R. Trussell, D. W. Hand, K. J. Howe, and G. Tchobanoglous (2012) Ion exchange. John Wiley & Sons, Inc.. Cited by: §1.
[3] J. Devlin, M. Chang, K. Lee, and K. Toutanova (2019) BERT: pre-training of deep bidirectional transformers for language understanding. In NAACL-HLT (1), pp. 4171–4186. Cited by: §2.
[4] A. Fan, M. Lewis, and Y. N. Dauphin (2018) Hierarchical neural story generation. In ACL, pp. 889–898. Cited by: §2.
[5] A. Fan, M. Lewis, and Y. N. Dauphin (2019) Strategies for structuring story generation. In ACL, pp. 2650–2660. Cited by: §1, §2.
[6] W. Fedus, I. J. Goodfellow, and A. M. Dai (2018) MaskGAN: better text generation via filling in the _______. In ICLR, Cited by: §2.
[7] C. Feng, C. Shi, S. Hao, Q. Zhang, X. Jiang, and D. Yu (2021) Hierarchical social similarity-guided model with dual-mode attention for session-based recommendation. Knowl. Based Syst. 230, pp. 107380. Cited by: §2.
[8] J. Gao, W. Bi, X. Liu, J. Li, and S. Shi (2019) Generating multiple diverse responses for short-text conversation. In AAAI, pp. 6383–6390. Cited by: §1, §2.
[9] J. Guan, F. Huang, M. Huang, Z. Zhao, and X. Zhu (2020) A knowledge-enhanced pretraining model for commonsense story generation. Trans. Assoc. Comput. Linguistics 8, pp. 93–108. Cited by: §2.
[10] J. Guan, Y. Wang, and M. Huang (2019) Story ending generation with incremental encoding and commonsense knowledge. In AAAI, pp. 6473–6480. Cited by: §1, §1, §2, §4.3.
[11] Q. Huang, L. Mo, P. Li, Y. Cai, Q. Liu, J. Wei, Q. Li, and H. Leung (2021) Story ending generation with multi-level graph convolutional networks over dependency trees. In AAAI, pp. 13073–13081. Cited by: §4.3.
[12] R. H. Kennedy (1959) Elution of uranium values from ion exchange resins. US.
[13] J. Kim and M. M. Benjamin (2004) Modeling a novel ion exchange process for arsenic and nitrate removal. Water Research 38 (8), pp. 2053–2062.
[14] Y. Kim (2014) Convolutional neural networks for sentence classification. In EMNLP, pp. 1746–1751. Cited by: §1.
[15] D. Liu, J. Li, M. Yu, Z. Huang, G. Liu, D. Zhao, and R. Yan (2020) A character-centric neural model for automated story generation. In AAAI, pp. 1725–1732. Cited by: §2.
[16] F. Luo, D. Dai, P. Yang, T. Liu, B. Chang, Z. Sui, and X. Sun (2019) Learning to control the fine-grained sentiment for story ending generation. In ACL, pp. 6020–6026. Cited by: §1, §2.
[17] T. Luong, H. Pham, and C. D. Manning (2015) Effective approaches to attention-based neural machine translation. In EMNLP, pp. 1412–1421. Cited by: §4.3.
[18] H. H. Mao, B. P. Majumder, J. J. McAuley, and G. W. Cottrell (2019) Improving neural story generation by targeted common sense grounding. In EMNLP, pp. 5987–5992. Cited by: §1, §2.
[19] L. J. Martin, P. Ammanabrolu, X. Wang, W. Hancock, S. Singh, B. Harrison, and M. O. Riedl (2018) Event representations for automated story generation with deep neural nets. In AAAI, pp. 868–875. Cited by: §1, §2.
[20] L. Mo, J. Wei, Q. Huang, Y. Cai, Q. Liu, X. Zhang, and Q. Li (2021) Incorporating sentimental trend into gated mechanism based transformer network for story ending generation. Neurocomputing 453, pp. 453–464. Cited by: §2.
[21] N. Mostafazadeh, L. Vanderwende, W. Yih, P. Kohli, and J. F. Allen (2016) Story cloze evaluator: vector space representation evaluation by predicting what happens next. In Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP, pp. 24–29. Cited by: §1, §4.1.
[22] L. Mou, Y. Song, R. Yan, G. Li, L. Zhang, and Z. Jin (2016) Sequence to backward and forward sequences: A content-introducing approach to generative short-text conversation. In COLING, pp. 3349–3358. Cited by: §2.
[23] K. Papineni, S. Roukos, T. Ward, and W. Zhu (2002) Bleu: a method for automatic evaluation of machine translation. In ACL, pp. 311–318. Cited by: §4.4.1.
[24] J. Pennington, R. Socher, and C. D. Manning (2014) Glove: global vectors for word representation. In EMNLP, pp. 1532–1543. Cited by: §4.2.
[25] K. T. Schütt, M. Gastegger, A. Tkatchenko, K. -R. Müller, and R. J. Maurer (2019) Unifying machine learning and quantum chemistry – a deep neural network for molecular wavefunctions. Nature Communications. Cited by: §1.
[26] M. H. S. Segler, M. Preuss, and M. P. Waller (2018) Planning chemical syntheses with deep neural networks and symbolic ai. Nature 555 (7698), pp. 604. Cited by: §1.
[27] L. Tu, X. Ding, D. Yu, and K. Gimpel (2019) Generating diverse story continuations with controllable semantics. In NGT@EMNLP-IJCNLP, pp. 44–58. Cited by: §1.
[28] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin (2017) Attention is all you need. In NIPS, pp. 5998–6008. Cited by: §2.
[29] T. Wang and X. Wan (2019) T-CVAE: transformer-based conditioned variational autoencoder for story completion. In IJCAI, pp. 5233–5239. Cited by: §1, §1, §2, §4.3.
[30] S. Welleck, K. Brantley, H. D. III, and K. Cho (2019) Non-monotonic sequential text generation. In ICML, Proceedings of Machine Learning Research, Vol. 97, pp. 6716–6726. Cited by: §2.
[31] J. Xu, X. Ren, Y. Zhang, Q. Zeng, X. Cai, and X. Sun (2018) A skeleton-based model for promoting coherence among sentences in narrative story generation. In EMNLP, pp. 4306–4315. Cited by: §1, §1, §2.
[32] P. Xu, M. Patwary, M. Shoeybi, R. Puri, P. Fung, A. Anandkumar, and B. Catanzaro (2020) MEGATRON-CNTRL: controllable story generation with external knowledge using large-scale language models. In EMNLP (1), pp. 2831–2845. Cited by: §1, §2.
[33] Z. Yang, D. Yang, C. Dyer, X. He, A. J. Smola, and E. H. Hovy (2016) Hierarchical attention networks for document classification. In NAACL, pp. 1480–1489. Cited by: §2, §4.3.
[34] Q. Zhang, L. Cao, C. Shi, and Z. Niu (2021) Neural time-aware sequential recommendation by jointly modeling preference dynamics and explicit feature couplings. IEEE Transactions on Neural Networks and Learning Systems (), pp. 1–13. Cited by: §2.