A Mountain of Pent Up Tears – children’s stories generated texts

Children’s stories this is part of the classic literature series on this blog, experiments in hybrid fiction. 

I turned out so fast, I barely looked back. (I turned to look, only to find the figure had gone.) What has gone before was the dark room.

In the far left corner lay a pile of books. The ones on the page, all full of life’s lessons; the black ones, I thought of the colorless, the dark books, my soul had never seen red in my life. These weren’t books of old stories. These were children’s stories told by old schoolboys, and children’s books have been around ever since:

This was written in the 1840s when young, red dwarfs lived in the forest. They wore their clothes brown and brown and purple. At the close of the story, the author makes the child a dwarf, the woods were too far for them to go, so they are made brown. His book is told like many children’s stories.

“This is where the forest is.”

“The forest, where do I go then?”

“In the woods. In the woods, as you said.”

“I, too, am brown.”

“Yes, I am. I am in the forest. It is the way of the forest.”

“And me?”

“Yes, too, me too.”

“And you too?”

“Yes, too.”

The forest is the only place in which these children can know what things are true. They see the forest as their only home. If they go away, then they see them as an enemy who must be conquered. But these children have all travelled over, and gone past. They’ve seen it all over again.

They know what the forest is; how it works, how things go. They can see everything. And for some reason they know that they are all in the same boat, that they can never cross over to other paths. So they know, even in that boat, what the forest means, and why it is there.

What’s behind the green, silver-tinted door. Where’s the white door? That’s not the forest, is it? The white door has never been there. What was behind that door, the door to the land?

They do not know why they were born into the woods. They think that the forest is God’s gift for them. But what’s there to be gifted for? They are the only ones, after all.

When asked how the sun comes out, and when asked why the sun comes out.

Answer: There is no one here who knows.

And they need not have seen the sun.

 


GD: the first 68 words of the generated text (above) was next used as the prompt to generate the text (below). The 68 words are shown in italics.


I turned out so fast, I barely looked back. (I turned to look, only to find the figure had gone.) What has gone before was the dark room.

In the far left corner lay a pile of books. The ones on the page, all full of life’s lessons; the black ones, I thought of the colorless, the dark books, my soul had never seen red in my life.  

There, I thought, was the door leading to what was yet to come. But in the darkness, an unseen figure, one I couldn’t yet recognize, looked from the left to the right, slowly. As it followed, my eyes scanned the walls, but I couldn’t find the same room. I reached out to touch the door, but the figure stayed, a blank shadow across the room, it never left the door. It seemed to vanish into nothingness.

Then came an accelerating sound, as the figure dropped, and it moved slowly back into the room.

“What have you done?” I cried, the words begins to rise up like a mountain of pent up tears. There, I found, was the door. It turned, and my body seemed to lift it, before the figure itself left to my side.

A smile appeared on my face as I looked at the room, the books filled a giant library with life and lessons, the black ones I had hoped would be my own in my own time, the colorless ones, the ones I’ve already read, the ones I’ve been looking for. They were to tell me what I would need to know, how to survive this night, to know the unknown. They were a path, a map.

In the darkest hall, the room I saw the figure enter, a pale shadow in the light. The place was dark, it was cold. He took me in, and  I felt he was going to kill me, he was taking me inside. The sound woke me, and I opened my eyes to find another shadow, a figure standing behind me. I didn’t know what he was, but I was in my own home.

That man walked towards me, eyes glow green, his eyes were gloomy, the healers he wore, his hair was matted with grey, it was grey that faded in between them. He was big, at least, if a little tall. At least I had grown the right size for this thing.  I don’t think his arms were big, so that is probably just his arms, or perhaps his face was too short for his eyes. His fingers were lank, the seated fingers were black, and  in the dimmest corner of his head was a hood, with no lights.

It was very dark, there was an open door beyond it. He held my hand like a…well, for I was small. I could tell the size of his hand. He had long grey hair in his hands. He held my hand, and I saw red, it was black, as if it was a red colour.

I thought, and in that moment, my skin was soft and shiny. In between my knees. So, I moved over and looked, in my hands, and there was a dark green hand, with red hands, I knew this was a shadow, but that doesn’t mean anything to him.

The first hand, the first hand was big, its face was black, it was large, for I was small. The black was green, its color was darker than red. I touched it, and it was cold.

The second hand was big, but the hands were dark, and had big fingers, the first hand was long and dark green, it was like a purse, I didn’t know how it was, but it was cold, in that moment, I froze.

I could only watch.

 


You can guess the original story from the image (below).

Little Red Riding Hood and Wolf

 

Photo credit https://www.publicdomainpictures.net/

Guess the original classics – Born Into Slavery

Born into Slavery

This is a game to see if you can identify the original story author in the generated text below. It has a new title: Born Into Slavery. The generated text is followed by the classic story which was used as the prompt for the computer generation. For technical details see end.


 

Born Into Slavery

What would occur when the daughter came for a visit? Her daughter could not wish to come here but if she had to she was certain to be disappointed.

When her eldest daughter was in her early thirties, Mr. Smith thought it might be advisable to take her away with him to her own country. The time had been long since passed; what had gone before was now forgotten. She was quite happy at home, though she knew the consequences of her leaving the family.

Mrs. Smith, who lived on her own farm, was the more interested of the two, as she had a son she loved. She would rather she had one of her own daughters than, with him, a son. As it was she was obliged to take him along, though with a slight advantage in health and strength. The child, however, was well behaved, and was very fond of her.

The old couple would have had a child when Mrs. Smith was dead; the latter would not have died till Mr. Smith, as they thought, had been ninety years old, and with her, after him. The child, as they said, would live till old Mr. Smith had died; if that should prove impossible, they would have had an heir to one of the first families to the west of the river.

Mrs. Smith was still an old woman, she remembered; her eyes had become dim, her hands and feet were weak, her feet were dull and bent, her hands wrinkled and loose. A change from her old condition, she thought, would have been too much for her now. She had always been happy in her old condition, though always in the present mode.

This story made her do something she thought necessary for the little girl, and she went into the hall and sat down in the great old chair opposite to Mrs. Smith

A woman’s strength, she thought, was her knowledge of her father’s affairs. She knew that as long as her children could walk with her, they could not be married.

Her husband knew that too; he knew where he was going, what he was going to do, and was never the same afterwards. There was a very large number of children, her daughters, living abroad, in the most irregular manners. He had grown somewhat deaf, with a peculiar difficulty about their voices.

And Mrs. Smith never liked it when she was in any degree unhappy; she could not bear to see her men, much less her children, so miserable. Every time they came she wanted to have a chat with them. She was a woman of very little self-control, and she would have felt an extraordinary strain in talking to men in whom she was so anxious to be connected. She had grown quite aware of the dangers of these visits. She felt that she had the moral to avoid them. It was the same with her husband.

One of the dangers she had to observe was whether they should try to give her good advice on an important subject; then she could say nothing but good things about them. And she did not give you any reason why.

And so he took her into another house, where she found him dead. Here is the explanation of one of our own writers, that was born into slavery, and who has since been made a public writer.

In the first place, we say that his wife knew, but she did not have sufficient excuse for going back to him. For she had been married to his eldest son, whom he kept with her, but she had never had a son, not that she was less prone to be the subject of any kind of cruelty than any other woman.

She was a woman well grounded, and could be trusted not to become a mistress or to marry a knave. Now, the reason why Master Smith was the property of Mr. Smith was, that there were several houses there, called for the use of this one Mr. Smith, and he was not there, but was there at a different time; so he did not know him or know of his own family.

But of course this also became a reason for the destruction of his father’s property; for those other houses were to be used by Mr. Smith at a later time, and hence there was no excuse to lose them. And he knew of the condition of his father’s property, and that he had been made a public writer.

Now he said to Master Smith,
“What is the reason of your going to me at this time, that I do not go to him myself?”

“Well, I have said to him before, I do not go to him, because my father was in debt to you.”

“How did I know it?”

“He had a contract of sale of my house, for you had a lease for the premises. You and your father made a bargain, and did not pay; but he does me to make good my debt. He gave me this lease, but left the property to me.”

“What can I do for it?”

“I will not go with you.”

And Mr. Smith went away in his ship; and while he was doing this, he heard the voice of his father’s son, who was living in the ship, saying to him,

“Your father has made you a public writer, at a time when I should have told you everything that you ought to know about me; and he is coming to me to sell my property for your property, and that we must not meet there till the day after the feast.”

As it happened, at the end of his talk with Mr. Smith, who would then have come to him, he heard of the destruction of Mr. Smith’s house, and of the damage done, for a great many months afterwards, to the houses belonging to his son, because of that man’s writing; and Mr. Smith went away in his ship, that there might be no necessity of his telling Mr. Brown that he might not receive anything from him from him without reason.

And it was, at that time, a great pleasure to him to have the ship and his house in the same box, and he said nothing of it.


Original


Pride and Prejudice

Jane Austen 1813

It is a truth universally acknowledged, that a single man in possession of a good fortune, must be in want of a wife.

However little known the feelings or views of such a man may be on his first entering a neighbourhood, this truth is so well fixed in the minds of the surrounding families, that he is considered the rightful property of some one or other of their daughters.

“My dear Mr. Smith,” said his lady to him one day, “have you heard that Manorfield Park is let at last?”

Mr. Smith replied that he had not.

“But it is,” returned she; “for Mrs. Long has just been here, and she told me all about it.”

Mr. Smith made no answer.

“Do you not want to know who has taken it?” cried his wife impatiently.

You want to tell me, and I have no objection to hearing it.”

This was invitation enough.

“Why, my dear, you must know, Mrs. Long says that Manorfield is taken by a young man of large fortune from the north of England; that he came down on Monday in a chaise and four to see the place, and was so much delighted with it, that he agreed with Mr. Morris immediately; that he is to take possession before Michaelmas, and some of his servants are to be in the house by the end of next week.”

“What is his name?”

“Bingley.”

“Is he married or single?”

“Oh! Single, my dear, to be sure! A single man of large fortune; four or five thousand a year. What a fine thing for our girls!”

“How so? How can it affect them?”

“My dear Mr. Smith,” replied his wife, “how can you be so tiresome! You must know that I am thinking of his marrying one of them.”

“Is that his design in settling here?”

“Design! Nonsense, how can you talk so! But it is very likely that he may fall in love with one of them, and therefore you must visit him as soon as he comes.”

“I see no occasion for that. You and the girls may go, or you may send them by themselves, which perhaps will be still better, for as you are as handsome as any of them, Mr. Bingley may like you the best of the party.”

“My dear, you flatter me. I certainly have had my share of beauty, but I do not pretend to be anything extraordinary now. When a woman has five grown-up daughters, she ought to give over thinking of her own beauty.”

“In such cases, a woman has not often much beauty to think of.”

“But, my dear, you must indeed go and see Mr. Bingley when he comes into the neighbourhood.”

“It is more than I engage for, I assure you.”

“But consider your daughters. Only think what an establishment it would be for one of them. Sir William and Lady Lucas are determined to go, merely on that account, for in general, you know, they visit no newcomers. Indeed you must go, for it will be impossible for us to visit him if you do not.”

“You are over-scrupulous, surely. I dare say Mr. Bingley will be very glad to see you; and I will send a few lines by you to assure him of my hearty consent to his marrying whichever he chooses of the girls; though I must throw in a good word for my little Lizzy.”

“I desire you will do no such thing. Lizzy is not a bit better than the others; and I am sure she is not half so handsome as Jane, nor half so good-humoured as Lydia. But you are always giving her the preference.”

“They have none of them much to recommend them,” replied he; “they are all silly and ignorant like other girls; but Lizzy has something more of quickness than her sisters.”

“Mr. Smith, how can you abuse your own children in such a way? You take delight in vexing me. You have no compassion for my poor nerves.”

“You mistake me, my dear. I have a high respect for your nerves. They are my old friends. I have heard you mention them with consideration these last twenty years at least.”

“Ah, you do not know what I suffer.”

“But I hope you will get over it, and live to see many young men of four thousand a year come into the neighbourhood.”

“It will be no use to us, if twenty such should come, since you will not visit them.”

“Depend upon it, my dear, that when there are twenty, I will visit them all.”

Mr. Smith was so odd a mixture of quick parts, sarcastic humour, reserve, and caprice, that the experience of three-and-twenty years had been insufficient to make his wife understand his character. Her mind was less difficult to develop. She was a woman of mean understanding, little information, and uncertain temper. When she was discontented, she fancied herself nervous. The business of her life was to get her daughters married; its solace was visiting and news.


Notes

Prompt was 10-20 words from the original. GPT-2 system by Fabrice Bellard – see the site below for credits etc.
Made on my creativity app Story Live – please visit.

Text from Gutenburg free classic ebooks

A certain amount of cherry-picking of the most grammatical and interesting generated parts, but no actual editing (moving around of words or new words). Identifying names are changed.

More of these will be posted, see the index.

Fake News – what do real writers think about it? AI research

I examined in August 2020 research (UAL London see credits) what happens when writers use a computer text generator to write articles, giving them only an image prompt. There is a summary of the Study in the Index.

Go to Index of AI research

Fake News – (or ‘fake news’) what do real writers think about it?

Summary 

  • People who have a lower general opinion of text generation, and lower wordcounts for feedback, don’t think text generation is relevant to ‘fake news’.

The converse, that people who are more engaged with text generation (more feedback, more positive) can see that it is more related to fake text generation.

So, if people are educated on the topic, they will realise there is a relationship between advances in computer text simulation and fake news.

Definition of fake news

Even defining fake news is not that simple, since if you look for examples in the USA, or Syria, or anywhere else, both sides accuse the other of using it. Depends on there being no consensus as Barak Obama mentioned recently. See the bottom of the blog for some discussion.

Fake News

One of the feedback questions in the Study was about whether computer AI text generation was relevant to ‘fake news’ (false content for spam, propaganda etc.).

Q6-2: If you have heard of ‘fake news’ do you think this is relevant?

This had neutral phrasing in order not to influence replies. It did not say ‘text generator can make fake news’ as this presupposes a technical knowledge that might not be present, even after doing the experiments. Given the sometimes bizarre output of the generator, it might affect replies too specifically.

18 of the 82 respondents – 22% – replied to the question. It was the last question after a long session so perhaps some fatigue had set in. It was also a question not specifically to do with the experiment. About a quarter is still quite a high response rate.

Analysis of responses

I will be examining the actual text answers in another blog. This just looks at some basic relationships.

Several comparisons of the data were calculated and some are worth discussing here (the others are in reference material).

Comparing answer word count with positive – negative ratings:

The actual Q6 answer text was assessed for Positive – Neutral – Negative response to the ‘relevant to fake news’ question.

For instance, the text answer ‘No’ was 2 for negative, word count 1. ‘I stay out of these type debates.’ Is 1 for neutral, word count 7.

This was quite easy to do. I did not use a computer sentiment analyser as the samples were too small. Anything a bit ambiguous or with conflicting statements was rated Neutral.

Pos – neg is 0 (positive), 1 (neutral), 2 (negative) – these are scaled up so can be seen against the word count values. So low red columns are positive (yes, relevant to ‘fake news’) , high red is negative (not relevant to ‘fake news’). This is an ordinal scale.

fake news vs word count.png

Above: Pos-neg is zero for positive for ‘relevance’ to ‘fake news’. The red Pos-neg 0-1-2 value is scaled to make it easier to see. So a tall red line means negative for relevance to fake news. (There is a discussion of different statistical calculations in a forthcoming blog. The tests used in the study are Mann Whitney U, T-Tests, ANOVAS, and Pearsons, along with various bar charts and boxplots.)

At first glance this shows that higher answer word counts are associated with positive for ‘relevance to fake news’.
Low word counts are associated with negative for ‘relevance to fake news’.
There is only one negative out of the top half of the responses (1-9 on the chart above, 2-8 are pos=0).
The highest word count (column 1 on chart) was from an answer that discussed an actual example of real ‘fake news’, or propaganda in the Syrian War, so was a longer than usual negative response. It could be an outlier. After ruminating they decided it was too hard for a text generator to do, writing ‘news of any kind has to be created by humans…’ which is a value judgement (humans are best).

Statistics

Mann Whitney U test

The z-score is 4.25539. The p-value is < .00001. The result is significant at p < .05.

  • People who used the least words thought text generation was less relevant to ‘fake news’.
  • People who used more words in their answer thought it was more relevant to ‘fake news’.

Discussion

In other work, computer Sentiment Analysis revealed that all feedback answers had high scores for ‘Tentative’ and very low (no score) for ‘Confidence’. These results could be because of a general lack of experience of text generation (only 11% had previous experience).
There is a forthcoming blog on the Sentiment Analysis, there is already a blog on Sentiment Emotions (Joy etc.).

When making a positive claim (‘there is relevance’) with no ‘Confidence’ and feeling very ‘Tentative’, discursive answers are to be expected.

In the positive answers, the relevance of ‘fake news’ produced more discussion (‘Yes, and here’s why…’). This could be because the respondent felt it was relevant, but had little or no confidence in their feelings and supplied a tentative, more wordy, answer.

Whereas with a negative response, text generation is not relevant to ‘fake news’, and can be easily dismissed (‘No’ – 1 word).

Ranked Likert scores over the Study vs Pos-neg

Fake-ranked-Likert-Pos-neg.png
Mann-Whitney U Test
The z-score is 5.10963. The p-value is < .00001. The result is significant at p < .05.

This is not visually very clear until you sum the raw (before scaling for graph) scores from columns 1-8 (6) and 2-18 (10).

  • This shows that lower Likert scores (which means positive response to experiments and later questions) relates to lower Pos-neg scores (which means positive relevance to ‘fake news’). This is what would be expected.
  • People who have a lower general opinion of text generation, don’t think text generation is relevant to ‘fake news’ (and possibly anything else of practical use).

Amateur – Professional

People were asked to select Amateur/Professional (or both, scored as Pro) next to Occupations. Most people had more than one occupation (Poet and Journalist etc.) so it is not a fixed job allocation, but provides an indication that the writer either is or would like to be paid for at least some of their work.

This did not provide any significance at the ‘fake news’; response level.

There is a forthcoming blog on Amateur/Professional differences.

Comments

Is there a connection between text generation and ‘fake news’?

1/ Human pride: anthropocentrism: people that dismiss the use of text generation and its relevance to ‘fake news’, say ‘news of any kind has to be created by humans…’.

2/ The idea that a computer could not be used to write news, ‘fake’ or otherwise, might be due to the rather random output of the mid-level (GPT-2) text generator used in the Study. People might think it is producing readable nonsense, and brusquely dismiss any practical uses. This is shown in the relation between overall negative scores (Likerts) and low relevance for ‘fake news’.

3/ Perhaps dismissal of a text generation tool in regard to ‘fake news’ is indicative of not believing there is ‘fake news’ (or that it is not a problem), shown in the low word counts of their short negative responses. Further research required.

Notes

1/ Answers

If you have heard of ‘fake news’ do you think this is relevant?

Replies went from the four shortest (1 or 2 words)

If you have heard of ‘fake news’ do you think this is relevant?

No (scored 2, neg)

No! (scored 2, neg)

Sure (scored 0, pos)

Not really (scored 2, neg)

to the four longest (89, 85, 76, 50 words; 2,2,0,0)

If you have heard of ‘fake news’ do you think this is relevant?

“Unless the AI used to generate the text is extremely advanced, I think any news text generated wouldn’t be very effective — fake or otherwise. I think, at this point in our technology, news of any kind has to be created by humans to effectively educate, manipulate, or make humans react like chickens with their [thinking] heads cut off. If a text generator is going to be effective at writing propaganda, it’s skills are going to have to increase a heck of a lot. Ask again in five years.” (scored 2, neg)

“Very difficult to unpack – particularly as the different sides each claim the other side is distributing fake news. Having followed the Syrian War closely, I know for a fact our major news channels spouted fake news. Could something like this be used in social media to reply with biased opinions? Perhaps, but the existing troll farms used by all sides are often quite sophisticated, with the least sophisticated being the least effective. So, this type of low level autogeneration would not be very effective.” (scored 2, neg)

“Oh yes! Too many people are not analyzing the sources. Maybe that is too big a job or most and maybe many do not know to do this. My son repeats this claptrap to me as if it is gospel and he does not understand me when I tell him that his source is untrustworthy. He thinks if he hears something from dozens of people it must be true. Mass media is powerful in that way. “ (scored 0, pos)

“It is an enabler of fake news (eg allowing much faster fake news to be created – especially if fed in the analytics re effectiveness of dissemination etc.) but could also make REAL news reporting more efficient and effective especially if it supports optimum use of human vs AI input.” (scored 0, pos)

The text samples were too small to use computer sentiment analysis, so I scored the replies myself (human accuracy is still the best for nuanced material). Much training data for sentiment analysis is annotated by humans before use. See references.

Positive – is relevant – 0

Neutral – don’t know, don’t care or not sure – 1

Negative – is not relevant (not good enough, unsuitable, not required) – 2

How done

Sum Likerts (inverting the sixth Q (Q3/of 6 at end), the third after the 3 experiments), rank, then compare to texts, which are scored

0 pos (relevant to fake news), 1 neutral, 2 neg (not relevant to fake news).

Note: Q3/6: Do you feel that you have used somebody else’s work?

This Likert scored the other way, ie, the first option (score 1) was the most negative (is somebody else’s, ie plagiarism).

So in the calculations this score was inverted (1 become 5 etc.).

Other Calculations

1/
Overall Likert score (low positive affect) vs Fake Pos-neg relevant – neutral – not relevant

fake news vs likert.png

There is no significance to this relationship.

2/

vs. word count vs Fake news Pos-neg relevant – neutral – not relevant

fake news vs word count.png

References

Sentiment analysis papers:

https://www.kdnuggets.com/2020/06/5-essential-papers-sentiment-analysis.html

What is fake news?

Important researcher: Xinyi Zhou
These are the most recent papers on fake news:

A Survey of Fake News:

https://dl.acm.org/doi/10.1145/3395046

Fake News Early Detection: 

https://dl.acm.org/doi/10.1145/3377478

Generally, fake news is false or distorted information used as propaganda for immediate or long-term political gain. This definition separates it from advertising, which has similar approaches but is for brand promotion rather than life or death matters. False can be anything from an opinion to a website which appears to be proper news but is actually run to spread false information.

This has led to ‘reality checks’ where claims are checked against reality (which still exists) but the problem is that fake news spreads very quickly (because it appeals to the emotions, often fear or hate) while corrections take time to process and are very dull, so hardly anyone reads them, as they have no emotional content. Certainly the people that react to the fake news (if it is proven so) have moved on already, to the next item.

 

 

 

 

 

 

Stats explanation – boxplots

BOXPLOTS or ‘box-and-whisker’ plots

Go to Index of AI research

I will try to explain what the boxplot, a visual summary, or graphic visualisation of data, means by showing actual data.

Occupation vs Tentative

Occupation vs TentativeThis plots data for Occupation (eg, Other, Student, Scribbler, etc) against Sentiment scores for the emotion Tentative (using text analysis of their written feedback).

Looking at all of the different things on the boxplot, you can see:

    • a coloured (usually)

box

    • or

rectangle

    • a

vertical line

    • somewhere in the middle of the box

 

    • horizontal

lines

    • (but not always) coming out left and right

dot

    • or dots (but not always) on the same level but not on the lines

 

    • the data might show as just

one vertical line

    .

Boxplots show:

    • many features of the raw data in a simple way, and

 

    the distribution of a continuous variable.

The box edges to left and right are also called hinges. The vertical line in the middle is the median value (middle value of all the numbers). The horizontal lines are also called whiskers. (This is the Tukey method, see references at bottom.)

The boxplot shows five summary statistics:

    • the median

 

    • two hinges or edges of the box, the quartiles

 

    • with up to two lines or whiskers, showing the other quartiles

 

    • and all outlying (outlier) points individually as dots

 

    any consequently, skewing of data from a symmetrical normal distribution

Example
Now we will look at how one of these graphics is made from the raw data.

If you look at one of the horizontal graphics for occupation – Poet (sixth down):

Poet data Tentative boxplot
Poet data Tentative boxplot

Raw data and graphic explanation
First the numbers are sorted (ranked). Look at this breakdown, below, of where the numbers are in relation to the median (middle) value, and then how this related to the boxplot.
The data is the score on the sentiment analyser for tentative-related words, higher means more, score can be 0 to 1.0.

Data poet tentative explanation
Data poet tentative explanation

Other cases

The one below has just a single line instead of a box, because there is only one data point (0.87) – so you can get a gappy-looking boxplot, that is OK.

tentative-scientist


Notes

A boxplot helps to visualise the distribution of the data by quartile and show any outliers.

The plot above visualises five summary statistics, the median, two hinges or edges, and two whiskers or lines, and all outlier points individually as dots.

The box (coloured rectangle) always extends from the 25th to 75th percentiles. These sometimes called the ‘hinges’ of the plot.

The line in the middle of the box is plotted at the median.

Quartile: a type of quantile which divides the number of data points into four more or less equal parts, or quarters.

Quantile: in statistics and probability, quantiles are cut points dividing the range of a probability distribution into continuous intervals with equal probabilities, or dividing the observations in a sample in the same way.

Outliers: examination of the data for observations that are far removed from the mass of data (which could be for unrelated or distracting issues, or not).

Practical note: In the boxplot above, the data (which is from the experiment, saved as CSV files, and then imported into Excel for data cleaning (tidying up gaps etc. from the CSV format). From Excel it is then used in R statistical package.

References

General statistics calculators (great sites)

https://www.socscistatistics.com/tests/mannwhitney/

https://goodcalculators.com/statistics-calculators/

Boxplots (this is the best introduction)

Box Plot Explained: Interpretation, Examples, & Comparison

Wiki
https://en.wikipedia.org/wiki/Box_plot
R and boxplots
https://www.statmethods.net/graphs/boxplot.html

The box and whiskers plot was first introduced in 1970 by John Tukey, who later published on the subject in 1977.
John W. Tukey (1977). Exploratory Data Analysis. Addison-Wesley.

Computer-Human Hybrid AI Writing and Creative Ethics

Introduction

This blog is about my 2020 research into computer text generation and the effects on professional ands amateur writers. I am working on this topic at the University of the Arts London (UAL CCI, Dir. Mick Grierson).

No-one has asked creatives or writers what they think of the new ‘AI’ systems that generate readable text and so directly threaten their jobs, and could change the way people work forever (or don’t work forever). This is a topic that directly impinges on self-worth and financial worth in more ways than anyone can imagine, although plenty are worrying.

STUDY – ONLINE EXPERIMENT
August-October 2020

I devised an online experiment about this topic, allowing respondents to experiment with creating hybrid stories using a text generator. The people were all professional or serious amateurs (and a couple of small students) invited from my own creative writing software mailing list, a couple of writing forums, and a publisher’s writers’ forum, plus friends and relatives who generally use writing in their work. Credits are at the bottom.

Text generation

You might have heard of Google OpenAI’s GPT-2 and GPT-3. My experiment uses a generating system (Fabrice Bellard’s Text Synth, with permission)  based on GPT-2, that anyone can use. GPT-2 was used here as the model works well for idea generation and is more generally available at the time than GPT-3, which is much larger.

Note: The text generation and editing system is now a free online tool (creativity support tool or CST) at

Story Live writing with AI free online

The experimental results will feed into this blog (see Index for different aspects) and later an academic paper, and also a new book for the general public on the whole subject of computers, creativity and writing.

Please sign up for news and notifications – there’s a form on this page.

Brief description of the Study

Below is a graphic of the entire online study. Each block is a page and journey was left to right from top to bottom. The three text generation and editing experiments used a similar set up to the Story Live tool.

Each writing experiment – Caption, News and Fiction – had a question afterwards, then there were more questions after the experiments (see diagram below). All this will be addressed in blogs here, along with other discussions.

The image writing prompt was the same for each experiment and for all respondents for uniformity (there is a blog on the man and dog here).

Prompt image man and dog
Prompt image man and dog
Flowchart of Study

Geoff Davis

The computer support tool (CST) from this study is Story Live writing with AI free online

My other creativity tools are Notes Story Board and Story Lite from my Story Software. For my other activities please see the home page of this site.

Study

This study was devised and the site programmed by Geoff Davis for post-graduate research at University of London Creative Computing Institute UAL CCI 2020. The Supervisor is Professor Mick Grierson, Research Leader, UAL Creative Computing Institute.

Text Synth

Text Synth, by Fabrice Bellard, is a publicly available text generator, was used as this is the sort of system people might use outside of the study. It was also not practical to recreate (program, train, fine-tune, host) a large scale text generation system for this usability pre-study. Permission was granted to use Text Synth in the study by Fabrice Bellard Jul 7 2020.

Fabrice Bellard, coder of Text Synth.
Fabrice is an all-round genius and writes a lot of OS. Text Synth was built using the GPT-2 language model released by Google OpenAI. It is a neural network of 1.5 billion parameters based on the Transformer architecture.