Title: | The Entire Transcript from Friends in Tidy Format |
Version: | 0.1.0 |
Description: | The complete scripts from the American sitcom Friends in tibble format. Use this package to practice data wrangling, text analysis and network analysis. |
License: | MIT + file LICENSE |
Encoding: | UTF-8 |
LazyData: | true |
RoxygenNote: | 7.1.1 |
Depends: | R (≥ 2.10) |
Imports: | tibble |
URL: | https://github.com/EmilHvitfeldt/friends |
BugReports: | https://github.com/EmilHvitfeldt/friends/issues |
NeedsCompilation: | no |
Packaged: | 2020-08-29 02:28:29 UTC; emilhvitfeldthansen |
Author: | Emil Hvitfeldt |
Maintainer: | Emil Hvitfeldt <emilhhvitfeldt@gmail.com> |
Repository: | CRAN |
Date/Publication: | 2020-09-03 07:22:11 UTC |
The transcript of Friends
Description
Each season consists of episodes, each episode is divided into scenes, each scene comprises utterances. One utterance per row. Emotion annotation is included when available.
Usage
friends
Format
A tibble with 67,373 rows and 5 variables:
text
, speaker
, season
, episode
,
scene
and utterance
.
Source
https://github.com/emorynlp/character-mining
Emotions for transcript of Friends
Description
This tibble contains the emotions for the utterances where it is available.
Usage
friends_emotions
Format
A tibble with 12,606 rows and 5 variables:
season
, episode
, scene
, utterance
and
entities
.
Source
https://github.com/emorynlp/character-mining
https://github.com/emorynlp/emotion-detection
Character Entities for transcript of Friends
Description
This tibble contains the character entities for the utterances where it is available.
Usage
friends_entities
Format
A tibble with 10,557 rows and 5 variables:
season
, episode
, scene
, utterance
and
entities
.
Source
https://github.com/emorynlp/character-mining
Episode Information
Description
This tibble contains additional information about each of the episodes. Information was sourced from Wikipedia and IMDb on August 26th, 2020.
Usage
friends_info
Format
A tibble with 236 rows and 8 variables:
season
, episode
, title
, directed_by
,
written_by
, air_date
, us_views_millions
and
imdb_rating
.