Wrangling chapter review #235

leem44 · 2021-08-17T20:48:41Z

@trevorcampbell @ttimbers Please have a look at this PR and let me know if I should make any changes!
This PR addresses: #103, #160

Note: I have not yet removed map from this chapter -- I figure its easier to get any feedback on this, make changes and then create a branch off of this to remove map

…n above

…s, piping in data frame as per reviewers comments

…n above

…s, piping in data frame as per reviewers comments

leem44 · 2021-09-20T15:00:29Z

@ttimbers Changes look good! Only thing is the one below "Montreal" is spelled "Montreak"

Also changed the vector figure to be of type character, as previously it was called an integer, but then we actually made a double vector in code... I tried to swap it to double, but then it was too confusing for a first example. Character vector is a simpler and thus better first example:

ttimbers · 2021-09-20T16:12:12Z

Good catch!

…ta from the canlang package as I don't think we want to add that complexity...

ttimbers · 2021-09-20T23:47:39Z

I also removed loading data from the canlang package and instead get them to read the data from files. I don't think we really explain data packages and they are kind of weird, so best to leave out I think?

leem44 · 2021-09-21T16:00:52Z

Yes, I agree with that! Makes sense!

ttimbers · 2021-09-22T05:29:01Z

In Fig 3.10 we illlustrate going from longer to wider, but each table is the same number of columns. Yes, this is technically something that happens, but for our first teaching case we really should have a wider dataframe that we make longer:

Also, where did the values for commuters come from? I imagine population came from canlang, but I don't think that data set has commuters? Other columns we could add from canlang could be ~~area~~ dwellings or households (we need something that is a count to stick with the narrative)?

ttimbers · 2021-09-22T16:49:15Z

I think we should remove the & operator from here as we never use it elsewhere in the book or course. So it's just redundant and extra info that they do not "need" to know:

ttimbers · 2021-09-22T18:25:38Z

@leem44 - I am also removing the demonstration of filter not working on numeric data that is in the mutate section, as it is rendundant to what we wrote above in the section about convert and separate.

ttimbers · 2021-09-22T18:55:36Z

@leem44 - in the wrangling chapter, are you OK to move the pipe to before mutate and after select and filter? It uses select and filter so is timely after those two. Also, then I can use the |> in the setup for the as.numeric transformation for mutate so the creation of the data frame with the character columns that should be numbers is a little less magical?

If I do that, then I will use arrange in the pipe preamble since that was intro'ed in chapter 1, but mutate won't have been demo'd yet. Also, I think that works nicely since we use arrange in the code example we show that follows.

ttimbers · 2021-09-22T19:05:33Z

Maybe not actually - I am now fence sitting on this change, as it causes cascading changes...

Decided not to do this.

ttimbers · 2021-09-22T23:43:12Z

Simplified the example to just plot the proportion of people speaking English as the primary language at home.
Then we can skip case_when and vector recycling and keep the narrative pretty similar.

ttimbers · 2021-09-23T06:50:02Z

Done all sections except for summarize + across and purrr::map* (I did do rowwise which comes after these).

…ll need to add back in how to deal with NAs for summarize + across and purr map

ttimbers · 2021-09-26T19:35:20Z

OK, I think we are ready to merge. I added a bunch of images into the wrangling chapter, and re-organized the section where we do aggregation. I am pretty excited about it. @leem44 - I made quite a few changes to the wrangling chapter to (hopefully) simplify it. I hope that is OK. If there is anything I removed that you really want to keep in, please let me know and we can talk and find a way to keep it.

ttimbers · 2021-09-27T03:38:52Z

@leem44 gave me the go-ahead to merge, so merging!

leem44 and others added 30 commits June 27, 2021 13:34

updating the wide/long sections as per reviewer E suggestions

5b1fc62

adding convert argument to separate function

fb7c142

updating mutate section to account for new convert in separate sectio…

6b31e5c

…n above

editing piping section to include when you might use temporary object…

c03342a

…s, piping in data frame as per reviewers comments

updating piping section as per Reviewer Cs suggestion

bea4ae2

adding why wide is bad

8c4e76c

minor changes

7b729c2

changing column width in lang_long table so we can read all the rows

bb989f6

adding example for tibble

5f2d235

adding summarize_if

4a11419

adding section about select helpers

31cdcdf

added section on summarize +across

a0f9749

doing a pass through the chapter and editing grammar/spelling/logic

9b9f0c6

removing summarize_if since we have across

80df294

minor change

59dedd9

updating numbers in pivot longer table to match data frame

4a7b8a6

merging with remote branch

b94e2dc

adding image explanation for separate function

fd487c5

moving section down to additional resources

23a7442

wrapping text

c5b8533

making corrections to the formatting

c5dbe60

fixing code box in wrong place

b74c232

updating numbers in pivot longer table to match data frame

c74feab

updating the wide/long sections as per reviewer E suggestions

48ad252

adding convert argument to separate function

c7206bb

updating mutate section to account for new convert in separate sectio…

86e1b0d

…n above

editing piping section to include when you might use temporary object…

3b6c87e

…s, piping in data frame as per reviewers comments

updating piping section as per Reviewer Cs suggestion

0e969d5

adding why wide is bad

6d5536a

minor changes

d7bdff8

worked on tidying from wider to longer wording and removed loading da…

5551639

…ta from the canlang package as I don't think we want to add that complexity...

leem44 linked an issue Sep 21, 2021 that may be closed by this pull request

Review: Ch 3 (wrangling) #103

Closed

fixed wrong tidy image

0df1d18

wording changes to tidy data section

3fbf22e

ttimbers added 4 commits September 22, 2021 12:07

wording changes up to the end of mutate

6858f74

improved image size for fig 02-plot

a34e1ac

simplified mutate as a new column example

93d366e

small plot changes related to simplifuing the mutate example

cd611ba

ttimbers added 2 commits September 22, 2021 23:12

wording changes for the pipe section

3a651da

edited wording in rowwise section

d03b1d0

ttimbers added 4 commits September 25, 2021 12:10

reorganized and simplied the summarize/purrr map/rowwise section. Sti…

15ec161

…ll need to add back in how to deal with NAs for summarize + across and purr map

added NA section for summarize + across

ec9a2af

Fixed images for pivoting and added images for aggregating

2bc1b8f

tried to keep most text and all code to 80 characters

ee93e23

merging dev into wrangling

5d2abc1

ttimbers merged commit bc73e02 into dev Sep 27, 2021

ttimbers deleted the review_wrangling branch October 19, 2021 02:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrangling chapter review #235

Wrangling chapter review #235

leem44 commented Aug 17, 2021 •

edited

Loading

leem44 commented Sep 20, 2021

ttimbers commented Sep 20, 2021

ttimbers commented Sep 20, 2021

leem44 commented Sep 21, 2021

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 23, 2021

ttimbers commented Sep 26, 2021

ttimbers commented Sep 27, 2021

Wrangling chapter review #235

Wrangling chapter review #235

Conversation

leem44 commented Aug 17, 2021 • edited Loading

leem44 commented Sep 20, 2021

ttimbers commented Sep 20, 2021

ttimbers commented Sep 20, 2021

leem44 commented Sep 21, 2021

ttimbers commented Sep 22, 2021 • edited Loading

ttimbers commented Sep 22, 2021 • edited Loading

ttimbers commented Sep 22, 2021

ttimbers commented Sep 22, 2021 • edited Loading

ttimbers commented Sep 22, 2021 • edited Loading

ttimbers commented Sep 22, 2021 • edited Loading

ttimbers commented Sep 23, 2021

ttimbers commented Sep 26, 2021

ttimbers commented Sep 27, 2021

leem44 commented Aug 17, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading

ttimbers commented Sep 22, 2021 •

edited

Loading