Bermain dengan tweet, putaran 1

Masih ingat bahwa pada bab-bab sebelumnya Anda bekerja sebagai analis data di sebuah agensi web? Kinerja Anda sangat baik dan kini Anda mendapat proyek baru ;) Pada bab ini, Anda akan menganalisis jenis data baru: keluaran JSON.

Tim engineering memberikan keluaran pengumpulan data yang berisi tweet, dikumpulkan selama RStudio Conf 2018. Karena himpunan data ini berupa JSON, Anda membacanya sebagai list bertingkat di R.

Pertama, Anda ingin melakukan eksplorasi dasar terhadap himpunan data ini, dan purrr akan sangat membantu. Paket telah dimuat untuk Anda, dan himpunan data rstudioconf tersedia di workspace Anda.

Catatan: jangan mencoba mencetak seluruh himpunan data — ukurannya terlalu besar untuk ditampilkan di konsol datacamp.

Harap diingat bahwa ini adalah data nyata dari Twitter dan karena itu selalu ada risiko mengandung kata-kata kasar atau konten ofensif lainnya (dalam latihan ini, dan latihan berikutnya yang juga menggunakan data Twitter nyata).

Latihan ini adalah bagian dari kursus

Pemrograman Fungsional Tingkat Menengah dengan purrr

Petunjuk latihan

Cetak elemen pertama dari list untuk mendapatkan gambaran umum konten dan strukturnya.
Karena Anda ingin berfokus pada tweet yang orisinal (bukan retweet), buat sublist non-retweet menggunakan elemen logis "is_retweet" yang terdapat di setiap sublist.
Ekstrak elemen "favorite_count" dari setiap elemen sublist baru ini menggunakan varian map_* untuk bilangan integer.
Dapatkan median dari hasil sebelumnya.

Latihan interaktif praktis

Cobalah latihan ini dengan menyelesaikan kode contoh berikut.

# Print the first element of the list to the console 


# Create a sublist of non-retweets
non_rt <- ___(___, "is_retweet")

# Extract the favorite count element of each non_rt sublist
fav_count <- ___(___, "favorite_count")

# Get the median of favorite_count for non_rt
___(___)

Edit dan Jalankan Kode

Latihan ini adalah bagian dari kursus

Pemrograman Fungsional Tingkat Menengah dengan purrr

SkillTag.level.intermediateSkillTag.label

4.8+

Mulai Kursus Gratis

Do lambda functions, mappers, and predicates sound scary to you? Fear no more! After refreshing your purrr memory, we will dive into functional programming 101, discover anonymous functions and predicates, and see how we can use them to clean and explore data.

Exercise 1: purrr basics - a refresher Exercise 2: Refreshing your purrr memory Exercise 3: Another purrr refresher Exercise 4: Introduction to mappers Exercise 5: Creating lambda functions Exercise 6: Lambda functions Exercise 7: Using mappers to clean up your data Exercise 8: Clean up your data with keep Exercise 9: Split up with keep() and discard()Exercise 10: Predicates Exercise 11: What is a predicate?Exercise 12: Exploring data with predicates

Ready to go deeper with functional programming and purrr? In this chapter, we'll discover the concept of functional programming, explore error handling using including safely() and possibly(), and introduce the function compact() for cleaning your code.

Exercise 1: Functional programming in R Exercise 2: Everything that happens is a function call Exercise 3: Identifying pure functions Exercise 4: Tools for functional programming in purrr Exercise 5: Safe iterations Exercise 6: Create a function Exercise 7: Using possibly()Exercise 8: A possibly() version of read_lines()Exercise 9: Everything in one call Exercise 10: Handling adverb results Exercise 11: Purrrfecting our function Exercise 12: Extracting status codes with GET()

In this chapter, we'll use purrr to write code that is clearer, cleaner, and easier to maintain. We'll learn how to write clean functions with compose() and negate(). We'll also use partial() to compose functions by "prefilling" arguments from existing functions. Lastly, we'll introduce list-columns, which are a convenient data structure that helps us write clean code using the Tidyverse.

Exercise 1: Why cleaner code?Exercise 2: How to write compose()Exercise 3: Back to the office Exercise 4: Building functions with compose() and negate()Exercise 5: Build a function Exercise 6: Count the NA Exercise 7: Prefilling functions Exercise 8: A content extractor Exercise 9: Another extractor Exercise 10: List columns Exercise 11: About list-columns Exercise 12: Create a list-column data.frame

We'll wrap up everything we know about purrr in a case study. Here, we'll use purrr to analyze data that has been scraped from Twitter. We'll use clean code to organize the data and then we'll identify Twitter influencers from the 2018 RStudio conference.

Exercise 1: Menemukan himpunan data Exercise 2: Bermain dengan tweet, putaran 1

Latihan Saat Ini

Exercise 3: Identifikasi profil Exercise 4: Mengekstrak informasi dari himpunan data Exercise 5: Menghitung favorit Exercise 6: Mengekstrak mention Exercise 7: Memanipulasi URL Exercise 8: Menganalisis URL Exercise 9: Bermain dengan URL Exercise 10: Mengidentifikasi influencer Exercise 11: Membagi himpunan data Exercise 12: Kita punya pemenang!Exercise 13: Selamat!