Planning a Ski Trip with R Analysis

Overview

My husband’s loved snowboarding ever since he learned it in college, and he’s always wanted to find some time to visit our local mountains in the winter. So when January came around this year, I called up my friends and arranged a surprise—skiing in the mountains, here we come!

But we didn’t want to lose much time on travel, so we narrowed down our two main locations to Austria and Slovenia. I picked some popular skiing resorts and searched for nearby accommodations, and I saved all this info in a CSV file. Now, it’s time to decide when, where, and for what price we’ll be staying.

And for that, I’ll turn to R. Let’s get started!

Loading a CSV File in R

R can read data and create a data frame from many different sources: Excel, txt, HTML, CSV, MySQL, Oracle… The list goes on.

Simply put, a data frame is a table with rows and columns. We can load my stored trip data (ski_accommodation.csv) into an R data frame with the read.csv function:

ski_acomodation <- read.csv("ski_accomodation.csv", sep=’;’, stringsAsFactors = FALSE, dec =’,’)

After executing this code, we get a ski_acomodation data frame that contains information about various accommodations in Austria and Slovenia.

Let's use the head function to check what this table looks like. head returns the first five rows of the specified R data frame. If you try to execute following command:

head(ski_acomodation)

You’ll get this result:

COUNTRY	DESTINATION	COST_TYPE	ACCOMODATION_NAME	COST_SUBTYPE	DATE	PRICE	RATING	DISTANCE_FROM _SKI_RESORT
SLOVENIA	KRVAVEC	ACCOMODATION	STAL STUDIO	STUDIO	03.01.2019	702	9.6	5.2
SLOVENIA	KRVAVEC	ACCOMODATION	STAL STUDIO	STUDIO	04.01.2019	850	9.6	5.2
SLOVENIA	KRVAVEC	ACCOMODATION	STAL STUDIO	STUDIO	05.01.2019	702	9.6	5.2
SLOVENIA	KRVAVEC	ACCOMODATION	PALIN APARTMENTS	HOUSE	03.01.2019	1020	8.7	3.2
SLOVENIA	KRVAVEC	ACCOMODATION	PALIN APARTMENTS	HOUSE	04.01.2019	990	8.7	3.2

The table contains information about various accommodations and the associated cost of staying there for five people. Each accommodation has three rows, one for each of the days from January 3rd to January 5th. For each accommodation, we also store its rating and distance from the nearest ski resort. Besides accommodation costs, there are also travel costs (like fuel) in this table that we need to consider for reaching each accommodation.

We can easily display the two different cost types (accommodation and travel) with the following command:

unique(ski_acomodations$COST_TYPE)

Here, unique simply takes a vectorized data type (in this case, a column) and returns only unique values. In this case, it returns all unique values from the COST_TYPE column.

For now, lets eliminate travel costs from our data frame. We’re not going to analyze them just yet:

ski_acomodation_1 <- ski_acomodation[!ski_acomodation$COST_TYPE==”TRAVEL”,]

Now it’s time to pick a country to visit: Austria or Slovenia? It would be nice to find a place that is priced reasonably, has an okay rating, and is located near a ski resort.

Below is a graph depicting the prices for Austria and Slovenia:

{"x":{"visdat":{"2aa44a1fce6":["function () ","plotlyVisDat"]},"cur_data":"2aa44a1fce6","attrs":{"2aa44a1fce6":{"x":{},"y":{},"marker":{"size":10},"text":["Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: TRAVEL , Destination: KRVAVEC","Cost type: TRAVEL , Destination: KRVAVEC","Cost type: TRAVEL , Destination: KRVAVEC","Cost type: TRAVEL , Destination: VOGEL","Cost type: TRAVEL , Destination: VOGEL","Cost type: TRAVEL , Destination: VOGEL","Cost type: TRAVEL , Destination: BLED","Cost type: TRAVEL , Destination: BLED","Cost type: TRAVEL , Destination: BLED","Cost type: TRAVEL , Destination: ALPBACH","Cost type: TRAVEL , Destination: ALPBACH","Cost type: TRAVEL , Destination: ALPBACH","Cost type: TRAVEL , Destination: ST ANTON AM ARLBERG","Cost type: TRAVEL , Destination: ST ANTON AM ARLBERG","Cost type: TRAVEL , Destination: ST ANTON AM ARLBERG"],"color":{},"colors":["#5d3087","#d15197"],"alpha_stroke":1,"sizes":[10,100],"spans":[1,20]}},"layout":{"margin":{"b":40,"l":60,"t":25,"r":10},"title":"Price vs Rating","xaxis":{"domain":[0,1],"automargin":true,"title":"PRICE"},"yaxis":{"domain":[0,1],"automargin":true,"title":"RATING"},"hovermode":"closest","showlegend":true},"source":"A","config":{"modeBarButtonsToAdd":[{"name":"Collaborate","icon":{"width":1000,"ascent":500,"descent":-50,"path":"M487 375c7-10 9-23 5-36l-79-259c-3-12-11-23-22-31-11-8-22-12-35-12l-263 0c-15 0-29 5-43 15-13 10-23 23-28 37-5 13-5 25-1 37 0 0 0 3 1 7 1 5 1 8 1 11 0 2 0 4-1 6 0 3-1 5-1 6 1 2 2 4 3 6 1 2 2 4 4 6 2 3 4 5 5 7 5 7 9 16 13 26 4 10 7 19 9 26 0 2 0 5 0 9-1 4-1 6 0 8 0 2 2 5 4 8 3 3 5 5 5 7 4 6 8 15 12 26 4 11 7 19 7 26 1 1 0 4 0 9-1 4-1 7 0 8 1 2 3 5 6 8 4 4 6 6 6 7 4 5 8 13 13 24 4 11 7 20 7 28 1 1 0 4 0 7-1 3-1 6-1 7 0 2 1 4 3 6 1 1 3 4 5 6 2 3 3 5 5 6 1 2 3 5 4 9 2 3 3 7 5 10 1 3 2 6 4 10 2 4 4 7 6 9 2 3 4 5 7 7 3 2 7 3 11 3 3 0 8 0 13-1l0-1c7 2 12 2 14 2l218 0c14 0 25-5 32-16 8-10 10-23 6-37l-79-259c-7-22-13-37-20-43-7-7-19-10-37-10l-248 0c-5 0-9-2-11-5-2-3-2-7 0-12 4-13 18-20 41-20l264 0c5 0 10 2 16 5 5 3 8 6 10 11l85 282c2 5 2 10 2 17 7-3 13-7 17-13z m-304 0c-1-3-1-5 0-7 1-1 3-2 6-2l174 0c2 0 4 1 7 2 2 2 4 4 5 7l6 18c0 3 0 5-1 7-1 1-3 2-6 2l-173 0c-3 0-5-1-8-2-2-2-4-4-4-7z m-24-73c-1-3-1-5 0-7 2-2 3-2 6-2l174 0c2 0 5 0 7 2 3 2 4 4 5 7l6 18c1 2 0 5-1 6-1 2-3 3-5 3l-174 0c-3 0-5-1-7-3-3-1-4-4-5-6z"},"click":"function(gd) { \n // is this being viewed in RStudio?\n if (location.search == '?viewer_pane=1') {\n alert('To learn about plotly for collaboration, visit:\\n https://cpsievert.github.io/plotly_book/plot-ly-for-collaboration.html');\n } else {\n window.open('https://cpsievert.github.io/plotly_book/plot-ly-for-collaboration.html', '_blank');\n }\n }"}],"cloud":false},"data":[{"x":[2484,2210,2494,4056,4105,4200,1848,1848,1848,2395,2230,2230,5581,5481,5481,4017,4017,4017,7310,7310,7100,14569,14569,14569,4302,4302,4302,2000,2150,2030,2250,2500,2300],"y":[8.1,8.1,8.1,9.3,9.3,9.3,8.5,8.5,8.5,6.5,6.5,6.5,8.1,8.1,8.1,8.9,8.9,8.9,8.6,8.6,8.6,8.7,8.7,8.7,9.6,9.6,9.6,0,0,0,0,0,0],"marker":{"color":"rgba(93,48,135,1)","size":10,"line":{"color":"rgba(93,48,135,1)"}},"text":["Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: ALPBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: SAALBACH","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: ACCOMODATION , Destination: ST ANTON AM ARLBERG","Cost type: TRAVEL , Destination: ALPBACH","Cost type: TRAVEL , Destination: ALPBACH","Cost type: TRAVEL , Destination: ALPBACH","Cost type: TRAVEL , Destination: ST ANTON AM ARLBERG","Cost type: TRAVEL , Destination: ST ANTON AM ARLBERG","Cost type: TRAVEL , Destination: ST ANTON AM ARLBERG"],"type":"scatter","mode":"markers","name":"AUSTRIA","textfont":{"color":"rgba(93,48,135,1)"},"error_y":{"color":"rgba(93,48,135,1)"},"error_x":{"color":"rgba(93,48,135,1)"},"line":{"color":"rgba(93,48,135,1)"},"xaxis":"x","yaxis":"y","frame":null},{"x":[702,850,702,1020,990,970,620,650,620,1035,1035,1035,2250,2250,2250,1271,1000,1101,1474,1200,1200,2400,2340,2300,1300,1230,1220,900,950,980,1080,1100,1050,1200,1100,1020],"y":[9.6,9.6,9.6,8.7,8.7,8.7,7.5,7.5,7.5,9.8,9.8,9.8,8.1,8.1,8.1,9.1,9.1,9.1,7.9,7.9,7.9,0,0,0,9.4,9.4,9.4,0,0,0,0,0,0,0,0,0],"marker":{"color":"rgba(209,81,151,1)","size":10,"line":{"color":"rgba(209,81,151,1)"}},"text":["Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: KRVAVEC","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: VOGEL","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: ACCOMODATION , Destination: BLED","Cost type: TRAVEL , Destination: KRVAVEC","Cost type: TRAVEL , Destination: KRVAVEC","Cost type: TRAVEL , Destination: KRVAVEC","Cost type: TRAVEL , Destination: VOGEL","Cost type: TRAVEL , Destination: VOGEL","Cost type: TRAVEL , Destination: VOGEL","Cost type: TRAVEL , Destination: BLED","Cost type: TRAVEL , Destination: BLED","Cost type: TRAVEL , Destination: BLED"],"type":"scatter","mode":"markers","name":"SLOVENIA","textfont":{"color":"rgba(209,81,151,1)"},"error_y":{"color":"rgba(209,81,151,1)"},"error_x":{"color":"rgba(209,81,151,1)"},"line":{"color":"rgba(209,81,151,1)"},"xaxis":"x","yaxis":"y","frame":null}],"highlight":{"on":"plotly_click","persistent":false,"dynamic":false,"selectize":false,"opacityDim":0.2,"selected":{"opacity":1},"debounce":0},"base_url":"https://plot.ly"},"evals":["config.modeBarButtonsToAdd.0.click"],"jsHooks":[]}

It’s obvious from the graph that Slovenia has more acceptable prices. This cool graph was made in R with the help of plot_ly:

plot_ly(data = ski_acomodation, x = ~Price, y = ~Rating,color=~COUNTRY,colors = c(“red”,”blue”),
text=paste(‘Cost type: ‘, ski_acomodation$COST_TYPE,’,’, ‘Destination:’,ski_acomodation$DESTINATION)) %>% layout(title=”Price vs Rating”)

This is a visual approach. We can also prove that Slovenia is cheaper with some simple statistics—we can calculate the average price per night (in HRK) at the country level using this line of code:

sapply(split(ski_acomodation_1$PRICE,ski_acomodation_1$COUNTRY),mean)

R returns two figures: one for Austria, and one for Slovenia:

AUSTRIA 		SLOVENIA 
5143.519 		1296.852

As you can see, Slovenia is much, much cheaper than Austria. Here, we used the sapply function. This is part of a broader family of related functions that we’ll now explore in more detail.

The `apply` Family of Functions

Although R has looping constructs like the for loop that are present in other languages, these aren’t commonly used. Instead of manually looping over data structures and performing repetitive tasks, we often use R apply set of functions to make our job easier.

In data science, it’s a common task to group or slice your data according to a specific key and then call a certain function on each of those slices. To that end, we can use apply/sapply in combination with another function named split.

The `split` Function

As you may have guessed, split divides R data frame into several slices using a specific key. It then returns a list where each element of represents one slice of that data frame. Consider this code:

split(ski_acomodations$PRICE, ski_acomodations$COUNTRY)

R returns the following list:

$AUSTRIA
[1] 2484 2210 2494 4056 4105 4200 1848 1848 1848 2395 2230 2230 5581 5481 5481 4017 4017 4017 7310 7310 7100 14569 14569
[24] 14569 4302 4302 4302

$SLOVENIA
[1] 702 850 702 1020 990 970 620 650 620 1035 1035 1035 2250 2250 2250 1271 1000 1101 1474 1200 1200 2400 2340 2300 1300 1230 1220

Here, each element of the list is a vector of prices for a single country. The first vector is the vector of prices for Austria, and the second is for Slovenia.

Now if we use sapply like this:

sapply(split(ski_acomodation_1$PRICE,ski_acomodation_1$COUNTRY),mean)

R will go through each element of the list (in this case, there are only two elements) and calculate the average value for each. Effectively, this gives us the average accommodation prices for Austria and Slovenia. This is the same as if we had used loops, only it’s much cleaner and easier to understand.

For this trip, we’re not interested in visiting the best ski resorts overall, so we’ll go with the more affordable location—Slovenia, here we come!

Finding the Most Acceptable Location in Slovenia

Now that we’ve narrowed down our country to Slovenia, it’s time to decide what location we’ll be staying at. This time around, I’ll display the average price per ski resort (e.g., Vogel, Krvavec, Bled) in Slovenia:

ski_acomodation_1_SLO <- ski_acomodation_1[ski_acomodation_1$COUNTRY==”SLOVENIA”,]
sapply(split(ski_acomodation_1_SLO$PRICE,ski_acomodation_1_SLO$DESTINATION),mean)

Based on these results, it seems that the Krvavec ski resort has the most acceptable rates:

BLED      KRVAVEC   VOGEL
1629.3333 791.5556  1469.6667

But what about accommodation ratings? If accommodations in Krvavec are also acceptable, we can go ahead and book something there. Once again, we’ll use sapply in combination with split:

sapply(split(ski_acomodation_1_SLO$RATING,ski_acomodation_1_SLO$DESTINATION),mean)

R returns the average rating for each ski resort:

BLED     KRVAVEC   VOGEL
5.766667 8.600000  9.000000

Based on these results, it seems the rating is actually quite good. So far, Krvavec seems like a good choice—it’s got good accommodation prices and a strong rating. But what about travel costs?

By extracting only travel costs and calculating the average for each destination once again, we can confirm that Krvavec is indeed an excellent choice:

ski_acomodation_2_SLO <- ski_acomodation[ski_acomodation$COST_TYPE==”TRAVEL” & ski_acomodation$COUNTRY==”SLOVENIA”,]
sapply(split(ski_acomodation_2_SLO$PRICE,ski_acomodation_2_SLO$DESTINATION),mean)

BLED      KRVAVEC  VOGEL
1106.6667 943.3333 1076.6667

So with all of that out of the way, we’re now ready to take a look at the total cost for three nights at Krvavec and also factor in travel expenses.

The Total Cost for Our Trip

In my CSV file, I stored several accommodations near Krvavec. First, we’ll extract only those that are in Krvavec and then calculate the total costs. Keep in mind that price is expressed per night (remember that there are three rows in the data frame for each accommodation), so we need to sum all three prices together:

ski_acomodation_KRVAVEC <-ski_acomodation_1[ski_acomodation_1$DESTINATION==”KRVAVEC”,]
sapply(split(ski_acomodation_KRVAVEC$PRICE,ski_acomodation_KRVAVEC$ACCOMODATION_NAME),sum)

Here’s the price for staying three nights at each of the accommodations:

COOL HOUSE HOSTEL  PALIN APARTMENTS  STAL STUDIO
2870              3930              3154

Using `tapply` for Group Aggregations

Have you noticed a pattern yet? So far, we’ve been using split and sapply repeatedly. And whenever something is this repetitive in programming, there has to be a better alternative, right?

Well, there is, and its name is tapply. This function is used when you need to split/slice your data with a specific group and then perform some aggregate calculations on each slice. Statistics like average, sum, min, and max are really nice candidates for tapply.

In previous examples, like when we wanted to find the total price per destination, we used sapply with split. Let’s now use tapply; its syntax is cleaner, which makes it easier to understand the code we write. Take a look at the code below:

tapply(ski_acomodation_KRVAVEC$PRICE, ski_acomodation_KRVAVEC$ACCOMODATION_NAME, sum)

This gives us the same result as:

sapply(split(ski_acomodation_KRVAVEC$PRICE,ski_acomodation_KRVAVEC$ACCOMODATION_NAME),sum)

Great! I’m going to use tapply two more times to review each accommodation’s average rating and distance from the ski resort. Remember: We want to take all three parameters (price, rating, and distance) into consideration before booking our stay.

Here’s the code and result for the average rating:

tapply(ski_acomodation_KRVAVEC$RATING, ski_acomodation_KRVAVEC$ACCOMODATION_NAME, mean)

COOL HOUSE HOSTEL PALIN APARTMENTS STAL STUDIO
7.5              8.7              9.6

And here’s each accommodation’s distance from the Krvavec ski resort:

tapply(ski_acomodation_KRVAVEC$DISTANCE_FROM_SKI_RESORT, 
ski_acomodation_KRVAVEC$ACCOMODATION_NAME, mean)

COOL HOUSE HOSTEL  PALIN APARTMENTS  STAL STUDIO
3.6               3.2               5.2

Notice that Stal Studio has the highest rating and is 5 km from the ski resort. Palin Apartments is 3.2 km from ski resort with a good rating of 8.7. But it’s the most expensive accommodation, which is sort of expected—it’s spacious and offers cozy rooms. So, we decided to go with this place and pay 2980 HRK ($464) for three nights. And if we include travel costs as well, this will amount to 3930 HRK ($612):

sum(ski_acomodation[ski_acomodation$ACCOMODATION_NAME==”PALIN APARTMENTS”,]$PRICE)

I’d say that’s a fairly reasonable price for five people over three nights!

Conclusion

Analyzing data by hand or with Excel can certainly take more time than if you use R programming and the convenient functions that we saw here. All you really need is a file with your data, a place to write R scripts, and some basic knowledge of R programming and data science. Learn it online with Vertabelo Academy today!

Marija Ilic

Marija works as a data scientist in the banking industry. She specializes in big data platforms (Cloudera and Hadoop) with software and technologies such as Hive/Impala, Python and PySpark, Kafka, and R. Marija has an extensive background in DWH/ETL development in the banking industry. Her main interests are predictive modeling, real-time decision-making, and social network analysis. Outside of work, Marija enjoys listening to her favorite LPs on her old gramophone—and never grows tired of its soothing crackle.

Planning a Ski Trip with R Analysis

Overview

Loading a CSV File in R

The `apply` Family of Functions

The `split` Function

Finding the Most Acceptable Location in Slovenia

The Total Cost for Our Trip

Using `tapply` for Group Aggregations

Conclusion

Marija Ilic

Going Down to South Park, Part 3: TF-IDF Analysis with R

Done with a Python Basics Course? Here’s How to Write Python Code on Your Own Computer

Overview

Loading a CSV File in R

The apply Family of Functions

The split Function

Finding the Most Acceptable Location in Slovenia

The Total Cost for Our Trip

Using tapply for Group Aggregations

Conclusion

GET ACCESS TO EXPERT CONTENT

Marija Ilic

Going Down to South Park, Part 3: TF-IDF Analysis with R

Done with a Python Basics Course? Here’s How to Write Python Code on Your Own Computer

Related Posts:

New Vertabelo Academy Course: Learn R with Introduction to R

R Jobs and Salaries—All You Need to Know!

Going Down to South Park, Part 1: Text Analysis with R

The `apply` Family of Functions

The `split` Function

Using `tapply` for Group Aggregations