Questions tagged [skimr]

For questions about the R package for flexible creation of summary statistics.

skimr provides a frictionless approach to summary statistics that conforms to the principle of least surprise, displaying summary statistics the user can skim quickly to understand their data. It handles different data types and returns a skim_df object, which can be included in a pipeline or displayed nicely for the human reader.

https://github.com/ropensci/skimr
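
As a rough illustration of the description above, here is a minimal sketch of skimming a data frame and then reusing the result as ordinary tabular data. It assumes skimr 2.x is installed and that the result carries the columns `skim_type`, `skim_variable`, `numeric.mean` and `numeric.sd` (the 2.x naming); the built-in `iris` data set and the variable names `skimmed`, `skim_table` and `numeric_rows` are chosen only for the example.

```r
library(skimr)

# skim() takes a data frame and returns a skim_df object.
skimmed <- skim(iris)

# Printing the object gives the human-readable summary.
print(skimmed)

# The same object can also be treated as plain tabular data.
# Converting to a base data frame first keeps the subsetting below simple.
skim_table <- as.data.frame(skimmed)

# Keep only the rows that describe numeric columns
# (assumes the skimr 2.x column name skim_type).
numeric_rows <- skim_table[skim_table$skim_type == "numeric", ]

# Pull out a few statistics per variable
# (assumes the skimr 2.x column names numeric.mean and numeric.sd).
numeric_rows[, c("skim_variable", "numeric.mean", "numeric.sd")]
```

Converting with `as.data.frame()` is a deliberate simplification here: it drops the skim_df printing behaviour in exchange for predictable base R subsetting.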

23 questions
7
votes
1 answer

Skimr - can't seem to produce the histograms

I came across this seemingly new package, skimr, which looks pretty nifty. I was trying it out, and it looks like I'm missing some package installation. skim works fine except that it doesn't print the histogram it is supposed to print for numeric…
ashleych
  • 1,042
  • 8
  • 25
4
votes
2 answers

Change the order in which summary functions are printed by skim

I'm using skimr, and I added two summary functions (iqr_na_rm and median_na_rm) to the list of summary functions for the function skim. However, by default these new summary functions (called skimmers in skimr documentation) appear at the end of the…
DeltaIV
  • 4,773
  • 12
  • 39
  • 86
3
votes
1 answer

How to see all rows of skimr output without changing its layout, in console as well as in latex pdf output?

I used the skimr package to build a descriptive table. With the View() function it shows a very long table of 600 rows.
koliii
  • 109
  • 1
  • 8
2
votes
0 answers

How do I change the Body Width of the content box of a slide in Quarto

This is driving me crazy. It's just a basic example. This is the Quarto code.

```{r}
#| echo: true
library(tidyverse)
library(skimr)
```

```{r}
#| echo: true
skim(diamonds)
```

The output I get is: The question is: How do I extend the width of…
user23438
  • 445
  • 3
  • 11
2
votes
1 answer

Adjusting spark graphs/histograms in skimr package using R

I am working on a report that will display the results of some Likert scale data. I want to use the skim() function from the skimr package to utilize the spark graphs/histogram visual. The issue is that my response options range from 1 to 5 on each…
JeffB
  • 139
  • 1
  • 10
2
votes
2 answers

skimr: how to get the top 3 and bottom 3 values?

Consider this simple example > tibble(value = c(1,2,3,4,5,5,6,7,8,9,10,11,12)) %>% + skim() Skim summary statistics n obs: 13 n variables: 1 -- Variable type:numeric ------------------------------------------------------- variable missing…
ℕʘʘḆḽḘ
  • 18,566
  • 34
  • 128
  • 235
1
vote
1 answer

skimr: How to customize Data Summary in skim() output?

I want to use the function skim from R package skimr to produce summary statistics of multiple datasets. To save space, I need to prioritize information that gets displayed. I would like to remove these rows from the Data Summary section of skim…
ekirbw
  • 11
  • 3
1
vote
1 answer

Create an indicator variable in one data frame based on values in another data frame

Say, I have a dataset called iris. I want to create an indicator variable called sepal_length_group in this dataset. The values of this indicator will be p25, p50, p75, and p100. For example, I want sepal_length_group to be equal to "p25" for an…
Anup
  • 239
  • 2
  • 11
1
vote
3 answers

skimr: how to remove histogram?

I want to use the function skim from the R package skimr on Windows. Unfortunately, in many situations the hist column is printed incorrectly (with many -like symbols), as in the example below. Question: is there an easy way to either disable…
GegznaV
  • 4,938
  • 4
  • 23
  • 43
1
vote
2 answers

Output confidence interval with skim

I am trying to add a confidence interval to the output produced by skimr library(skimr); library(Rmisc) skim_with(numeric = list(CI = Rmisc::CI), append = FALSE) skim(mtcars) Skim summary statistics n obs: 32 n variables: 11 ── Variable…
luciano
  • 13,158
  • 36
  • 90
  • 130
1
vote
3 answers

Cannot select rows with 0 in a column and values different from 0 in another, in skimr summary

I have the following data set (simplified with respect to the real one): foo <- structure(list(id = c(1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 1L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L, 2L,…
DeltaIV
  • 4,773
  • 12
  • 39
  • 86
0
votes
0 answers

Why is the skim_with() function returning double output?

I am using the following code to provide customized summary statistics of a generic dataframe: library(skimr) my_skim <- skim_with(base = sfl(n_missing = n_missing), numeric = sfl(hist =NULL , range = range, iqr = iqr, skew=skewness, cv =…
0
votes
1 answer

Skimr error message when I try to do skim() with the data exported from REDCap

I would like to use the skimr package and skim() function to explore my data exported from REDCap, but it gives me this error message. Error in `dplyr::summarize()`: ! Problem while computing `skimmed = purrr::map2(...)`. ℹ The error occurred in…
Yue Pan
  • 11
  • 2
0
votes
1 answer

How to export skimr::skim() results to a file with variable type reset on many data frames?

I have 2 data frames (more in real life). My goal is to generate summary reports with the skimr package, then export them as files to a folder. They would also have different file names. The problem that makes it not so straightforward is that I need the…
merry123
  • 103
  • 5
0
votes
0 answers

How to allow skimr::skim_with function to output stats for date variable?

I have a date variable in my data frame, but skim(df) outputs the date variable as a character variable. How do I specify in the code that it needs to recognize the date and output the range of the date variable? df = data.frame(date = c(…
merry123
  • 103
  • 5