Dataset statistics
| Number of variables | 1 |
|---|---|
| Number of observations | 13880 |
| Missing cells | 1236 |
| Missing cells (%) | 8.9% |
| Duplicate rows | 902 |
| Duplicate rows (%) | 6.5% |
| Total size in memory | 216.9 KiB |
| Average record size in memory | 16.0 B |
Variable types
| TimeSeries | 1 |
|---|
Timeseries statistics
| Number of series | 1 |
|---|---|
| Time series length | 13880 |
| Starting point | 1983-01-01 00:00:00 |
| Ending point | 2020-12-31 00:00:00 |
| Period | 1 day |
| Dataset has 902 (6.5%) duplicate rows | Duplicates |
Flow has 1236 (8.9%) missing values | Missing |
Reproduction
| Analysis started | 2024-05-12 19:34:46.215446 |
|---|---|
| Analysis finished | 2024-05-12 19:34:49.024520 |
| Duration | 2.81 seconds |
| Missing | Q_Station_NA_25027050_ok_Missing.csv |
| Download configuration | config.json |
Flow
Numeric time series
MISSING 
| Distinct | 3522 |
|---|---|
| Distinct (%) | 27.9% |
| Missing | 1236 |
| Missing (%) | 8.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.05962591 |
|---|---|
| Minimum | -1533 |
| Maximum | 1416 |
| Zeros | 89 |
| Zeros (%) | 0.6% |
| Memory size | 216.9 KiB |
Quantile statistics
| Minimum | -1533 |
|---|---|
| 5-th percentile | -269.34 |
| Q1 | -66.5475 |
| median | 5.84 |
| Q3 | 73 |
| 95-th percentile | 248 |
| Maximum | 1416 |
| Range | 2949 |
| Interquartile range (IQR) | 139.5475 |
Descriptive statistics
| Standard deviation | 169.34023 |
|---|---|
| Coefficient of variation (CV) | 2840.0445 |
| Kurtosis | 8.4972995 |
| Mean | 0.05962591 |
| Median Absolute Deviation (MAD) | 69.84 |
| Skewness | -0.22764574 |
| Sum | 753.91 |
| Variance | 28676.115 |
| Monotonicity | Not monotonic |
| Augmented Dickey-Fuller test p-value | 0 |
Histogram with fixed size bins (bins=50)
Gap statistics
| number of gaps | 27 |
|---|---|
| min | 4 days |
| max | 1 year and 6 days |
| mean | 6 weeks, 4 days and 13 hours |
| std | 13 weeks, 5 days and 1 hour |
| Value | Count | Frequency (%) |
| 0 | 89 | 0.6% |
| 10 | 58 | 0.4% |
| 8 | 56 | 0.4% |
| -7 | 52 | 0.4% |
| 4 | 52 | 0.4% |
| 2 | 52 | 0.4% |
| 39 | 50 | 0.4% |
| -4 | 50 | 0.4% |
| 1 | 50 | 0.4% |
| 9 | 50 | 0.4% |
| Other values (3512) | 12085 | |
| (Missing) | 1236 | 8.9% |
| Value | Count | Frequency (%) |
| -1533 | 1 | |
| -1452 | 1 | |
| -1435 | 1 | |
| -1425 | 1 | |
| -1201 | 1 | |
| -1194 | 1 | |
| -1114 | 1 | |
| -1097 | 2 | |
| -1082 | 1 | |
| -1072 | 1 |
| Value | Count | Frequency (%) |
| 1416 | 1 | |
| 1391 | 1 | |
| 1352 | 1 | |
| 1319 | 1 | |
| 1276 | 1 | |
| 1214 | 1 | |
| 1158 | 1 | |
| 1145 | 1 | |
| 1073 | 1 | |
| 1007 | 1 |
ACF and PACF
A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
| Flow | |
|---|---|
| Date | |
| 1983-01-01 | NaN |
| 1983-01-02 | NaN |
| 1983-01-03 | -44.0 |
| 1983-01-04 | 47.0 |
| 1983-01-05 | -128.0 |
| 1983-01-06 | 31.0 |
| 1983-01-07 | 4.0 |
| 1983-01-08 | 24.0 |
| 1983-01-09 | 50.0 |
| 1983-01-10 | -37.0 |
| Flow | |
|---|---|
| Date | |
| 2020-12-22 | 25.20 |
| 2020-12-23 | -3.41 |
| 2020-12-24 | 45.68 |
| 2020-12-25 | 32.84 |
| 2020-12-26 | -63.76 |
| 2020-12-27 | 350.10 |
| 2020-12-28 | NaN |
| 2020-12-29 | NaN |
| 2020-12-30 | NaN |
| 2020-12-31 | NaN |
Most frequently occurring
| Flow | # duplicates | |
|---|---|---|
| 901 | NaN | 1236 |
| 445 | 0.0 | 89 |
| 470 | 10.0 | 58 |
| 467 | 8.0 | 56 |
| 424 | -7.0 | 52 |
| 451 | 2.0 | 52 |
| 456 | 4.0 | 52 |
| 432 | -4.0 | 50 |
| 449 | 1.0 | 50 |
| 468 | 9.0 | 50 |