For single age count data, perturb the data in such a way as to induce heaping on ages ending in 0 and 5. This is a common phenomenon that several age heaping evaluation methods are designed to test for. In order to estimate how these methods respond to different degrees of heaping in a systematic way, a function such as this may be useful. The way this works is purely a guess, and no checks are in place, so use with caution.
heapify( Value, Age, p0 = 2, p5 = p0, pdf0 = c(1, 6, 15, 20, 15, 6, 1), pdf5 = c(c(1, 4, 6, 4, 1)), ageMin = 25, ageMax = max(Age[Age%%5 == 0]) )
Value | numeric. A vector of demographic counts by single age. |
---|---|
Age | numeric. A vector of ages corresponding to the lower integer bound of the counts. |
p0 | total size of bias to round to 0s. Default 2. |
p5 | total size of bias to round to 5s. Defaults to |
pdf0 | numeric vector of length 7 giving the pdf to apply around 0s. |
pdf5 | numeric vector of length 5 giving the pdf to apply around 5s. |
ageMin | integer ending in 0 or 5. The lowest age to be heaped upon, default 25. |
ageMax | integer ending in 0 or 5. The highest age to be heaped upon, defaults to highest age evenly divisible by 10. |
Value numeric vector perturbed to look like age heaping as happened.
We use pascal weights, c(1,4,6,4,1)
scaled to argument p5
and c(1,6,15,20,15,6,1)
scaled to argument p0
to shift individuals from surrounding ages to ages endings in 5 and 0, respectively. This assumes that ages closer to digits 0 or 5 are more likely to be declared as 0 or 5, that this bias is symmetrical, and that such rounding can come from farther away ages for terminal digit 0 than for 5. The user can also make heaping scale differently for 0s and 5s, but there is no control over the shape of bias.
# example to show what we're talking about. # pop1m_pasex is already quite heaped: Age <- 0:99 A5 <- seq(0,95,by=5) if (FALSE) { plot(Age,pop1m_pasex) # here it is again, smoothed: smoothed <- graduate_sprague( smooth_age_5(pop1m_pasex, Age, method = "Strong", OAG = FALSE, young.tail = "Arriaga"), Age = A5, OAG = FALSE) lines(Age, smoothed) # an OK approximation for testing purposes. points(Age,heapify(smoothed,Age=0:99,1.8,1.1,ageMin=20),pch="x") }