The work presented in this paper focuses on zero-resource children's speech recognition task. In such tasks, adults' speech data is used for learning the acoustic models. However, this leads to severe acoustic mismatch and hence poor recognition rates. One of the main mismatch factor is that the pitch values are higher in the case of children's speech. In order to mitigate the ill-effects of pitch-induced acoustic mismatch, two front-end speech parameterization techniques are proposed in this study. The proposed approaches employ spectral smoothing based on either pitch-adaptive cepstral truncation or variational mode decomposition. Furthermore, we have used Gamma-tone-filterbank for warping the spectra to the ERB scale. Consequently, the cepstral coefficients exhibit lower variance than those obtained using Mel-filterbank. Therefore, the proposed features are observed to be very effective resulting in a relative reduction in word error rate by nearly 17% over the baseline.