This study describes a method to identify laboratory test procedures and impact response requirements suitable for assessing the biofidelity of finite element head models used in prediction of traumatic brain injury. The selection of the experimental data and the response requirements were result of a critical evaluation based on the accuracy, reproducibility and relevance of the available experimental data. A weighted averaging procedure was chosen in order to consider different contributions from the various test conditions and target measurements based on experimental error. According to the quality criteria, 40 experimental cases were selected to be a representative dataset for validation. Based on the evaluation of response curves from four head finite element models, CORA was chosen as a quantitative method to compare the predicted time history response to the measured data. Optimization of the CORA global settings led to the recommendation of performing curve comparison on a fixed time interval of 0-30 ms for intracranial pressure and at least 0-40 ms for brain motion and deformation. The allowable maximum time shift was adjusted depending on the shape of the experimental curves (DMAX = 0.12 for intracranial pressure, DMAX = 0.40 for brain motion and DMAX = 0.25 for brain deformation). Finally, bigger penalization of ratings was assigned to curves with fundamentally incorrect shape compared to those having inaccuracies in amplitude or time shift (cubic vs linear). This rigorous approach is necessary to ensure confidence in the model results and progress in the usage of finite element head models for traumatic brain injury prediction.