Do as mentioned: the x-axis should be the number of hidden neurons, and the y-axis the best validation accuracy/loss you obtained with the corresponding model (the model whose hidden layer size is the value on the x-axis). So it is a comparison across models rather than a single learning curve (which plots validation loss/accuracy against epochs).
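As a rough sketch of how the points for such a plot could be collected, assuming you kept the per-epoch validation loss for each model: the dictionary `val_loss_history`, its numbers, and the helper `best_per_model` are all hypothetical and only illustrate the idea.

```python
# Hypothetical per-epoch validation losses, keyed by hidden layer size.
# The numbers are made up purely for illustration.
val_loss_history = {
    8: [0.90, 0.70, 0.65, 0.66],
    32: [0.80, 0.55, 0.50, 0.52],
    128: [0.75, 0.48, 0.51, 0.60],
}


def best_per_model(history, mode="min"):
    """Return (sizes, best_values), one point per model, sorted by size.

    Use mode="min" for a loss and mode="max" for an accuracy.
    """
    pick = min if mode == "min" else max
    sizes = sorted(history)
    return sizes, [pick(history[size]) for size in sizes]


# x values: hidden layer sizes; y values: best validation loss of each model.
hidden_sizes, best_val_loss = best_per_model(val_loss_history, mode="min")
```

The two lists can then be passed to whatever plotting tool you use, for example `plt.plot(hidden_sizes, best_val_loss, marker="o")` with matplotlib, labelling the x-axis as the number of hidden neurons and the y-axis as the best validation loss.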