How can I split my loaded dataset into a training set and a test set, fit a random forest on the training set, and then calculate the fit for both the training set and the test set?
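
For reference, here is a minimal sketch of the workflow being asked about. The question does not name a language or library, so this assumes Python with scikit-learn, a classification task, and accuracy as the measure of fit. The synthetic data from `make_classification` is only a stand-in for the already-loaded dataset, and the 75/25 split and `n_estimators=100` are illustrative choices, not recommendations.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Stand-in for the loaded dataset (assumption): X holds features, y holds labels.
X, y = make_classification(n_samples=500, n_features=10, random_state=0)

# Hold out 25% of the rows as a test set; stratify keeps class proportions similar.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Fit the random forest on the training rows only.
model = RandomForestClassifier(n_estimators=100, random_state=0)
model.fit(X_train, y_train)

# For classifiers, score() returns mean accuracy on the given data.
train_acc = model.score(X_train, y_train)
test_acc = model.score(X_test, y_test)

print(f"Training accuracy: {train_acc:.3f}")
print(f"Test accuracy:     {test_acc:.3f}")
```

If the target is numeric rather than categorical, `RandomForestRegressor` can be swapped in, in which case `score()` reports R² instead of accuracy. Training fit is usually much higher than test fit for a random forest, so the test score is the one to rely on when judging generalization.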