Re: Training Decision Tree

Tech Tip: Click here to run a free scan for Windows Errors and optimize PC performance



But if I want SQL Server to take a specific amount of data for validation,
is there a way for that?

I am not sure if I understand what you mean. In all examples from my
previous post you can easily specify any amount you need in percentages;
with SSIS Row Sampling transform, you can also specify number of rows you
need. If you want to get a fixed number of rows in T-SQL, you can use TOP
operator, like

SELECT TOP 1000 *
FROM table
ORDER BY NEWID()

By using NEWID() function for ordering you get approximately random sample,
usually good enough for analyses; however, SSIS Row Sampling transform gives
you statisticaly random rows.

--
Dejan Sarka
http://www.solidqualitylearning.com/blogs/


.