I'd be minded to go with a hybrid approach -- write my tests for the both the old and new code, then use the results of the Reference Implementation as the standard to validate my new code against. Obviously, the new features may not (probably will not) have analogs in the RI, but where there is comparable functionality, use it. (And remember the first rule of System Analysis: "The System you are designing must function at least as well as the System it is going to replace.")
----
I Go Back to Sleep, Now.
OGB