Does the SEQ file contain a mixture of audio, MIDI and VSTi tracks? If so, is there a timing difference between tracks of the same type?

When swapping files, it's usually best to render all the tracks to audio if possible.

ROG.