Date: Feb 7, 2013 5:57 PM
Subject: Pull out specific numbers from unstructured text file
Here is an unstructured text file (chess_mov.out) that I have:
gsgfs 6y856 words
Nmoves=84 (chess win)
Nrequired=101 (chess win maximum moves requested)
I need to:
a. ignore all header lines that come before the line starting with 'Nmoves'. For this part, I need to first search for the line where the string 'Nmoves' occurs and ignore all lines before it. The number of lines before the string 'Nmoves' is 4 for this example only, but it is not always fixed.
b. extract the numbers that come after 'Nmoves' and 'Nrequired'. So, in this case, the numbers I need are 84 and 101.
I have tried this with textscan. The problem is that this is an unstructured text file and textscan is having a lot of problems picking up a fixed pattern.
How can I extract the numbers that come after 'Nmoves' and 'Nrequired'?