take corpus of books from a sql file and return a csv for each book with page IDs and their transcription