Simple format for batch import?
I'm working on migrating from a Google Group to an Askbot forum, and I'm trying to find the best format to convert the data into so that I can import it.
All I need to keep are the messages (with the poster, the message text, and the parent message if applicable) and the users (name and email).
Are there any simple formats/tools I can use to import this data?
I have looked at dumping the database and trying to copy the format but... that was not terribly effective. If there is good documentation for these dumps that I could not find, which would make adding things to it easier, that would also work.
Thanks!
Comments
In what format do you have your data?
Right now? I exported the emails containing the data (as Google Groups does not have an export function) and parsed them into a Python dictionary.
To expand on my previous comment about the data format: I have a list of Python dictionaries generated from an mbox file. Each dictionary is currently in the format [{'subject': '...', 'From': '...', 'To': '...', 'text': '...'}, ...], where 'text' is the actual body of the question/response. I am working on separating these messages into threads, which shouldn't be too difficult. What I am looking for is the correct format to put the message/thread data into once I've separated it all out.
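For the thread-separation step, here is a rough sketch of one way it could be done with the dictionaries described above. It assumes the only available keys are 'subject', 'From', 'To' and 'text', so it groups replies by subject line after stripping "Re:"/"Fwd:" prefixes; this is a heuristic, not real reply tracking. The helper names (normalize_subject, group_threads, collect_users) are made up for illustration, and the output is just a neutral intermediate structure, not Askbot's import format.

```python
import re
from collections import OrderedDict
from email.utils import parseaddr

# Matches one or more leading reply/forward prefixes such as "Re: " or "Fwd: ".
_PREFIX = re.compile(r'^\s*((re|fwd?)\s*:\s*)+', re.IGNORECASE)


def normalize_subject(subject):
    """Strip reply/forward prefixes and lowercase, so replies share a key."""
    return _PREFIX.sub('', subject or '').strip().lower()


def group_threads(messages):
    """Group message dicts by normalized subject, preserving input order.

    Assumption: messages are already in chronological order, so the first
    message in each group can be treated as the thread's opening post.
    """
    threads = OrderedDict()
    for msg in messages:
        key = normalize_subject(msg.get('subject'))
        threads.setdefault(key, []).append(msg)
    return threads


def collect_users(messages):
    """Map each poster's email address to the first display name seen."""
    users = {}
    for msg in messages:
        name, addr = parseaddr(msg.get('From', ''))
        if addr:
            users.setdefault(addr.lower(), name or addr)
    return users
```

With this, the first message in each group can stand in as the question and the remaining ones as replies whose parent is that first message. If the original mbox messages are still available, their In-Reply-To and Message-ID headers would give more reliable parent links than subject matching.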