Simple format for batch import?

I'm working on migrating from a Google Group to an Askbot forum, and I'm trying to find the best format to convert the data into so that I can import it.

All I need to keep are the messages (with poster and message text, as well as the parent message if applicable) and the users (name and email).

Are there any simple formats/tools I can use to import this data?

I have looked at dumping the database and trying to copy the format but... that was not terribly effective. If there is good documentation for these dumps that I could not find, which would make adding things to it easier, that would also work.

Thanks!

muselesscreator asked 2014-02-24 14:56:40 -0500

Comments

In what format do you have your data?

Evgeny (2014-02-24 15:05:41 -0500)

Right now? I exported the emails containing the data (as Google Groups does not have an export function) and parsed them into a Python dictionary.

muselesscreator (2014-02-24 15:30:50 -0500)

To expand on my previous comment about the data format: I have a list of Python dictionaries generated from an MBOX data structure. The list is, right now, in the format [{'subject': '...', 'From': '...', 'To': '...', 'text': '...'}, ...], where 'text' is the actual body of the question or response. I am working on separating these messages into threads, which shouldn't be too difficult. What I am looking for is the correct format to put the message and thread data into after I've separated it all out.

muselesscreator (2014-02-26 10:46:33 -0500)
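
For the thread-separation step described in the last comment, one possible approach is to group messages by subject after stripping reply prefixes such as "Re:". The following is only a minimal sketch, not an Askbot import format. The function names `normalize_subject` and `group_into_threads` are hypothetical, and it assumes that the list is already in chronological order and that replies keep the subject of the original message (the dictionaries described above carry no Message-ID or In-Reply-To fields to rely on).

```python
import re
from collections import OrderedDict

# Matches one or more leading reply/forward prefixes such as "Re:" or "Fwd:".
_PREFIX = re.compile(r'^\s*((re|fwd?)\s*:\s*)+', re.IGNORECASE)


def normalize_subject(subject):
    """Strip reply/forward prefixes, collapse whitespace, and lowercase."""
    stripped = _PREFIX.sub('', subject or '')
    return ' '.join(stripped.split()).lower()


def group_into_threads(messages):
    """Group message dicts into threads keyed by normalized subject.

    Assumption: `messages` is in chronological order, so the first
    message seen for a subject is treated as the question and every
    later one as a reply to it.
    """
    threads = OrderedDict()
    for msg in messages:
        key = normalize_subject(msg.get('subject'))
        thread = threads.setdefault(key, {'question': None, 'replies': []})
        if thread['question'] is None:
            thread['question'] = msg
        else:
            thread['replies'].append(msg)
    return list(threads.values())
```

Each returned thread holds the first message under 'question' and the remaining ones under 'replies', which keeps the poster, the text, and the parent relationship together. Subject matching will merge unrelated threads that happen to share a title, so it may need a manual check on a real export.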