Simple format for batch import?
 

I am migrating from a Google Group to an Askbot forum and trying to find the best format to convert the data to so that I can import it.

All I need to keep are the messages (with the poster and message text, plus the parent message if applicable) and the users (name and email).

Are there any simple formats/tools I can use to import this data?

I have looked at dumping the database and trying to copy the format, but that was not terribly effective. If there is good documentation for these dumps that I could not find, and that would make adding data to them easier, that would also work.

Thanks!

muselesscreator
asked 11 years ago

Comments

In what format do you have your data?

Evgeny (11 years ago)

Right now? I exported the emails containing the data (Google Groups does not have an export function) and parsed them into a Python dictionary.

muselesscreator (11 years ago)
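
For readers attempting the same parsing step, here is a minimal sketch using Python's standard `mailbox` module. It assumes the exported emails have been collected into a single mbox file. The function names `get_body` and `mbox_to_dicts` and the dictionary keys are illustrative choices, not the poster's actual code.

```python
import mailbox


def get_body(msg):
    """Return the first text/plain part of a message as a string."""
    if msg.is_multipart():
        for part in msg.walk():
            if part.get_content_type() == "text/plain":
                payload = part.get_payload(decode=True)
                charset = part.get_content_charset() or "utf-8"
                return payload.decode(charset, errors="replace")
        return ""  # no plain-text part found
    payload = msg.get_payload(decode=True)
    if payload is None:
        return ""
    charset = msg.get_content_charset() or "utf-8"
    return payload.decode(charset, errors="replace")


def mbox_to_dicts(path):
    """Read an mbox file into a list of plain dictionaries (hypothetical helper)."""
    box = mailbox.mbox(path, create=False)
    try:
        return [
            {
                "subject": msg.get("Subject", ""),
                "From": msg.get("From", ""),
                "To": msg.get("To", ""),
                "text": get_body(msg),
            }
            for msg in box
        ]
    finally:
        box.close()
```

Only the first plain-text part is kept, so HTML-only messages and attachments come back with an empty body.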

To expand on my previous comment about the data format: I have a list of Python dictionaries generated from an mbox file. Right now the list has the form [{'subject': '...', 'From': '...', 'To': '...', 'text': '...'}, ...], where 'text' is the actual body of the question or response. I am working on separating these messages into threads, which shouldn't be too difficult. What I am looking for is the correct format to put the message and thread data into once I've separated it all out.

muselesscreator (11 years ago)
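
The thread-separation step described in this comment could be sketched roughly as follows. The sketch assumes the messages are already in chronological order and that replies keep the original subject with a "Re:" or "Fwd:" prefix. The names `normalize_subject` and `group_into_threads` are hypothetical, and the output structure is only an intermediate form, not an Askbot import format.

```python
import re

# Matches one or more leading "Re:", "Fw:" or "Fwd:" prefixes, case-insensitively.
_REPLY_PREFIX = re.compile(r"^\s*((re|fwd?)\s*:\s*)+", re.IGNORECASE)


def normalize_subject(subject):
    """Strip reply/forward prefixes so replies share a key with the original."""
    return _REPLY_PREFIX.sub("", subject).strip().lower()


def group_into_threads(messages):
    """Group message dicts into threads keyed by normalized subject.

    The first message seen for a subject is treated as the question and
    later ones as replies, so the input is assumed to be in chronological order.
    """
    threads = {}
    for msg in messages:
        key = normalize_subject(msg["subject"])
        thread = threads.setdefault(key, {"question": None, "replies": []})
        if thread["question"] is None:
            thread["question"] = msg
        else:
            thread["replies"].append(msg)
    return list(threads.values())
```

Subject matching is a heuristic: two unrelated threads with the same subject will be merged. If the original emails are still available, their `In-Reply-To` and `References` headers are likely a more reliable way to recover the parent of each message.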