An open-source forensic exporter for ChatGPT conversations (SHA-256 hashing, verification, full project enumeration)
嗯,用户让我帮忙总结一下这篇文章的内容,控制在一百个字以内,而且不需要特定的开头。我得先仔细读一下文章。 文章是关于一个叫做ChatGPT的工具的取证收集。作者遇到了需要收集用户整个ChatGPT历史、项目和对话的问题,于是他开发了一个工具包。这个工具包通过Chrome的CDP连接到会话,提取认证令牌,然后直接访问ChatGPT的后端API。收集的数据被保存为JSON文件,并记录SHA-256哈希到CSV文件中。还有一个验证脚本在收集后检查哈希是否匹配,是否有文件缺失或未跟踪。 作者还提到了几个开发过程中遇到的挑战:比如ChatGPT只显示5个固定的项目,其他隐藏了;对话是树结构而不是列表;团队工作区需要额外的账号ID头;还有严格的速率限制,所以加入了指数回退和自动重试机制。此外,还有一个转换JSON到PDF的脚本,支持恢复功能。 最后,作者将这个工具开源,并提到即使没有取证需求的人也可以用来备份数据,因为OpenAI的数据导出格式不够好。 总结的时候要注意控制字数在100字以内,并且直接描述内容。可能需要突出工具的功能、解决的问题以及开源部分。 </think> 作者开发了一款工具,通过Chrome CDP提取ChatGPT认证令牌并直接访问其API,实现用户完整历史、项目和对话的取证收集。工具将数据保存为JSON文件并生成哈希校验表单,支持PDF导出和断点续传。该工具已开源,并适用于个人数据备份。 2026-3-28 21:43:5 Author: www.reddit.com(查看原文) 阅读量:2 收藏

Hi r/computerforensics, I had a matter recently where I needed to forensically collect a user's entire ChatGPT history, projects, conversations, generated images, the whole thing. So I built a toolkit that attaches to a Chrome session via CDP, extracts the auth token, and hits ChatGPT's backend API directly. Every conversation gets saved as an individual JSON file with a SHA-256 hash recorded in a CSV manifest. There's a separate verification script that recomputes all hashes, post-collection, and flags any mismatches, missing files, or untracked artifacts.

A few things that made this harder than expected:

  • ChatGPT only shows ~5 "pinned" projects in the sidebar API. The rest are hidden, so I had to build a multi-phase discovery process that paginates the sidebar endpoint AND scans the full conversation list to find project IDs the sidebar doesn't return.

  • Conversations are stored as tree structures (not flat lists) with branch points for edits and regenerations. The tool walks the active branch from current_node back to root.

  • Team/Enterprise workspaces require a separate account ID header or you only see personal data.

  • Rate limiting is aggressive, so I built in exponential backoff with automatic retry.

I've also included a script to convert the JSON exports to formatted PDFs (useful for handing off to counsel). It also supports resume, so if it crashes or gets rate-limited mid-run, you re-run and it picks up where it left off.

Open-source for the community: https://github.com/loucdg/chatgpt-forensic-exporter

Even if you don't have a forensic use case right now, it's worth having for backing up your own ChatGPT data. OpenAI has a 24-48 hour delay and the format it exports in is not as usable as this.

This is my first time releasing a tool like this publicly. And yes, I heavily leveraged "vibe coding" to get it done but I've been happy with the results. I have a few other python scripts that I've used during matters that I will upload if there's interest.

Happy to answer questions or take feedback.


文章来源: https://www.reddit.com/r/computerforensics/comments/1s6cx7e/an_opensource_forensic_exporter_for_chatgpt/
如有侵权请联系:admin#unsafe.sh