Hi r/computerforensics, I had a matter recently where I needed to forensically collect a user's entire ChatGPT history: projects, conversations, generated images, the whole thing. So I built a toolkit that attaches to a Chrome session via CDP, extracts the auth token, and hits ChatGPT's backend API directly. Every conversation gets saved as an individual JSON file with a SHA-256 hash recorded in a CSV manifest. There's a separate verification script that recomputes all hashes post-collection and flags any mismatches, missing files, or untracked artifacts.
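The hash-and-verify step is the forensically important part, so here's a minimal sketch of how it can work. Function names, the chunk size, and the CSV column names are mine, not necessarily what the repo uses:

```python
import csv
import hashlib
from pathlib import Path

def sha256_file(path: Path) -> str:
    """Stream the file in chunks so large exports don't load into memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            h.update(chunk)
    return h.hexdigest()

def verify_manifest(manifest: Path) -> dict:
    """Recompute every hash listed in the CSV manifest and bucket the
    results: ok, mismatch (hash changed), or missing (file gone)."""
    report = {"ok": [], "mismatch": [], "missing": []}
    with open(manifest, newline="") as f:
        for row in csv.DictReader(f):
            path = manifest.parent / row["filename"]
            if not path.exists():
                report["missing"].append(row["filename"])
            elif sha256_file(path) != row["sha256"]:
                report["mismatch"].append(row["filename"])
            else:
                report["ok"].append(row["filename"])
    return report
```

Detecting "untracked artifacts" is the same idea in reverse: diff the set of files on disk against the set of filenames in the manifest.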
A few things that made this harder than expected:
ChatGPT only shows ~5 "pinned" projects in the sidebar API. The rest are hidden, so I had to build a multi-phase discovery process that paginates the sidebar endpoint AND scans the full conversation list to find project IDs the sidebar doesn't return.
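The two-phase discovery can be sketched like this. I'm abstracting the actual endpoints behind two callables, since the real backend-api paths and response shapes aren't documented and may change; each stand-in takes an offset and returns `(items, has_more)`:

```python
def discover_project_ids(fetch_sidebar, fetch_conversations) -> set:
    """Union of project IDs from two sources: the (truncated) sidebar
    endpoint, plus project references found on individual conversations."""
    project_ids = set()

    # Phase 1: paginate the sidebar endpoint for pinned projects.
    offset = 0
    while True:
        items, has_more = fetch_sidebar(offset)
        project_ids.update(item["id"] for item in items)
        if not has_more:
            break
        offset += len(items)

    # Phase 2: scan the full conversation list for project IDs
    # the sidebar never returned.
    offset = 0
    while True:
        items, has_more = fetch_conversations(offset)
        for conv in items:
            if conv.get("project_id"):
                project_ids.add(conv["project_id"])
        if not has_more:
            break
        offset += len(items)

    return project_ids
```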
Conversations are stored as tree structures (not flat lists) with branch points for edits and regenerations. The tool walks the active branch from `current_node` back to the root.
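The branch walk looks roughly like this, assuming a mapping shaped like ChatGPT's export format (`node_id -> {parent, message}`); the exact field names in the live API may differ:

```python
def active_branch(mapping: dict, current_node: str) -> list:
    """Follow parent pointers from current_node up to the root, then
    reverse, yielding the messages on the active branch in order.
    Abandoned edit/regeneration branches are never visited."""
    messages = []
    node_id = current_node
    while node_id is not None:
        node = mapping[node_id]
        if node.get("message"):  # root/system nodes may carry no message
            messages.append(node["message"])
        node_id = node.get("parent")
    messages.reverse()
    return messages
```

This is why you can't just dump the node list: a conversation with edits contains messages the user replaced, and only the parent chain from `current_node` tells you which version is "live".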
Team/Enterprise workspaces require a separate account ID header or you only see personal data.
Rate limiting is aggressive, so I built in exponential backoff with automatic retry.
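A minimal version of the backoff wrapper, with jitter so parallel retries don't synchronize. `RateLimited` is a stand-in for however you detect an HTTP 429:

```python
import random
import time

class RateLimited(Exception):
    """Stand-in for an HTTP 429 from the backend API."""

def with_backoff(call, max_retries: int = 5, base_delay: float = 1.0):
    """Retry `call` on rate-limit errors with exponential backoff."""
    for attempt in range(max_retries):
        try:
            return call()
        except RateLimited:
            if attempt == max_retries - 1:
                raise  # out of retries; surface the error
            # Delays grow 1x, 2x, 4x, ... of base_delay, plus jitter
            time.sleep(base_delay * 2 ** attempt + random.uniform(0, base_delay))
```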
I've also included a script to convert the JSON exports to formatted PDFs (useful for handing off to counsel). The exporter also supports resume, so if it crashes or gets rate-limited mid-run, you re-run it and it picks up where it left off.
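Resume logic can be as simple as the sketch below (my naming, not necessarily the repo's). Treating a conversation as done only if its JSON file exists *and* parses means a partial write from a crashed run gets re-fetched rather than silently kept:

```python
import json
from pathlib import Path

def pending_conversations(conv_ids: list, out_dir: Path) -> list:
    """Return the conversation IDs still needing collection: anything
    with no output file, or a file that doesn't parse as JSON."""
    todo = []
    for conv_id in conv_ids:
        path = out_dir / f"{conv_id}.json"
        try:
            json.loads(path.read_text())
        except (FileNotFoundError, json.JSONDecodeError):
            todo.append(conv_id)
    return todo
```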
Open-source for the community: https://github.com/loucdg/chatgpt-forensic-exporter
Even if you don't have a forensic use case right now, it's worth having for backing up your own ChatGPT data. OpenAI's official export can take 24-48 hours to arrive, and the format it delivers is less usable than what this produces.
This is my first time releasing a tool like this publicly. And yes, I heavily leveraged "vibe coding" to get it done, but I've been happy with the results. I have a few other Python scripts that I've used during matters and will upload if there's interest.
Happy to answer questions or take feedback.