feat: support multi-CSV profiling, ratio analysis, and code guidance#17
Conversation
There was a problem hiding this comment.
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 4f343c900b
ℹ️ About Codex in GitHub
Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you
- Open a pull request for review
- Mark a draft as ready
- Comment "@codex review".
If Codex has suggestions, it will comment; otherwise it will react with 👍.
Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".
| "print('결측 비율 상위:\n', missing_ratio.head(10))\n\n" | ||
| "numeric_cols = merged.select_dtypes(include='number').columns\n" | ||
| "if len(numeric_cols) > 0:\n" | ||
| " ratio = (merged[numeric_cols] > 0).mean().sort_values(ascending=False)\n" | ||
| " print('양수 비율 상위:\n', ratio.head(10))\n" |
There was a problem hiding this comment.
Escape newline sequences in generated pandas snippet
build_code_guidance currently uses \n inside quoted literals in the template string, so the generated pandas_example contains actual line breaks inside '...' and fails to compile (SyntaxError: unterminated string literal) when users run the suggested code from JSON/markdown output. This breaks the new “code guidance” workflow for any multi-analyze result that is copied into Python.
Useful? React with 👍 / 👎.
Motivation
pandascode and a human-readable markdown report so users can continue with visualization or feature engineering.Description
bitnet_tools/multi_csv.pywhich implementsanalyze_multiple_csv, per-file column profiling (missing_ratio,unique_ratio,top_values), numeric sign distribution,build_code_guidance,build_multi_csv_markdown, andresult_to_json.summarize_readerinbitnet_tools/analysis.pyand addedbuild_markdown_reportto produce single-file markdown outputs.bitnet_tools/cli.py) with a newmulti-analyzesubcommand and support forreport,desktop, anddoctorflows, and addedbitnet-desktopentry point topyproject.toml.bitnet_tools/desktop.pyandbitnet_desktop.pyw) and an environment diagnostic helper (bitnet_tools/doctor.py) and addedBitNet_Desktop_Start.batto simplify Windows launch.README.mdto document multi-CSV capabilities and workflow, and added tests intests/test_analysis.pyandtests/test_cli.pycovering the new multi-CSV report builder and CLI behavior.Testing
pytest -qand all tests passed (11 passed).multi-analyzewithpython -m bitnet_tools.cli --help.python -m bitnet_tools.cli multi-analyze /tmp/a.csv /tmp/b.csv --question "다중 csv 분석" --out-json /tmp/multi.json --out-report /tmp/multi.mdwhich produced both JSON and markdown outputs successfully.Codex Task