xming521
Owner

First refactor of data processing

xming521 and others added 17 commits December 26, 2024 21:52
- Introduced new `make_dataset_args` in `settings.json` to support dataset creation with a specified processing method.
- Updated `load_config` function in `config.py` to handle the new `make_dataset` argument type, allowing for flexible configuration loading.
- Improved file handling in `config.py` by specifying UTF-8 encoding when reading `settings.json`.
- Added `commentjson` to `requirements.txt` for improved JSON handling.
- Enhanced `settings.json` with a new `prefer_comma` option for chat preferences.
- Refactored `csv_to_json.py` to improve data processing logic and added comments for clarity.
- Updated `qa_generator.py` to include data validation and filtering based on blocked words.
- Modified `strategies.py` to clarify the time window parameter and renamed a strategy class for better understanding.
- Changed JSON loading in `config.py` to use `commentjson` for better compatibility with comments in JSON files.
- Updated `settings.json` to include a new `conversation_strategy` and `time_window` for improved conversation handling.
- Refactored `qa_generator.py` to initialize conversation strategies based on the new configuration options.
- Modified `strategies.py` to improve type hints and clarify strategy implementations for conversation management.
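…or.py to support the new message-processing strategy, optimized the data-processing logic, and updated the test cases to cover the new functionality.
…parameters, updated qa_generator.py to support the new message-processing logic, deleted the old test file `test_old_csv_to_json copy.py`, and updated the related test cases.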
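
The commits above mention a `time_window` setting for conversation strategies and filtering based on blocked words. As a rough illustration only, here is a minimal sketch of how those two ideas could fit together. The `Message` record and the functions `group_by_time_window` and `drop_blocked` are hypothetical names made up for this example, not the actual code in `strategies.py` or `qa_generator.py`, and the sketch assumes the time window is measured in minutes.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class Message:
    # Hypothetical message record; the project's real schema may differ.
    sender: str
    text: str
    sent_at: datetime


def drop_blocked(messages, blocked_words):
    """Return only the messages whose text contains none of the blocked words."""
    return [m for m in messages if not any(w in m.text for w in blocked_words)]


def group_by_time_window(messages, time_window_minutes):
    """Split messages into conversations based on the gap between messages.

    Messages are sorted by timestamp; a new conversation starts whenever the
    gap to the previous message exceeds the time window (assumed: minutes).
    """
    window = timedelta(minutes=time_window_minutes)
    conversations = []
    for msg in sorted(messages, key=lambda m: m.sent_at):
        if conversations and msg.sent_at - conversations[-1][-1].sent_at <= window:
            conversations[-1].append(msg)
        else:
            conversations.append([msg])
    return conversations
```

In this sketch the filter runs before grouping, so a blocked message never influences where one conversation ends and the next begins; whether the project applies them in that order is not stated in the commit messages.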
@xming521 xming521 requested a review from Copilot April 13, 2025 11:36
Contributor

@Copilot Copilot AI left a comment

Copilot reviewed 12 out of 14 changed files in this pull request and generated 1 comment.

Files not reviewed (2)
  • .cursor/rules/weclone-rules.mdc: Language not supported
  • settings.json: Language not supported

import pandas as pd
import json

current_dir = os.path.dirname(p=os.path.abspath(__file__))

Copilot AI Apr 13, 2025

The use of the keyword argument 'p=' in os.path.dirname is invalid; please change it to use os.path.dirname(os.path.abspath(__file__)) instead.

Suggested change
- current_dir = os.path.dirname(p=os.path.abspath(__file__))
+ current_dir = os.path.dirname(os.path.abspath(__file__))
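
For context, a directory computed this way is usually used to locate files that sit next to the script. The following is a minimal sketch of that pattern, assuming `settings.json` lives in the same directory and is read with explicit UTF-8 encoding as the commit list describes. `load_settings` is a hypothetical helper, not the project's `load_config`, and it uses the standard-library `json` module rather than the `commentjson` package the PR adopts, so it would not accept comments in the file.

```python
import json
import os


def load_settings(base_dir, filename="settings.json"):
    """Read a JSON settings file located in base_dir, decoding it as UTF-8.

    Hypothetical helper for illustration; plain json rejects comments,
    which is why the PR switches to commentjson for the real file.
    """
    path = os.path.join(base_dir, filename)
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


# Typical call site inside a module:
#   current_dir = os.path.dirname(os.path.abspath(__file__))
#   settings = load_settings(current_dir)
```

Passing the path positionally matches the documented signature `os.path.dirname(path)` and does not depend on the internal parameter name.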

@xming521 xming521 merged commit cdb54d3 into master Apr 13, 2025
1 check passed
xming521 added a commit that referenced this pull request Apr 21, 2025