scripts/data_collector/README.md
Scripts for data collection
Specific implementation reference: https://github.com/microsoft/qlib/tree/main/scripts/data_collector/yahoo
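Add `collector.py`, which defines the collector, normalize, and CLI classes:

```python
import sys
from pathlib import Path

# Make the parent `scripts` directory importable so `data_collector` resolves.
CUR_DIR = Path(__file__).resolve().parent
sys.path.append(str(CUR_DIR.parent.parent))
from data_collector.base import BaseCollector, BaseNormalize, BaseRun


# Collector class
class UserCollector(BaseCollector):
    ...


# Normalize class
class UserNormalize(BaseNormalize):
    ...


# CLI class
class Run(BaseRun):
    ...
```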
Also add `README.md` and `requirements.txt`.

| | Basic data |
|---|---|
| Features | Price/Volume: `$close`/`$open`/`$low`/`$high`/`$volume`/`$change`/`$factor` |
| Calendar | `<freq>.txt`: `day.txt`, `1min.txt` |
| Instruments | `<market>.txt`: `all.txt` (required); `csi300.txt`/`csi500.txt`/`sp500.txt` |

`Features`: numeric data.
Each component requires the following data to run correctly:

| Component | Required data |
|---|---|
| Data retrieval | Features, Calendar, Instruments |
| Backtest | Features[Price/Volume], Calendar, Instruments |
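
The dependency table above can be turned into a quick pre-flight check. The following is a minimal sketch, not part of the data collector scripts: the helper name `missing_data` and the subdirectory names `features`, `calendars`, and `instruments` are assumptions made for illustration, so adjust them to match your actual data directory.

```python
from pathlib import Path
from typing import List

# Required data per component, taken from the table above.
REQUIRED_DATA = {
    "Data retrieval": ("Features", "Calendar", "Instruments"),
    "Backtest": ("Features", "Calendar", "Instruments"),
}

# ASSUMPTION: these subdirectory names are illustrative, not confirmed by this README.
DATA_SUBDIRS = {
    "Features": "features",
    "Calendar": "calendars",
    "Instruments": "instruments",
}


def missing_data(component: str, data_dir: str, freq: str = "day") -> List[str]:
    """Return the required data kinds that are absent from data_dir (hypothetical helper)."""
    root = Path(data_dir)
    missing = []
    for kind in REQUIRED_DATA[component]:
        sub = root / DATA_SUBDIRS[kind]
        if kind == "Calendar":
            # Calendars are named <freq>.txt, e.g. day.txt or 1min.txt.
            present = (sub / f"{freq}.txt").is_file()
        elif kind == "Instruments":
            # all.txt is the only instrument file marked as required.
            present = (sub / "all.txt").is_file()
        else:
            # Features: only check that the directory exists and is not empty.
            present = sub.is_dir() and any(sub.iterdir())
        if not present:
            missing.append(kind)
    return missing
```

Calling `missing_data("Backtest", "path/to/data")` returns an empty list when every required kind is found, and otherwise lists what still needs to be collected. The check only looks at file presence; it does not validate file contents.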