aiData/Test/ocr_llm_output.txt
HuangHai b66f683dfb 'commit'
2026-01-12 07:49:18 +08:00
D:\anaconda3\envs\py310\python.exe : Checking connectivity to the model hosters, this may take a while. To bypass this check, set `DISABLE_MODEL_SOURCE_CHECK` to `True`.
At line:1 char:1
+ D:\anaconda3\envs\py310\python.exe D:\dsWork\YltProject\dsCrawler\Tes ...
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
+ CategoryInfo : NotSpecified: (Checking c... to `True`.:String) [], RemoteException
+ FullyQualifiedErrorId : NativeCommandError
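Note: the message above names an environment variable that bypasses the connectivity check. A minimal sketch of setting it from Python follows. It assumes the variable is read when the OCR library is imported, so it is set before any such import; the helper name `disable_model_source_check` is hypothetical and not part of the original script.

```python
import os


def disable_model_source_check() -> None:
    """Set the flag quoted in the log message.

    Assumption: the library reads this variable at import time, so this
    should run before importing the OCR package.
    """
    os.environ["DISABLE_MODEL_SOURCE_CHECK"] = "True"


disable_model_source_check()
# The OCR library import would follow here in the real script.
```

Setting the variable in the shell before launching Python should have the same effect, since the child process inherits the environment.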
 
INFO: Could not find files for the given pattern(s).
D:\anaconda3\envs\py310\lib\site-packages\paddle\utils\cpp_extension\extension_utils.py:718: UserWarning: No ccache found. Please be aware that recompiling all source files may be required. You can download and install ccache from: https://github.com/ccache/ccache/blob/master/doc/INSTALL.md
  warnings.warn(warning_message)
Creating model: ('PP-LCNet_x1_0_doc_ori', None)
Model files already exist. Using cached files. To redownload, please delete the directory manually: `C:\Users\Administrator\.paddlex\official_models\PP-LCNet_x1_0_doc_ori`.
WARNING: Logging before InitGoogleLogging() is written to STDERR
I0111 15:32:54.676050 22720 onednn_context.cc:81] oneDNN v3.6.2
Creating model: ('UVDoc', None)
Model files already exist. Using cached files. To redownload, please delete the directory manually: `C:\Users\Administrator\.paddlex\official_models\UVDoc`.
Creating model: ('PP-LCNet_x1_0_textline_ori', None)
Model files already exist. Using cached files. To redownload, please delete the directory manually: `C:\Users\Administrator\.paddlex\official_models\PP-LCNet_x1_0_textline_ori`.
Creating model: ('PP-OCRv5_server_det', None)
Model files already exist. Using cached files. To redownload, please delete the directory manually: `C:\Users\Administrator\.paddlex\official_models\PP-OCRv5_server_det`.
Creating model: ('PP-OCRv5_server_rec', None)
Model files already exist. Using cached files. To redownload, please delete the directory manually: `C:\Users\Administrator\.paddlex\official_models\PP-OCRv5_server_rec`.
D:\dsWork\YltProject\dsCrawler\Test\TestOcrLlm.py:42: DeprecationWarning: Please use `predict` instead.
  result = ocr.ocr(image_path)
Testing OCR + LLM Pipeline on: D:\dsWork\YltProject\dsCrawler\Test\2.jpg
--------------------------------------------------
Initializing PaddleOCR...
PaddleOCR Init Time: 3.6460s
Running OCR Inference...
OCR Result (15.6618s):
长春市绿园区雁鸣湖公共充电站
(…
7.4km
0.7111/度

闲3/4

组团
2倍积分
P
·收费停车:以场地实际收费为准
--------------------------------------------------
Calling LLM for Parsing...
```json
{
"station_name": "长春市绿园区雁鸣湖公共充电站",
"distance": "7.4km",
"price": "0.7111/度",
"tags": ["快", "闲3/4", "组团", "2倍积分", "P"],
"parking_info": "收费停车:以场地实际收费为准"
}
```
LLM Response (3.2610s):
```json
{
"station_name": "长春市绿园区雁鸣湖公共充电站",
"distance": "7.4km",
"price": "0.7111/度",
"tags": ["快", "闲3/4", "组团", "2倍积分", "P"],
"parking_info": "收费停车:以场地实际收费为准"
}
```
 
Parsed JSON Data:
{
"station_name": "长春市绿园区雁鸣湖公共充电站",
"distance": "7.4km",
"price": "0.7111/度",
"tags": [
"快",
"闲3/4",
"组团",
"2倍积分",
"P"
],
"parking_info": "收费停车:以场地实际收费为准"
}
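Note: the LLM response above arrives wrapped in a Markdown json fence, while the parsed data is plain JSON. A minimal sketch of that step, assuming the response holds at most one fenced block, is given here. The function name `parse_llm_json` is hypothetical and the original script may do this differently.

```python
import json
import re

# Matches an optional "json" language tag and captures the fenced payload.
FENCE_RE = re.compile(r"```(?:json)?\s*(.*?)\s*```", re.DOTALL)


def parse_llm_json(response: str) -> dict:
    """Strip a surrounding Markdown fence, if any, and parse the JSON inside."""
    match = FENCE_RE.search(response)
    payload = match.group(1) if match else response.strip()
    return json.loads(payload)


sample = (
    "```json\n"
    '{"station_name": "长春市绿园区雁鸣湖公共充电站", '
    '"distance": "7.4km", "price": "0.7111/度", '
    '"tags": ["快", "闲3/4", "组团", "2倍积分", "P"]}\n'
    "```"
)
data = parse_llm_json(sample)
print(json.dumps(data, ensure_ascii=False, indent=2))
```

Passing `ensure_ascii=False` keeps the Chinese text readable when the parsed data is printed back out.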
--------------------------------------------------
Summary:
OCR Time: 15.6618s
LLM Time: 3.2610s
Total Pipeline Time (excluding init): 18.9228s
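Note: the total above is the sum of the two stage times (15.6618 s + 3.2610 s = 18.9228 s), with initialization left out. A minimal sketch of how such a summary might be produced is shown below. The helpers `timed` and `summarize` are hypothetical names, and `time.perf_counter` is an assumed choice of clock.

```python
import time
from typing import Any, Callable, Tuple


def timed(fn: Callable[..., Any], *args: Any, **kwargs: Any) -> Tuple[Any, float]:
    """Run fn and return its result together with the elapsed seconds."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


def summarize(ocr_seconds: float, llm_seconds: float) -> str:
    """Format the per-stage times and their sum, excluding initialization."""
    total = ocr_seconds + llm_seconds
    return (
        f"OCR Time: {ocr_seconds:.4f}s\n"
        f"LLM Time: {llm_seconds:.4f}s\n"
        f"Total Pipeline Time (excluding init): {total:.4f}s"
    )


print(summarize(15.6618, 3.2610))
```

Each stage would be wrapped as `result, seconds = timed(stage_fn, argument)` and the two measured durations passed to `summarize`.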