Skip to content

[improvement](log) Fix English grammar issues in FE/BE log messages#61705

Open
morningman wants to merge 6 commits intoapache:masterfrom
morningman:opt_logs
Open

[improvement](log) Fix English grammar issues in FE/BE log messages#61705
morningman wants to merge 6 commits intoapache:masterfrom
morningman:opt_logs

Conversation

@morningman
Copy link
Contributor

No description provided.

@Thearas
Copy link
Contributor

Thearas commented Mar 25, 2026

Thank you for your contribution to Apache Doris.
Don't know what should be done next? See How to process your PR.

Please clearly describe your PR:

  1. What problem was fixed (it's best to include specific error reporting information). How it was fixed.
  2. Which behaviors were modified. What was the previous behavior, what is it now, why was it modified, and what possible impacts might there be.
  3. What features were added. Why was this function added?
  4. Which code was refactored and why was this part of the code refactored?
  5. Which functions were optimized and what is the difference before and after the optimization?

@morningman morningman requested a review from airborne12 as a code owner March 25, 2026 05:37
@morningman
Copy link
Contributor Author

run buildall

Issue Number: close #xxx

Problem Summary:
FE log messages contain widespread non-idiomatic English grammar patterns
inherited from Chinese-English direct translation. These patterns affect
log readability and professionalism. Specifically:

1. "finished to <verb>" used instead of "Finished <verb>ing" (~95 places)
2. "begin to <verb>" used instead of "Beginning to <verb>" (~40 places)
3. "start to <verb>" used instead of "Starting to <verb>" (~7 places)
4. Acronyms not properly capitalized: "gc" -> "GC", "ldap" -> "LDAP"
5. Minor spelling fix: "effected" -> "affected" in JdbcClient
6. Inconsistent capitalization of log message first letter

None

- Test: No need to test (log message text-only changes, no logic change)
- Behavior changed: No
- Does this need documentation: No
### What problem does this PR solve?

Issue Number: close #xxx

Problem Summary:
FE log messages lack standardized rules for grammar, log levels, and
output volume. Code review has no formal checklist for verifying log
quality. This leads to issues such as Chinglish patterns ("finished to"),
excessive INFO logging on idle cycles, and full Thrift object
serialization in log messages.

### Release note

None

### Check List (For Author)

- Test: No need to test (documentation-only change)
- Behavior changed: No
- Does this need documentation: No
### What problem does this PR solve?

Issue Number: close #xxx

Problem Summary:
1. Reduced duplicated and spammy logs in fs_file_cache_storage.cpp by changing `LOG(INFO)` to `VLOG_DEBUG`.
2. Standardized BE logging grammar across ~130 files by fixing Chinglish patterns such as 'start to', 'begin to', 'finished to', 'can not' and typos like 'destoried' and 'envorinment'.
3. Ensured that key log lines start with an uppercase letter, following Doris logging standards.

### Release note

None

### Check List (For Author)

- Test: Manual test
- Behavior changed: No
- Does this need documentation: No
Issue Number: close #xxx

Problem Summary:
1. Standardized FE and BE logging grammar based on AGENTS.md rules ('Start [verb]ing', 'Begin to [verb]', 'Finished [verb]ing').
2. Reduced massive log spam in fs_file_cache_storage.cpp.
3. Corrected misspellings (e.g. 'destoried', 'envorinment', 'founded', 'can not' -> 'cannot') and capitalized key info logs according to community standards.

None

- Test: Manual test
- Behavior changed: No
- Does this need documentation: No
@morningman
Copy link
Contributor Author

run buildall

@doris-robot
Copy link

TPC-H: Total hot run time: 26804 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpch-tools
Tpch sf100 test result on commit 5bf2da68fc00ba012d39d84e8479f5ef5664d343, data reload: false

------ Round 1 ----------------------------------
orders	Doris	NULL	NULL	0	0	0	NULL	0	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	17662	4497	4305	4305
q2	q3	10637	792	516	516
q4	4683	367	249	249
q5	7594	1231	1021	1021
q6	179	175	148	148
q7	802	847	682	682
q8	10030	1517	1352	1352
q9	5521	4769	4539	4539
q10	6318	1950	1655	1655
q11	483	258	235	235
q12	754	586	463	463
q13	18055	2724	1936	1936
q14	238	228	216	216
q15	q16	756	752	666	666
q17	730	858	435	435
q18	5932	5505	5282	5282
q19	1484	977	615	615
q20	531	484	371	371
q21	4537	1865	1808	1808
q22	461	381	310	310
Total cold run time: 97387 ms
Total hot run time: 26804 ms

----- Round 2, with runtime_filter_mode=off -----
orders	Doris	NULL	NULL	150000000	42	6422171781	NULL	22778155	NULL	NULL	2023-12-26 18:27:23	2023-12-26 18:42:55	NULL	utf-8	NULL	NULL	
============================================
q1	4830	4620	4788	4620
q2	q3	4027	4330	3842	3842
q4	921	1245	770	770
q5	4120	4498	4460	4460
q6	188	180	141	141
q7	1764	1681	1546	1546
q8	2541	2808	2640	2640
q9	7595	7368	7379	7368
q10	3863	4120	3630	3630
q11	598	431	434	431
q12	486	600	442	442
q13	2461	2902	2082	2082
q14	301	308	273	273
q15	q16	721	786	736	736
q17	1192	1420	1430	1420
q18	7295	6841	6647	6647
q19	897	929	916	916
q20	2078	2174	1981	1981
q21	3983	3478	3332	3332
q22	451	455	409	409
Total cold run time: 50312 ms
Total hot run time: 47686 ms

@doris-robot
Copy link

TPC-DS: Total hot run time: 168643 ms
machine: 'aliyun_ecs.c7a.8xlarge_32C64G'
scripts: https://github.com/apache/doris/tree/master/tools/tpcds-tools
TPC-DS sf100 test result on commit 5bf2da68fc00ba012d39d84e8479f5ef5664d343, data reload: false

query5	4332	639	496	496
query6	336	236	209	209
query7	4200	466	272	272
query8	358	257	229	229
query9	8719	2705	2732	2705
query10	546	396	346	346
query11	7035	5096	4849	4849
query12	188	132	141	132
query13	1276	461	352	352
query14	5765	3708	3498	3498
query14_1	2863	2837	2797	2797
query15	208	191	177	177
query16	987	480	488	480
query17	1139	754	642	642
query18	2457	459	353	353
query19	218	229	189	189
query20	135	132	126	126
query21	214	138	119	119
query22	13128	13943	15004	13943
query23	16767	16390	16075	16075
query23_1	16033	16097	15724	15724
query24	7197	1632	1225	1225
query24_1	1238	1226	1250	1226
query25	626	467	417	417
query26	1106	271	150	150
query27	2764	478	293	293
query28	4453	1844	1834	1834
query29	858	572	469	469
query30	295	228	188	188
query31	994	923	876	876
query32	80	71	71	71
query33	503	350	285	285
query34	915	871	532	532
query35	638	690	589	589
query36	1042	1133	1003	1003
query37	140	92	82	82
query38	2937	2982	2860	2860
query39	858	838	810	810
query39_1	792	805	805	805
query40	233	152	135	135
query41	62	99	59	59
query42	260	254	255	254
query43	238	255	223	223
query44	
query45	199	189	183	183
query46	887	982	616	616
query47	2130	2150	2071	2071
query48	335	309	229	229
query49	635	452	393	393
query50	690	275	215	215
query51	4049	4052	3940	3940
query52	263	266	255	255
query53	286	335	287	287
query54	306	271	265	265
query55	97	89	84	84
query56	318	323	324	323
query57	1944	1614	1734	1614
query58	279	273	274	273
query59	2760	2949	2712	2712
query60	346	338	327	327
query61	161	158	156	156
query62	629	590	538	538
query63	309	281	278	278
query64	4979	1297	1042	1042
query65	
query66	1428	453	367	367
query67	24188	24272	24112	24112
query68	
query69	412	314	292	292
query70	982	959	868	868
query71	338	315	297	297
query72	2936	2659	2503	2503
query73	537	552	313	313
query74	9614	9591	9360	9360
query75	2847	2807	2484	2484
query76	2306	1060	680	680
query77	370	388	310	310
query78	10973	11022	10488	10488
query79	3162	748	566	566
query80	1730	626	556	556
query81	577	282	224	224
query82	1005	152	117	117
query83	335	260	247	247
query84	253	117	99	99
query85	922	486	447	447
query86	503	333	332	332
query87	3080	3169	2998	2998
query88	3714	2651	2635	2635
query89	423	367	341	341
query90	2103	182	179	179
query91	175	168	137	137
query92	90	71	71	71
query93	2879	848	485	485
query94	639	309	266	266
query95	601	406	320	320
query96	645	524	229	229
query97	2478	2516	2386	2386
query98	240	223	217	217
query99	1003	1036	945	945
Total cold run time: 254437 ms
Total hot run time: 168643 ms

@hello-stephen
Copy link
Contributor

BE UT Coverage Report

Increment line coverage 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 52.86% (19885/37621)
Line Coverage 36.34% (185772/511139)
Region Coverage 32.58% (143863/441561)
Branch Coverage 33.80% (63011/186399)

@hello-stephen
Copy link
Contributor

BE Regression && UT Coverage Report

Increment line coverage 100% (0/0) 🎉

Increment coverage report
Complete coverage report

Category Coverage
Function Coverage 72.96% (26878/36838)
Line Coverage 56.41% (287436/509576)
Region Coverage 53.67% (239207/445687)
Branch Coverage 55.45% (103677/186963)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants