Ideas - WORD_SEPARATORS should be configable#
The current WORD_SEPARATORS is ",.|;+=&()", but for chinese, there are more word separators like ",。:§·" etc. If a CamelCase word is close with such separator, for example: CamelCase,It will not be separated as a word, and will not have a url to the CamelCase page.