Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

File detected as iso-8859-2, but it is Windows-1250 #141

Open
petrbizon opened this issue May 17, 2022 · 3 comments
Open

File detected as iso-8859-2, but it is Windows-1250 #141

petrbizon opened this issue May 17, 2022 · 3 comments

Comments

@petrbizon
Copy link

Hello, I attached the file which is in Windows-1250, but library detects it as iso-8859-2. Thank You for trying. Petr B.
774723_PPBP_0510_02_gu.csv

@304NotModified
Copy link
Member

Could you please add the code call?

I'ts important to know if you use DetectFromFile/DetectFromStream/DetectFromBytes/ and with of without loops

@petrbizon
Copy link
Author

petrbizon commented May 17, 2022

I receive bytes from web request (byte[] data)

and then I am working with them:

DetectionResult charsetDetectorResult = CharsetDetector.DetectFromBytes(data);
DetectionDetail resultDetected = charsetDetectorResult.Detected;
string encodingName = resultDetected.EncodingName;

@petrbizon
Copy link
Author

petrbizon commented Sep 14, 2023

I tried to look on the file in notepad++ with detected codepage iso-8859-2 and there is wrong character 'ž'
image
when I am looking on the file with codepage Windows-1250, the character 'ž' is showed right.
image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants