I want to read unsigned bytes from a binary file. So I wrote the following code.
#include <iostream>
#include <fstream>
#include <vector>
#include <istream>
std::string filename("file");
size_t bytesAvailable = 128;
size_t toRead = 128;
std::basic_ifstream<unsigned char> inf(filename.c_str(), std::ios_base::in | std::ios_base::binary) ;
if (inF.good())
{
std::vector<unsigned char> mDataBuffer;
mDataBuffer.resize(bytesAvailable) ;
inF.read(&mDataBuffer[0], toRead) ;
size_t counted = inF.gcount() ;
}
This results in reading in always 0 bytes as shown by the variable counted.
There seem to be references on the web saying that I need to set the locale to make this work. How to do this exactly is not clear to me.
The same code works using the data type 'char' instead of 'unsigned char'
The above code using unsigned char seems to work on Windows but fails running in a colinux Fedora 2.6.22.18 .
What do I need to do to get it to work for linux?
C++ does require the implementation only to provide explicit specializations for two versions of character traits:
std::char_traits<char>
std::char_traits<wchar_t>
The streams and strings use those traits to figure out a variety of things, like the EOF value, comparison of a range of characters, widening of a character to an int, and such stuff.
If you instantiate a stream like
std::basic_ifstream<unsigned char>
You have to make sure that there is a corresponding character trait specialization that the stream can use and that this specialization does do useful things. In addition, streams use facets to do actual formatting and reading of numbers. Likewise you have to provide specializations of those too manually. The standard doesn't even require the implementation to have a complete definition of the primary template. So you could aswell get a compile error:
error: specialization std::char_traits could not be instantiated.
I would use ifstream
instead (which is a basic_ifstream<char>
) and then go and read into a vector<char>
. When interpreting the data in the vector, you can still convert them to unsigned char
later.