How to use fgets if you don't know the number of characters to be read?

vette982 picture vette982 · May 21, 2010 · Viewed 20.3k times · Source

I need to read a file and send the text from it to a string so I can parse it. However, the program won't know exactly how long the file is, so what would I do if I wanted to use fgets(), or is there a better alternative?

Note:

char *fgets(char *str, size_t num, FILE *stream);

Answer

Jonathan Leffler picture Jonathan Leffler · May 21, 2010

Don't forget that fgets() reads a line at a time, subject to having enough space.

Humans seldom write lines longer than ... 80, 256, pick a number ... characters. POSIX suggests a line length of 4096. So, I usually use:

char buffer[4096];

while (fgets(buffer, sizeof(buffer), fp)) 
{
    ...process line...
}

If you are worried that someone might provide more than 4K of data in a single line (and a machine generated file, such as HTML or JSON, might contain that), then you have to decide what to do next. You can do any of the following (and there are likely some other options I've not mentioned):

  1. Process the over-long lines in bits without assuming that there was a newline in between.
  2. Allocate memory for a longer line (say 8K to start with), copy the initial 4K into the allocated buffer, and read more data into the second half of the buffer, iterating until you find the end of line.
  3. Use the POSIX 2008 function getline() which is available on Linux. It does memory allocation for you.