Coverage Report

Created: 2026-01-17 07:08

next uncovered line (L), next uncovered region (R), next uncovered branch (B)
/src/stb/stb_image.h
Line
Count
Source
1
/* stb_image - v2.30 - public domain image loader - http://nothings.org/stb
2
                                  no warranty implied; use at your own risk
3
4
   Do this:
5
      #define STB_IMAGE_IMPLEMENTATION
6
   before you include this file in *one* C or C++ file to create the implementation.
7
8
   // i.e. it should look like this:
9
   #include ...
10
   #include ...
11
   #include ...
12
   #define STB_IMAGE_IMPLEMENTATION
13
   #include "stb_image.h"
14
15
   You can #define STBI_ASSERT(x) before the #include to avoid using assert.h.
16
   And #define STBI_MALLOC, STBI_REALLOC, and STBI_FREE to avoid using malloc,realloc,free
17
18
19
   QUICK NOTES:
20
      Primarily of interest to game developers and other people who can
21
          avoid problematic images and only need the trivial interface
22
23
      JPEG baseline & progressive (12 bpc/arithmetic not supported, same as stock IJG lib)
24
      PNG 1/2/4/8/16-bit-per-channel
25
26
      TGA (not sure what subset, if a subset)
27
      BMP non-1bpp, non-RLE
28
      PSD (composited view only, no extra channels, 8/16 bit-per-channel)
29
30
      GIF (*comp always reports as 4-channel)
31
      HDR (radiance rgbE format)
32
      PIC (Softimage PIC)
33
      PNM (PPM and PGM binary only)
34
35
      Animated GIF still needs a proper API, but here's one way to do it:
36
          http://gist.github.com/urraka/685d9a6340b26b830d49
37
38
      - decode from memory or through FILE (define STBI_NO_STDIO to remove code)
39
      - decode from arbitrary I/O callbacks
40
      - SIMD acceleration on x86/x64 (SSE2) and ARM (NEON)
41
42
   Full documentation under "DOCUMENTATION" below.
43
44
45
LICENSE
46
47
  See end of file for license information.
48
49
RECENT REVISION HISTORY:
50
51
      2.30  (2024-05-31) avoid erroneous gcc warning
52
      2.29  (2023-05-xx) optimizations
53
      2.28  (2023-01-29) many error fixes, security errors, just tons of stuff
54
      2.27  (2021-07-11) document stbi_info better, 16-bit PNM support, bug fixes
55
      2.26  (2020-07-13) many minor fixes
56
      2.25  (2020-02-02) fix warnings
57
      2.24  (2020-02-02) fix warnings; thread-local failure_reason and flip_vertically
58
      2.23  (2019-08-11) fix clang static analysis warning
59
      2.22  (2019-03-04) gif fixes, fix warnings
60
      2.21  (2019-02-25) fix typo in comment
61
      2.20  (2019-02-07) support utf8 filenames in Windows; fix warnings and platform ifdefs
62
      2.19  (2018-02-11) fix warning
63
      2.18  (2018-01-30) fix warnings
64
      2.17  (2018-01-29) bugfix, 1-bit BMP, 16-bitness query, fix warnings
65
      2.16  (2017-07-23) all functions have 16-bit variants; optimizations; bugfixes
66
      2.15  (2017-03-18) fix png-1,2,4; all Imagenet JPGs; no runtime SSE detection on GCC
67
      2.14  (2017-03-03) remove deprecated STBI_JPEG_OLD; fixes for Imagenet JPGs
68
      2.13  (2016-12-04) experimental 16-bit API, only for PNG so far; fixes
69
      2.12  (2016-04-02) fix typo in 2.11 PSD fix that caused crashes
70
      2.11  (2016-04-02) 16-bit PNGS; enable SSE2 in non-gcc x64
71
                         RGB-format JPEG; remove white matting in PSD;
72
                         allocate large structures on the stack;
73
                         correct channel count for PNG & BMP
74
      2.10  (2016-01-22) avoid warning introduced in 2.09
75
      2.09  (2016-01-16) 16-bit TGA; comments in PNM files; STBI_REALLOC_SIZED
76
77
   See end of file for full revision history.
78
79
80
 ============================    Contributors    =========================
81
82
 Image formats                          Extensions, features
83
    Sean Barrett (jpeg, png, bmp)          Jetro Lauha (stbi_info)
84
    Nicolas Schulz (hdr, psd)              Martin "SpartanJ" Golini (stbi_info)
85
    Jonathan Dummer (tga)                  James "moose2000" Brown (iPhone PNG)
86
    Jean-Marc Lienher (gif)                Ben "Disch" Wenger (io callbacks)
87
    Tom Seddon (pic)                       Omar Cornut (1/2/4-bit PNG)
88
    Thatcher Ulrich (psd)                  Nicolas Guillemot (vertical flip)
89
    Ken Miller (pgm, ppm)                  Richard Mitton (16-bit PSD)
90
    github:urraka (animated gif)           Junggon Kim (PNM comments)
91
    Christopher Forseth (animated gif)     Daniel Gibson (16-bit TGA)
92
                                           socks-the-fox (16-bit PNG)
93
                                           Jeremy Sawicki (handle all ImageNet JPGs)
94
 Optimizations & bugfixes                  Mikhail Morozov (1-bit BMP)
95
    Fabian "ryg" Giesen                    Anael Seghezzi (is-16-bit query)
96
    Arseny Kapoulkine                      Simon Breuss (16-bit PNM)
97
    John-Mark Allen
98
    Carmelo J Fdez-Aguera
99
100
 Bug & warning fixes
101
    Marc LeBlanc            David Woo          Guillaume George     Martins Mozeiko
102
    Christpher Lloyd        Jerry Jansson      Joseph Thomson       Blazej Dariusz Roszkowski
103
    Phil Jordan                                Dave Moore           Roy Eltham
104
    Hayaki Saito            Nathan Reed        Won Chun
105
    Luke Graham             Johan Duparc       Nick Verigakis       the Horde3D community
106
    Thomas Ruf              Ronny Chevalier                         github:rlyeh
107
    Janez Zemva             John Bartholomew   Michal Cichon        github:romigrou
108
    Jonathan Blow           Ken Hamada         Tero Hanninen        github:svdijk
109
    Eugene Golushkov        Laurent Gomila     Cort Stratton        github:snagar
110
    Aruelien Pocheville     Sergio Gonzalez    Thibault Reuille     github:Zelex
111
    Cass Everitt            Ryamond Barbiero                        github:grim210
112
    Paul Du Bois            Engin Manap        Aldo Culquicondor    github:sammyhw
113
    Philipp Wiesemann       Dale Weiler        Oriol Ferrer Mesia   github:phprus
114
    Josh Tobin              Neil Bickford      Matthew Gregan       github:poppolopoppo
115
    Julian Raschke          Gregory Mullen     Christian Floisand   github:darealshinji
116
    Baldur Karlsson         Kevin Schmidt      JR Smith             github:Michaelangel007
117
                            Brad Weinberger    Matvey Cherevko      github:mosra
118
    Luca Sas                Alexander Veselov  Zack Middleton       [reserved]
119
    Ryan C. Gordon          [reserved]                              [reserved]
120
                     DO NOT ADD YOUR NAME HERE
121
122
                     Jacko Dirks
123
124
  To add your name to the credits, pick a random blank space in the middle and fill it.
125
  80% of merge conflicts on stb PRs are due to people adding their name at the end
126
  of the credits.
127
*/
128
129
#ifndef STBI_INCLUDE_STB_IMAGE_H
130
#define STBI_INCLUDE_STB_IMAGE_H
131
132
// DOCUMENTATION
133
//
134
// Limitations:
135
//    - no 12-bit-per-channel JPEG
136
//    - no JPEGs with arithmetic coding
137
//    - GIF always returns *comp=4
138
//
139
// Basic usage (see HDR discussion below for HDR usage):
140
//    int x,y,n;
141
//    unsigned char *data = stbi_load(filename, &x, &y, &n, 0);
142
//    // ... process data if not NULL ...
143
//    // ... x = width, y = height, n = # 8-bit components per pixel ...
144
//    // ... replace '0' with '1'..'4' to force that many components per pixel
145
//    // ... but 'n' will always be the number that it would have been if you said 0
146
//    stbi_image_free(data);
147
//
148
// Standard parameters:
149
//    int *x                 -- outputs image width in pixels
150
//    int *y                 -- outputs image height in pixels
151
//    int *channels_in_file  -- outputs # of image components in image file
152
//    int desired_channels   -- if non-zero, # of image components requested in result
153
//
154
// The return value from an image loader is an 'unsigned char *' which points
155
// to the pixel data, or NULL on an allocation failure or if the image is
156
// corrupt or invalid. The pixel data consists of *y scanlines of *x pixels,
157
// with each pixel consisting of N interleaved 8-bit components; the first
158
// pixel pointed to is top-left-most in the image. There is no padding between
159
// image scanlines or between pixels, regardless of format. The number of
160
// components N is 'desired_channels' if desired_channels is non-zero, or
161
// *channels_in_file otherwise. If desired_channels is non-zero,
162
// *channels_in_file has the number of components that _would_ have been
163
// output otherwise. E.g. if you set desired_channels to 4, you will always
164
// get RGBA output, but you can check *channels_in_file to see if it's trivially
165
// opaque because e.g. there were only 3 channels in the source image.
166
//
167
// An output image with N components has the following components interleaved
168
// in this order in each pixel:
169
//
170
//     N=#comp     components
171
//       1           grey
172
//       2           grey, alpha
173
//       3           red, green, blue
174
//       4           red, green, blue, alpha
175
//
176
// If image loading fails for any reason, the return value will be NULL,
177
// and *x, *y, *channels_in_file will be unchanged. The function
178
// stbi_failure_reason() can be queried for an extremely brief, end-user
179
// unfriendly explanation of why the load failed. Define STBI_NO_FAILURE_STRINGS
180
// to avoid compiling these strings at all, and STBI_FAILURE_USERMSG to get slightly
181
// more user-friendly ones.
182
//
183
// Paletted PNG, BMP, GIF, and PIC images are automatically depalettized.
184
//
185
// To query the width, height and component count of an image without having to
186
// decode the full file, you can use the stbi_info family of functions:
187
//
188
//   int x,y,n,ok;
189
//   ok = stbi_info(filename, &x, &y, &n);
190
//   // returns ok=1 and sets x, y, n if image is a supported format,
191
//   // 0 otherwise.
192
//
193
// Note that stb_image pervasively uses ints in its public API for sizes,
194
// including sizes of memory buffers. This is now part of the API and thus
195
// hard to change without causing breakage. As a result, the various image
196
// loaders all have certain limits on image size; these differ somewhat
197
// by format but generally boil down to either just under 2GB or just under
198
// 1GB. When the decoded image would be larger than this, stb_image decoding
199
// will fail.
200
//
201
// Additionally, stb_image will reject image files that have any of their
202
// dimensions set to a larger value than the configurable STBI_MAX_DIMENSIONS,
203
// which defaults to 2**24 = 16777216 pixels. Due to the above memory limit,
204
// the only way to have an image with such dimensions load correctly
205
// is for it to have a rather extreme aspect ratio. Either way, the
206
// assumption here is that such larger images are likely to be malformed
207
// or malicious. If you do need to load an image with individual dimensions
208
// larger than that, and it still fits in the overall size limit, you can
209
// #define STBI_MAX_DIMENSIONS on your own to be something larger.
210
//
211
// ===========================================================================
212
//
213
// UNICODE:
214
//
215
//   If compiling for Windows and you wish to use Unicode filenames, compile
216
//   with
217
//       #define STBI_WINDOWS_UTF8
218
//   and pass utf8-encoded filenames. Call stbi_convert_wchar_to_utf8 to convert
219
//   Windows wchar_t filenames to utf8.
220
//
221
// ===========================================================================
222
//
223
// Philosophy
224
//
225
// stb libraries are designed with the following priorities:
226
//
227
//    1. easy to use
228
//    2. easy to maintain
229
//    3. good performance
230
//
231
// Sometimes I let "good performance" creep up in priority over "easy to maintain",
232
// and for best performance I may provide less-easy-to-use APIs that give higher
233
// performance, in addition to the easy-to-use ones. Nevertheless, it's important
234
// to keep in mind that from the standpoint of you, a client of this library,
235
// all you care about is #1 and #3, and stb libraries DO NOT emphasize #3 above all.
236
//
237
// Some secondary priorities arise directly from the first two, some of which
238
// provide more explicit reasons why performance can't be emphasized.
239
//
240
//    - Portable ("ease of use")
241
//    - Small source code footprint ("easy to maintain")
242
//    - No dependencies ("ease of use")
243
//
244
// ===========================================================================
245
//
246
// I/O callbacks
247
//
248
// I/O callbacks allow you to read from arbitrary sources, like packaged
249
// files or some other source. Data read from callbacks are processed
250
// through a small internal buffer (currently 128 bytes) to try to reduce
251
// overhead.
252
//
253
// The three functions you must define are "read" (reads some bytes of data),
254
// "skip" (skips some bytes of data), "eof" (reports if the stream is at the end).
255
//
256
// ===========================================================================
257
//
258
// SIMD support
259
//
260
// The JPEG decoder will try to automatically use SIMD kernels on x86 when
261
// supported by the compiler. For ARM Neon support, you must explicitly
262
// request it.
263
//
264
// (The old do-it-yourself SIMD API is no longer supported in the current
265
// code.)
266
//
267
// On x86, SSE2 will automatically be used when available based on a run-time
268
// test; if not, the generic C versions are used as a fall-back. On ARM targets,
269
// the typical path is to have separate builds for NEON and non-NEON devices
270
// (at least this is true for iOS and Android). Therefore, the NEON support is
271
// toggled by a build flag: define STBI_NEON to get NEON loops.
272
//
273
// If for some reason you do not want to use any of SIMD code, or if
274
// you have issues compiling it, you can disable it entirely by
275
// defining STBI_NO_SIMD.
276
//
277
// ===========================================================================
278
//
279
// HDR image support   (disable by defining STBI_NO_HDR)
280
//
281
// stb_image supports loading HDR images in general, and currently the Radiance
282
// .HDR file format specifically. You can still load any file through the existing
283
// interface; if you attempt to load an HDR file, it will be automatically remapped
284
// to LDR, assuming gamma 2.2 and an arbitrary scale factor defaulting to 1;
285
// both of these constants can be reconfigured through this interface:
286
//
287
//     stbi_hdr_to_ldr_gamma(2.2f);
288
//     stbi_hdr_to_ldr_scale(1.0f);
289
//
290
// (note, do not use _inverse_ constants; stbi_image will invert them
291
// appropriately).
292
//
293
// Additionally, there is a new, parallel interface for loading files as
294
// (linear) floats to preserve the full dynamic range:
295
//
296
//    float *data = stbi_loadf(filename, &x, &y, &n, 0);
297
//
298
// If you load LDR images through this interface, those images will
299
// be promoted to floating point values, run through the inverse of
300
// constants corresponding to the above:
301
//
302
//     stbi_ldr_to_hdr_scale(1.0f);
303
//     stbi_ldr_to_hdr_gamma(2.2f);
304
//
305
// Finally, given a filename (or an open file or memory block--see header
306
// file for details) containing image data, you can query for the "most
307
// appropriate" interface to use (that is, whether the image is HDR or
308
// not), using:
309
//
310
//     stbi_is_hdr(char *filename);
311
//
312
// ===========================================================================
313
//
314
// iPhone PNG support:
315
//
316
// We optionally support converting iPhone-formatted PNGs (which store
317
// premultiplied BGRA) back to RGB, even though they're internally encoded
318
// differently. To enable this conversion, call
319
// stbi_convert_iphone_png_to_rgb(1).
320
//
321
// Call stbi_set_unpremultiply_on_load(1) as well to force a divide per
322
// pixel to remove any premultiplied alpha *only* if the image file explicitly
323
// says there's premultiplied data (currently only happens in iPhone images,
324
// and only if iPhone convert-to-rgb processing is on).
325
//
326
// ===========================================================================
327
//
328
// ADDITIONAL CONFIGURATION
329
//
330
//  - You can suppress implementation of any of the decoders to reduce
331
//    your code footprint by #defining one or more of the following
332
//    symbols before creating the implementation.
333
//
334
//        STBI_NO_JPEG
335
//        STBI_NO_PNG
336
//        STBI_NO_BMP
337
//        STBI_NO_PSD
338
//        STBI_NO_TGA
339
//        STBI_NO_GIF
340
//        STBI_NO_HDR
341
//        STBI_NO_PIC
342
//        STBI_NO_PNM   (.ppm and .pgm)
343
//
344
//  - You can request *only* certain decoders and suppress all other ones
345
//    (this will be more forward-compatible, as addition of new decoders
346
//    doesn't require you to disable them explicitly):
347
//
348
//        STBI_ONLY_JPEG
349
//        STBI_ONLY_PNG
350
//        STBI_ONLY_BMP
351
//        STBI_ONLY_PSD
352
//        STBI_ONLY_TGA
353
//        STBI_ONLY_GIF
354
//        STBI_ONLY_HDR
355
//        STBI_ONLY_PIC
356
//        STBI_ONLY_PNM   (.ppm and .pgm)
357
//
358
//   - If you use STBI_NO_PNG (or _ONLY_ without PNG), and you still
359
//     want the zlib decoder to be available, #define STBI_SUPPORT_ZLIB
360
//
361
//  - If you define STBI_MAX_DIMENSIONS, stb_image will reject images greater
362
//    than that size (in either width or height) without further processing.
363
//    This is to let programs in the wild set an upper bound to prevent
364
//    denial-of-service attacks on untrusted data, as one could generate a
365
//    valid image of gigantic dimensions and force stb_image to allocate a
366
//    huge block of memory and spend disproportionate time decoding it. By
367
//    default this is set to (1 << 24), which is 16777216, but that's still
368
//    very big.
369
370
#ifndef STBI_NO_STDIO
371
#include <stdio.h>
372
#endif // STBI_NO_STDIO
373
374
#define STBI_VERSION 1
375
376
enum
377
{
378
   STBI_default = 0, // only used for desired_channels
379
380
   STBI_grey       = 1,
381
   STBI_grey_alpha = 2,
382
   STBI_rgb        = 3,
383
   STBI_rgb_alpha  = 4
384
};
385
386
#include <stdlib.h>
387
typedef unsigned char stbi_uc;
388
typedef unsigned short stbi_us;
389
390
#ifdef __cplusplus
391
extern "C" {
392
#endif
393
394
#ifndef STBIDEF
395
#ifdef STB_IMAGE_STATIC
396
#define STBIDEF static
397
#else
398
#define STBIDEF extern
399
#endif
400
#endif
401
402
//////////////////////////////////////////////////////////////////////////////
403
//
404
// PRIMARY API - works on images of any type
405
//
406
407
//
408
// load image by filename, open file, or memory buffer
409
//
410
411
typedef struct
412
{
413
   int      (*read)  (void *user,char *data,int size);   // fill 'data' with 'size' bytes.  return number of bytes actually read
414
   void     (*skip)  (void *user,int n);                 // skip the next 'n' bytes, or 'unget' the last -n bytes if negative
415
   int      (*eof)   (void *user);                       // returns nonzero if we are at end of file/data
416
} stbi_io_callbacks;
417
418
////////////////////////////////////
419
//
420
// 8-bits-per-channel interface
421
//
422
423
STBIDEF stbi_uc *stbi_load_from_memory   (stbi_uc           const *buffer, int len   , int *x, int *y, int *channels_in_file, int desired_channels);
424
STBIDEF stbi_uc *stbi_load_from_callbacks(stbi_io_callbacks const *clbk  , void *user, int *x, int *y, int *channels_in_file, int desired_channels);
425
426
#ifndef STBI_NO_STDIO
427
STBIDEF stbi_uc *stbi_load            (char const *filename, int *x, int *y, int *channels_in_file, int desired_channels);
428
STBIDEF stbi_uc *stbi_load_from_file  (FILE *f, int *x, int *y, int *channels_in_file, int desired_channels);
429
// for stbi_load_from_file, file pointer is left pointing immediately after image
430
#endif
431
432
#ifndef STBI_NO_GIF
433
STBIDEF stbi_uc *stbi_load_gif_from_memory(stbi_uc const *buffer, int len, int **delays, int *x, int *y, int *z, int *comp, int req_comp);
434
#endif
435
436
#ifdef STBI_WINDOWS_UTF8
437
STBIDEF int stbi_convert_wchar_to_utf8(char *buffer, size_t bufferlen, const wchar_t* input);
438
#endif
439
440
////////////////////////////////////
441
//
442
// 16-bits-per-channel interface
443
//
444
445
STBIDEF stbi_us *stbi_load_16_from_memory   (stbi_uc const *buffer, int len, int *x, int *y, int *channels_in_file, int desired_channels);
446
STBIDEF stbi_us *stbi_load_16_from_callbacks(stbi_io_callbacks const *clbk, void *user, int *x, int *y, int *channels_in_file, int desired_channels);
447
448
#ifndef STBI_NO_STDIO
449
STBIDEF stbi_us *stbi_load_16          (char const *filename, int *x, int *y, int *channels_in_file, int desired_channels);
450
STBIDEF stbi_us *stbi_load_from_file_16(FILE *f, int *x, int *y, int *channels_in_file, int desired_channels);
451
#endif
452
453
////////////////////////////////////
454
//
455
// float-per-channel interface
456
//
457
#ifndef STBI_NO_LINEAR
458
   STBIDEF float *stbi_loadf_from_memory     (stbi_uc const *buffer, int len, int *x, int *y, int *channels_in_file, int desired_channels);
459
   STBIDEF float *stbi_loadf_from_callbacks  (stbi_io_callbacks const *clbk, void *user, int *x, int *y,  int *channels_in_file, int desired_channels);
460
461
   #ifndef STBI_NO_STDIO
462
   STBIDEF float *stbi_loadf            (char const *filename, int *x, int *y, int *channels_in_file, int desired_channels);
463
   STBIDEF float *stbi_loadf_from_file  (FILE *f, int *x, int *y, int *channels_in_file, int desired_channels);
464
   #endif
465
#endif
466
467
#ifndef STBI_NO_HDR
468
   STBIDEF void   stbi_hdr_to_ldr_gamma(float gamma);
469
   STBIDEF void   stbi_hdr_to_ldr_scale(float scale);
470
#endif // STBI_NO_HDR
471
472
#ifndef STBI_NO_LINEAR
473
   STBIDEF void   stbi_ldr_to_hdr_gamma(float gamma);
474
   STBIDEF void   stbi_ldr_to_hdr_scale(float scale);
475
#endif // STBI_NO_LINEAR
476
477
// stbi_is_hdr is always defined, but always returns false if STBI_NO_HDR
478
STBIDEF int    stbi_is_hdr_from_callbacks(stbi_io_callbacks const *clbk, void *user);
479
STBIDEF int    stbi_is_hdr_from_memory(stbi_uc const *buffer, int len);
480
#ifndef STBI_NO_STDIO
481
STBIDEF int      stbi_is_hdr          (char const *filename);
482
STBIDEF int      stbi_is_hdr_from_file(FILE *f);
483
#endif // STBI_NO_STDIO
484
485
486
// get a VERY brief reason for failure
487
// on most compilers (and ALL modern mainstream compilers) this is threadsafe
488
STBIDEF const char *stbi_failure_reason  (void);
489
490
// free the loaded image -- this is just free()
491
STBIDEF void     stbi_image_free      (void *retval_from_stbi_load);
492
493
// get image dimensions & components without fully decoding
494
STBIDEF int      stbi_info_from_memory(stbi_uc const *buffer, int len, int *x, int *y, int *comp);
495
STBIDEF int      stbi_info_from_callbacks(stbi_io_callbacks const *clbk, void *user, int *x, int *y, int *comp);
496
STBIDEF int      stbi_is_16_bit_from_memory(stbi_uc const *buffer, int len);
497
STBIDEF int      stbi_is_16_bit_from_callbacks(stbi_io_callbacks const *clbk, void *user);
498
499
#ifndef STBI_NO_STDIO
500
STBIDEF int      stbi_info               (char const *filename,     int *x, int *y, int *comp);
501
STBIDEF int      stbi_info_from_file     (FILE *f,                  int *x, int *y, int *comp);
502
STBIDEF int      stbi_is_16_bit          (char const *filename);
503
STBIDEF int      stbi_is_16_bit_from_file(FILE *f);
504
#endif
505
506
507
508
// for image formats that explicitly notate that they have premultiplied alpha,
509
// we just return the colors as stored in the file. set this flag to force
510
// unpremultiplication. results are undefined if the unpremultiply overflow.
511
STBIDEF void stbi_set_unpremultiply_on_load(int flag_true_if_should_unpremultiply);
512
513
// indicate whether we should process iphone images back to canonical format,
514
// or just pass them through "as-is"
515
STBIDEF void stbi_convert_iphone_png_to_rgb(int flag_true_if_should_convert);
516
517
// flip the image vertically, so the first pixel in the output array is the bottom left
518
STBIDEF void stbi_set_flip_vertically_on_load(int flag_true_if_should_flip);
519
520
// as above, but only applies to images loaded on the thread that calls the function
521
// this function is only available if your compiler supports thread-local variables;
522
// calling it will fail to link if your compiler doesn't
523
STBIDEF void stbi_set_unpremultiply_on_load_thread(int flag_true_if_should_unpremultiply);
524
STBIDEF void stbi_convert_iphone_png_to_rgb_thread(int flag_true_if_should_convert);
525
STBIDEF void stbi_set_flip_vertically_on_load_thread(int flag_true_if_should_flip);
526
527
// ZLIB client - used by PNG, available for other purposes
528
529
STBIDEF char *stbi_zlib_decode_malloc_guesssize(const char *buffer, int len, int initial_size, int *outlen);
530
STBIDEF char *stbi_zlib_decode_malloc_guesssize_headerflag(const char *buffer, int len, int initial_size, int *outlen, int parse_header);
531
STBIDEF char *stbi_zlib_decode_malloc(const char *buffer, int len, int *outlen);
532
STBIDEF int   stbi_zlib_decode_buffer(char *obuffer, int olen, const char *ibuffer, int ilen);
533
534
STBIDEF char *stbi_zlib_decode_noheader_malloc(const char *buffer, int len, int *outlen);
535
STBIDEF int   stbi_zlib_decode_noheader_buffer(char *obuffer, int olen, const char *ibuffer, int ilen);
536
537
538
#ifdef __cplusplus
539
}
540
#endif
541
542
//
543
//
544
////   end header file   /////////////////////////////////////////////////////
545
#endif // STBI_INCLUDE_STB_IMAGE_H
546
547
#ifdef STB_IMAGE_IMPLEMENTATION
548
549
#if defined(STBI_ONLY_JPEG) || defined(STBI_ONLY_PNG) || defined(STBI_ONLY_BMP) \
550
  || defined(STBI_ONLY_TGA) || defined(STBI_ONLY_GIF) || defined(STBI_ONLY_PSD) \
551
  || defined(STBI_ONLY_HDR) || defined(STBI_ONLY_PIC) || defined(STBI_ONLY_PNM) \
552
  || defined(STBI_ONLY_ZLIB)
553
   #ifndef STBI_ONLY_JPEG
554
   #define STBI_NO_JPEG
555
   #endif
556
   #ifndef STBI_ONLY_PNG
557
   #define STBI_NO_PNG
558
   #endif
559
   #ifndef STBI_ONLY_BMP
560
   #define STBI_NO_BMP
561
   #endif
562
   #ifndef STBI_ONLY_PSD
563
   #define STBI_NO_PSD
564
   #endif
565
   #ifndef STBI_ONLY_TGA
566
   #define STBI_NO_TGA
567
   #endif
568
   #ifndef STBI_ONLY_GIF
569
   #define STBI_NO_GIF
570
   #endif
571
   #ifndef STBI_ONLY_HDR
572
   #define STBI_NO_HDR
573
   #endif
574
   #ifndef STBI_ONLY_PIC
575
   #define STBI_NO_PIC
576
   #endif
577
   #ifndef STBI_ONLY_PNM
578
   #define STBI_NO_PNM
579
   #endif
580
#endif
581
582
#if defined(STBI_NO_PNG) && !defined(STBI_SUPPORT_ZLIB) && !defined(STBI_NO_ZLIB)
583
#define STBI_NO_ZLIB
584
#endif
585
586
587
#include <stdarg.h>
588
#include <stddef.h> // ptrdiff_t on osx
589
#include <stdlib.h>
590
#include <string.h>
591
#include <limits.h>
592
593
#if !defined(STBI_NO_LINEAR) || !defined(STBI_NO_HDR)
594
#include <math.h>  // ldexp, pow
595
#endif
596
597
#ifndef STBI_NO_STDIO
598
#include <stdio.h>
599
#endif
600
601
#ifndef STBI_ASSERT
602
#include <assert.h>
603
4.38G
#define STBI_ASSERT(x) assert(x)
604
#endif
605
606
#ifdef __cplusplus
607
#define STBI_EXTERN extern "C"
608
#else
609
#define STBI_EXTERN extern
610
#endif
611
612
613
#ifndef _MSC_VER
614
   #ifdef __cplusplus
615
   #define stbi_inline inline
616
   #else
617
   #define stbi_inline
618
   #endif
619
#else
620
   #define stbi_inline __forceinline
621
#endif
622
623
#ifndef STBI_NO_THREAD_LOCALS
624
   #if defined(__cplusplus) &&  __cplusplus >= 201103L
625
      #define STBI_THREAD_LOCAL       thread_local
626
   #elif defined(__GNUC__) && __GNUC__ < 5
627
      #define STBI_THREAD_LOCAL       __thread
628
   #elif defined(_MSC_VER)
629
      #define STBI_THREAD_LOCAL       __declspec(thread)
630
   #elif defined (__STDC_VERSION__) && __STDC_VERSION__ >= 201112L && !defined(__STDC_NO_THREADS__)
631
      #define STBI_THREAD_LOCAL       _Thread_local
632
   #endif
633
634
   #ifndef STBI_THREAD_LOCAL
635
      #if defined(__GNUC__)
636
        #define STBI_THREAD_LOCAL       __thread
637
      #endif
638
   #endif
639
#endif
640
641
#if defined(_MSC_VER) || defined(__SYMBIAN32__)
642
typedef unsigned short stbi__uint16;
643
typedef   signed short stbi__int16;
644
typedef unsigned int   stbi__uint32;
645
typedef   signed int   stbi__int32;
646
#else
647
#include <stdint.h>
648
typedef uint16_t stbi__uint16;
649
typedef int16_t  stbi__int16;
650
typedef uint32_t stbi__uint32;
651
typedef int32_t  stbi__int32;
652
#endif
653
654
// should produce compiler error if size is wrong
655
typedef unsigned char validate_uint32[sizeof(stbi__uint32)==4 ? 1 : -1];
656
657
#ifdef _MSC_VER
658
#define STBI_NOTUSED(v)  (void)(v)
659
#else
660
51.2M
#define STBI_NOTUSED(v)  (void)sizeof(v)
661
#endif
662
663
#ifdef _MSC_VER
664
#define STBI_HAS_LROTL
665
#endif
666
667
#ifdef STBI_HAS_LROTL
668
   #define stbi_lrot(x,y)  _lrotl(x,y)
669
#else
670
83.7M
   #define stbi_lrot(x,y)  (((x) << (y)) | ((x) >> (-(y) & 31)))
671
#endif
672
673
#if defined(STBI_MALLOC) && defined(STBI_FREE) && (defined(STBI_REALLOC) || defined(STBI_REALLOC_SIZED))
674
// ok
675
#elif !defined(STBI_MALLOC) && !defined(STBI_FREE) && !defined(STBI_REALLOC) && !defined(STBI_REALLOC_SIZED)
676
// ok
677
#else
678
#error "Must define all or none of STBI_MALLOC, STBI_FREE, and STBI_REALLOC (or STBI_REALLOC_SIZED)."
679
#endif
680
681
#ifndef STBI_MALLOC
682
36.4k
#define STBI_MALLOC(sz)           malloc(sz)
683
2.27k
#define STBI_REALLOC(p,newsz)     realloc(p,newsz)
684
37.6k
#define STBI_FREE(p)              free(p)
685
#endif
686
687
#ifndef STBI_REALLOC_SIZED
688
2.27k
#define STBI_REALLOC_SIZED(p,oldsz,newsz) STBI_REALLOC(p,newsz)
689
#endif
690
691
// x86/x64 detection
692
#if defined(__x86_64__) || defined(_M_X64)
693
#define STBI__X64_TARGET
694
#elif defined(__i386) || defined(_M_IX86)
695
#define STBI__X86_TARGET
696
#endif
697
698
#if defined(__GNUC__) && defined(STBI__X86_TARGET) && !defined(__SSE2__) && !defined(STBI_NO_SIMD)
699
// gcc doesn't support sse2 intrinsics unless you compile with -msse2,
700
// which in turn means it gets to use SSE2 everywhere. This is unfortunate,
701
// but previous attempts to provide the SSE2 functions with runtime
702
// detection caused numerous issues. The way architecture extensions are
703
// exposed in GCC/Clang is, sadly, not really suited for one-file libs.
704
// New behavior: if compiled with -msse2, we use SSE2 without any
705
// detection; if not, we don't use it at all.
706
#define STBI_NO_SIMD
707
#endif
708
709
#if defined(__MINGW32__) && defined(STBI__X86_TARGET) && !defined(STBI_MINGW_ENABLE_SSE2) && !defined(STBI_NO_SIMD)
710
// Note that __MINGW32__ doesn't actually mean 32-bit, so we have to avoid STBI__X64_TARGET
711
//
712
// 32-bit MinGW wants ESP to be 16-byte aligned, but this is not in the
713
// Windows ABI and VC++ as well as Windows DLLs don't maintain that invariant.
714
// As a result, enabling SSE2 on 32-bit MinGW is dangerous when not
715
// simultaneously enabling "-mstackrealign".
716
//
717
// See https://github.com/nothings/stb/issues/81 for more information.
718
//
719
// So default to no SSE2 on 32-bit MinGW. If you've read this far and added
720
// -mstackrealign to your build settings, feel free to #define STBI_MINGW_ENABLE_SSE2.
721
#define STBI_NO_SIMD
722
#endif
723
724
#if !defined(STBI_NO_SIMD) && (defined(STBI__X86_TARGET) || defined(STBI__X64_TARGET))
725
#define STBI_SSE2
726
#include <emmintrin.h>
727
728
#ifdef _MSC_VER
729
730
#if _MSC_VER >= 1400  // not VC6
731
#include <intrin.h> // __cpuid
732
static int stbi__cpuid3(void)
733
{
734
   int info[4];
735
   __cpuid(info,1);
736
   return info[3];
737
}
738
#else
739
static int stbi__cpuid3(void)
740
{
741
   int res;
742
   __asm {
743
      mov  eax,1
744
      cpuid
745
      mov  res,edx
746
   }
747
   return res;
748
}
749
#endif
750
751
#define STBI_SIMD_ALIGN(type, name) __declspec(align(16)) type name
752
753
#if !defined(STBI_NO_JPEG) && defined(STBI_SSE2)
754
static int stbi__sse2_available(void)
755
{
756
   int info3 = stbi__cpuid3();
757
   return ((info3 >> 26) & 1) != 0;
758
}
759
#endif
760
761
#else // assume GCC-style if not VC++
762
2.66k
#define STBI_SIMD_ALIGN(type, name) type name __attribute__((aligned(16)))
763
764
#if !defined(STBI_NO_JPEG) && defined(STBI_SSE2)
765
static int stbi__sse2_available(void)
766
4.75k
{
767
   // If we're even attempting to compile this on GCC/Clang, that means
768
   // -msse2 is on, which means the compiler is allowed to use SSE2
769
   // instructions at will, and so are we.
770
4.75k
   return 1;
771
4.75k
}
772
#endif
773
774
#endif
775
#endif
776
777
// ARM NEON
778
#if defined(STBI_NO_SIMD) && defined(STBI_NEON)
779
#undef STBI_NEON
780
#endif
781
782
#ifdef STBI_NEON
783
#include <arm_neon.h>
784
#ifdef _MSC_VER
785
#define STBI_SIMD_ALIGN(type, name) __declspec(align(16)) type name
786
#else
787
#define STBI_SIMD_ALIGN(type, name) type name __attribute__((aligned(16)))
788
#endif
789
#endif
790
791
#ifndef STBI_SIMD_ALIGN
792
#define STBI_SIMD_ALIGN(type, name) type name
793
#endif
794
795
#ifndef STBI_MAX_DIMENSIONS
796
21.2k
#define STBI_MAX_DIMENSIONS (1 << 24)
797
#endif
798
799
///////////////////////////////////////////////
800
//
801
//  stbi__context struct and start_xxx functions
802
803
// stbi__context structure is our basic context used by all images, so it
804
// contains all the IO context, plus some basic image information
805
typedef struct
806
{
807
   stbi__uint32 img_x, img_y;
808
   int img_n, img_out_n;
809
810
   stbi_io_callbacks io;
811
   void *io_user_data;
812
813
   int read_from_callbacks;
814
   int buflen;
815
   stbi_uc buffer_start[128];
816
   int callback_already_read;
817
818
   stbi_uc *img_buffer, *img_buffer_end;
819
   stbi_uc *img_buffer_original, *img_buffer_original_end;
820
} stbi__context;
821
822
823
static void stbi__refill_buffer(stbi__context *s);
824
825
// initialize a memory-decode context
826
static void stbi__start_mem(stbi__context *s, stbi_uc const *buffer, int len)
827
15.5k
{
828
15.5k
   s->io.read = NULL;
829
15.5k
   s->read_from_callbacks = 0;
830
15.5k
   s->callback_already_read = 0;
831
15.5k
   s->img_buffer = s->img_buffer_original = (stbi_uc *) buffer;
832
15.5k
   s->img_buffer_end = s->img_buffer_original_end = (stbi_uc *) buffer+len;
833
15.5k
}
834
835
// initialize a callback-based context
836
static void stbi__start_callbacks(stbi__context *s, stbi_io_callbacks *c, void *user)
837
0
{
838
0
   s->io = *c;
839
0
   s->io_user_data = user;
840
0
   s->buflen = sizeof(s->buffer_start);
841
0
   s->read_from_callbacks = 1;
842
0
   s->callback_already_read = 0;
843
0
   s->img_buffer = s->img_buffer_original = s->buffer_start;
844
0
   stbi__refill_buffer(s);
845
0
   s->img_buffer_original_end = s->img_buffer_end;
846
0
}
847
848
#ifndef STBI_NO_STDIO
849
850
static int stbi__stdio_read(void *user, char *data, int size)
851
0
{
852
0
   return (int) fread(data,1,size,(FILE*) user);
853
0
}
854
855
static void stbi__stdio_skip(void *user, int n)
856
0
{
857
0
   int ch;
858
0
   fseek((FILE*) user, n, SEEK_CUR);
859
0
   ch = fgetc((FILE*) user);  /* have to read a byte to reset feof()'s flag */
860
0
   if (ch != EOF) {
861
0
      ungetc(ch, (FILE *) user);  /* push byte back onto stream if valid. */
862
0
   }
863
0
}
864
865
static int stbi__stdio_eof(void *user)
866
0
{
867
0
   return feof((FILE*) user) || ferror((FILE *) user);
868
0
}
869
870
static stbi_io_callbacks stbi__stdio_callbacks =
871
{
872
   stbi__stdio_read,
873
   stbi__stdio_skip,
874
   stbi__stdio_eof,
875
};
876
877
static void stbi__start_file(stbi__context *s, FILE *f)
878
0
{
879
0
   stbi__start_callbacks(s, &stbi__stdio_callbacks, (void *) f);
880
0
}
881
882
//static void stop_file(stbi__context *s) { }
883
884
#endif // !STBI_NO_STDIO
885
886
static void stbi__rewind(stbi__context *s)
887
77.1k
{
888
   // conceptually rewind SHOULD rewind to the beginning of the stream,
889
   // but we just rewind to the beginning of the initial buffer, because
890
   // we only use it after doing 'test', which only ever looks at at most 92 bytes
891
77.1k
   s->img_buffer = s->img_buffer_original;
892
77.1k
   s->img_buffer_end = s->img_buffer_original_end;
893
77.1k
}
894
895
enum
896
{
897
   STBI_ORDER_RGB,
898
   STBI_ORDER_BGR
899
};
900
901
typedef struct
902
{
903
   int bits_per_channel;
904
   int num_channels;
905
   int channel_order;
906
} stbi__result_info;
907
908
#ifndef STBI_NO_JPEG
909
static int      stbi__jpeg_test(stbi__context *s);
910
static void    *stbi__jpeg_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
911
static int      stbi__jpeg_info(stbi__context *s, int *x, int *y, int *comp);
912
#endif
913
914
#ifndef STBI_NO_PNG
915
static int      stbi__png_test(stbi__context *s);
916
static void    *stbi__png_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
917
static int      stbi__png_info(stbi__context *s, int *x, int *y, int *comp);
918
static int      stbi__png_is16(stbi__context *s);
919
#endif
920
921
#ifndef STBI_NO_BMP
922
static int      stbi__bmp_test(stbi__context *s);
923
static void    *stbi__bmp_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
924
static int      stbi__bmp_info(stbi__context *s, int *x, int *y, int *comp);
925
#endif
926
927
#ifndef STBI_NO_TGA
928
static int      stbi__tga_test(stbi__context *s);
929
static void    *stbi__tga_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
930
static int      stbi__tga_info(stbi__context *s, int *x, int *y, int *comp);
931
#endif
932
933
#ifndef STBI_NO_PSD
934
static int      stbi__psd_test(stbi__context *s);
935
static void    *stbi__psd_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri, int bpc);
936
static int      stbi__psd_info(stbi__context *s, int *x, int *y, int *comp);
937
static int      stbi__psd_is16(stbi__context *s);
938
#endif
939
940
#ifndef STBI_NO_HDR
941
static int      stbi__hdr_test(stbi__context *s);
942
static float   *stbi__hdr_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
943
static int      stbi__hdr_info(stbi__context *s, int *x, int *y, int *comp);
944
#endif
945
946
#ifndef STBI_NO_PIC
947
static int      stbi__pic_test(stbi__context *s);
948
static void    *stbi__pic_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
949
static int      stbi__pic_info(stbi__context *s, int *x, int *y, int *comp);
950
#endif
951
952
#ifndef STBI_NO_GIF
953
static int      stbi__gif_test(stbi__context *s);
954
static void    *stbi__gif_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
955
static void    *stbi__load_gif_main(stbi__context *s, int **delays, int *x, int *y, int *z, int *comp, int req_comp);
956
static int      stbi__gif_info(stbi__context *s, int *x, int *y, int *comp);
957
#endif
958
959
#ifndef STBI_NO_PNM
960
static int      stbi__pnm_test(stbi__context *s);
961
static void    *stbi__pnm_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri);
962
static int      stbi__pnm_info(stbi__context *s, int *x, int *y, int *comp);
963
static int      stbi__pnm_is16(stbi__context *s);
964
#endif
965
966
static
967
#ifdef STBI_THREAD_LOCAL
968
STBI_THREAD_LOCAL
969
#endif
970
const char *stbi__g_failure_reason;
971
972
STBIDEF const char *stbi_failure_reason(void)
973
0
{
974
0
   return stbi__g_failure_reason;
975
0
}
976
977
#ifndef STBI_NO_FAILURE_STRINGS
978
static int stbi__err(const char *str)
979
35.3k
{
980
35.3k
   stbi__g_failure_reason = str;
981
35.3k
   return 0;
982
35.3k
}
983
#endif
984
985
static void *stbi__malloc(size_t size)
986
36.4k
{
987
36.4k
    return STBI_MALLOC(size);
988
36.4k
}
989
990
// stb_image uses ints pervasively, including for offset calculations.
991
// therefore the largest decoded image size we can support with the
992
// current code, even on 64-bit targets, is INT_MAX. this is not a
993
// significant limitation for the intended use case.
994
//
995
// we do, however, need to make sure our size calculations don't
996
// overflow. hence a few helper functions for size calculations that
997
// multiply integers together, making sure that they're non-negative
998
// and no overflow occurs.
999
1000
// return 1 if the sum is valid, 0 on overflow.
1001
// negative terms are considered invalid.
1002
static int stbi__addsizes_valid(int a, int b)
1003
18.9k
{
1004
18.9k
   if (b < 0) return 0;
1005
   // now 0 <= b <= INT_MAX, hence also
1006
   // 0 <= INT_MAX - b <= INTMAX.
1007
   // And "a + b <= INT_MAX" (which might overflow) is the
1008
   // same as a <= INT_MAX - b (no overflow)
1009
18.9k
   return a <= INT_MAX - b;
1010
18.9k
}
1011
1012
// returns 1 if the product is valid, 0 on overflow.
1013
// negative factors are considered invalid.
1014
static int stbi__mul2sizes_valid(int a, int b)
1015
34.1k
{
1016
34.1k
   if (a < 0 || b < 0) return 0;
1017
34.0k
   if (b == 0) return 1; // mul-by-0 is always safe
1018
   // portable way to check for no overflows in a*b
1019
32.8k
   return a <= INT_MAX/b;
1020
34.0k
}
1021
1022
#if !defined(STBI_NO_JPEG) || !defined(STBI_NO_PNG) || !defined(STBI_NO_TGA) || !defined(STBI_NO_HDR)
1023
// returns 1 if "a*b + add" has no negative terms/factors and doesn't overflow
1024
static int stbi__mad2sizes_valid(int a, int b, int add)
1025
5.53k
{
1026
5.53k
   return stbi__mul2sizes_valid(a, b) && stbi__addsizes_valid(a*b, add);
1027
5.53k
}
1028
#endif
1029
1030
// returns 1 if "a*b*c + add" has no negative terms/factors and doesn't overflow
1031
static int stbi__mad3sizes_valid(int a, int b, int c, int add)
1032
11.9k
{
1033
11.9k
   return stbi__mul2sizes_valid(a, b) && stbi__mul2sizes_valid(a*b, c) &&
1034
11.8k
      stbi__addsizes_valid(a*b*c, add);
1035
11.9k
}
1036
1037
// returns 1 if "a*b*c*d + add" has no negative terms/factors and doesn't overflow
1038
#if !defined(STBI_NO_LINEAR) || !defined(STBI_NO_HDR) || !defined(STBI_NO_PNM)
1039
static int stbi__mad4sizes_valid(int a, int b, int c, int d, int add)
1040
1.64k
{
1041
1.64k
   return stbi__mul2sizes_valid(a, b) && stbi__mul2sizes_valid(a*b, c) &&
1042
1.59k
      stbi__mul2sizes_valid(a*b*c, d) && stbi__addsizes_valid(a*b*c*d, add);
1043
1.64k
}
1044
#endif
1045
1046
#if !defined(STBI_NO_JPEG) || !defined(STBI_NO_PNG) || !defined(STBI_NO_TGA) || !defined(STBI_NO_HDR)
1047
// mallocs with size overflow checking
1048
static void *stbi__malloc_mad2(int a, int b, int add)
1049
4.28k
{
1050
4.28k
   if (!stbi__mad2sizes_valid(a, b, add)) return NULL;
1051
4.28k
   return stbi__malloc(a*b + add);
1052
4.28k
}
1053
#endif
1054
1055
static void *stbi__malloc_mad3(int a, int b, int c, int add)
1056
6.65k
{
1057
6.65k
   if (!stbi__mad3sizes_valid(a, b, c, add)) return NULL;
1058
6.65k
   return stbi__malloc(a*b*c + add);
1059
6.65k
}
1060
1061
#if !defined(STBI_NO_LINEAR) || !defined(STBI_NO_HDR) || !defined(STBI_NO_PNM)
1062
static void *stbi__malloc_mad4(int a, int b, int c, int d, int add)
1063
799
{
1064
799
   if (!stbi__mad4sizes_valid(a, b, c, d, add)) return NULL;
1065
799
   return stbi__malloc(a*b*c*d + add);
1066
799
}
1067
#endif
1068
1069
// returns 1 if the sum of two signed ints is valid (between -2^31 and 2^31-1 inclusive), 0 on overflow.
1070
static int stbi__addints_valid(int a, int b)
1071
127M
{
1072
127M
   if ((a >= 0) != (b >= 0)) return 1; // a and b have different signs, so no overflow
1073
127M
   if (a < 0 && b < 0) return a >= INT_MIN - b; // same as a + b >= INT_MIN; INT_MIN - b cannot overflow since b < 0.
1074
117M
   return a <= INT_MAX - b;
1075
127M
}
1076
1077
// returns 1 if the product of two ints fits in a signed short, 0 on overflow.
1078
static int stbi__mul2shorts_valid(int a, int b)
1079
127M
{
1080
127M
   if (b == 0 || b == -1) return 1; // multiplication by 0 is always 0; check for -1 so SHRT_MIN/b doesn't overflow
1081
15.8M
   if ((a >= 0) == (b >= 0)) return a <= SHRT_MAX/b; // product is positive, so similar to mul2sizes_valid
1082
352k
   if (b < 0) return a <= SHRT_MIN / b; // same as a * b >= SHRT_MIN
1083
352k
   return a >= SHRT_MIN / b;
1084
352k
}
1085
1086
// stbi__err - error
1087
// stbi__errpf - error returning pointer to float
1088
// stbi__errpuc - error returning pointer to unsigned char
1089
1090
#ifdef STBI_NO_FAILURE_STRINGS
1091
   #define stbi__err(x,y)  0
1092
#elif defined(STBI_FAILURE_USERMSG)
1093
   #define stbi__err(x,y)  stbi__err(y)
1094
#else
1095
35.3k
   #define stbi__err(x,y)  stbi__err(x)
1096
#endif
1097
1098
165
#define stbi__errpf(x,y)   ((float *)(size_t) (stbi__err(x,y)?NULL:NULL))
1099
5.63k
#define stbi__errpuc(x,y)  ((unsigned char *)(size_t) (stbi__err(x,y)?NULL:NULL))
1100
1101
STBIDEF void stbi_image_free(void *retval_from_stbi_load)
1102
0
{
1103
0
   STBI_FREE(retval_from_stbi_load);
1104
0
}
1105
1106
#ifndef STBI_NO_LINEAR
1107
static float   *stbi__ldr_to_hdr(stbi_uc *data, int x, int y, int comp);
1108
#endif
1109
1110
#ifndef STBI_NO_HDR
1111
static stbi_uc *stbi__hdr_to_ldr(float   *data, int x, int y, int comp);
1112
#endif
1113
1114
static int stbi__vertically_flip_on_load_global = 0;
1115
1116
STBIDEF void stbi_set_flip_vertically_on_load(int flag_true_if_should_flip)
1117
0
{
1118
0
   stbi__vertically_flip_on_load_global = flag_true_if_should_flip;
1119
0
}
1120
1121
#ifndef STBI_THREAD_LOCAL
1122
#define stbi__vertically_flip_on_load  stbi__vertically_flip_on_load_global
1123
#else
1124
static STBI_THREAD_LOCAL int stbi__vertically_flip_on_load_local, stbi__vertically_flip_on_load_set;
1125
1126
STBIDEF void stbi_set_flip_vertically_on_load_thread(int flag_true_if_should_flip)
1127
0
{
1128
0
   stbi__vertically_flip_on_load_local = flag_true_if_should_flip;
1129
0
   stbi__vertically_flip_on_load_set = 1;
1130
0
}
1131
1132
3.26k
#define stbi__vertically_flip_on_load  (stbi__vertically_flip_on_load_set       \
1133
3.26k
                                         ? stbi__vertically_flip_on_load_local  \
1134
3.26k
                                         : stbi__vertically_flip_on_load_global)
1135
#endif // STBI_THREAD_LOCAL
1136
1137
static void *stbi__load_main(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri, int bpc)
1138
6.48k
{
1139
6.48k
   memset(ri, 0, sizeof(*ri)); // make sure it's initialized if we add new fields
1140
6.48k
   ri->bits_per_channel = 8; // default is 8 so most paths don't have to be changed
1141
6.48k
   ri->channel_order = STBI_ORDER_RGB; // all current input & output are this, but this is here so we can add BGR order
1142
6.48k
   ri->num_channels = 0;
1143
1144
   // test the formats with a very explicit header first (at least a FOURCC
1145
   // or distinctive magic number first)
1146
6.48k
   #ifndef STBI_NO_PNG
1147
6.48k
   if (stbi__png_test(s))  return stbi__png_load(s,x,y,comp,req_comp, ri);
1148
5.02k
   #endif
1149
   #ifndef STBI_NO_BMP
1150
5.02k
   if (stbi__bmp_test(s))  return stbi__bmp_load(s,x,y,comp,req_comp, ri);
1151
3.90k
   #endif
1152
   #ifndef STBI_NO_GIF
1153
3.90k
   if (stbi__gif_test(s))  return stbi__gif_load(s,x,y,comp,req_comp, ri);
1154
3.46k
   #endif
1155
   #ifndef STBI_NO_PSD
1156
3.46k
   if (stbi__psd_test(s))  return stbi__psd_load(s,x,y,comp,req_comp, ri, bpc);
1157
   #else
1158
0
   STBI_NOTUSED(bpc);
1159
0
   #endif
1160
   #ifndef STBI_NO_PIC
1161
3.15k
   if (stbi__pic_test(s))  return stbi__pic_load(s,x,y,comp,req_comp, ri);
1162
2.92k
   #endif
1163
1164
   // then the formats that can end up attempting to load with just 1 or 2
1165
   // bytes matching expectations; these are prone to false positives, so
1166
   // try them later
1167
   #ifndef STBI_NO_JPEG
1168
2.92k
   if (stbi__jpeg_test(s)) return stbi__jpeg_load(s,x,y,comp,req_comp, ri);
1169
1.10k
   #endif
1170
   #ifndef STBI_NO_PNM
1171
1.10k
   if (stbi__pnm_test(s))  return stbi__pnm_load(s,x,y,comp,req_comp, ri);
1172
792
   #endif
1173
1174
   #ifndef STBI_NO_HDR
1175
792
   if (stbi__hdr_test(s)) {
1176
547
      float *hdr = stbi__hdr_load(s, x,y,comp,req_comp, ri);
1177
547
      return stbi__hdr_to_ldr(hdr, *x, *y, req_comp ? req_comp : *comp);
1178
547
   }
1179
245
   #endif
1180
1181
   #ifndef STBI_NO_TGA
1182
   // test tga last because it's a crappy test!
1183
245
   if (stbi__tga_test(s))
1184
232
      return stbi__tga_load(s,x,y,comp,req_comp, ri);
1185
13
   #endif
1186
1187
13
   return stbi__errpuc("unknown image type", "Image not of any known type, or corrupt");
1188
245
}
stbi_read_fuzzer.c:stbi__load_main(stbi__context*, int*, int*, int*, int, stbi__result_info*, int)
Line
Count
Source
1138
6.21k
{
1139
6.21k
   memset(ri, 0, sizeof(*ri)); // make sure it's initialized if we add new fields
1140
6.21k
   ri->bits_per_channel = 8; // default is 8 so most paths don't have to be changed
1141
6.21k
   ri->channel_order = STBI_ORDER_RGB; // all current input & output are this, but this is here so we can add BGR order
1142
6.21k
   ri->num_channels = 0;
1143
1144
   // test the formats with a very explicit header first (at least a FOURCC
1145
   // or distinctive magic number first)
1146
6.21k
   #ifndef STBI_NO_PNG
1147
6.21k
   if (stbi__png_test(s))  return stbi__png_load(s,x,y,comp,req_comp, ri);
1148
5.02k
   #endif
1149
5.02k
   #ifndef STBI_NO_BMP
1150
5.02k
   if (stbi__bmp_test(s))  return stbi__bmp_load(s,x,y,comp,req_comp, ri);
1151
3.90k
   #endif
1152
3.90k
   #ifndef STBI_NO_GIF
1153
3.90k
   if (stbi__gif_test(s))  return stbi__gif_load(s,x,y,comp,req_comp, ri);
1154
3.46k
   #endif
1155
3.46k
   #ifndef STBI_NO_PSD
1156
3.46k
   if (stbi__psd_test(s))  return stbi__psd_load(s,x,y,comp,req_comp, ri, bpc);
1157
   #else
1158
   STBI_NOTUSED(bpc);
1159
   #endif
1160
3.15k
   #ifndef STBI_NO_PIC
1161
3.15k
   if (stbi__pic_test(s))  return stbi__pic_load(s,x,y,comp,req_comp, ri);
1162
2.92k
   #endif
1163
1164
   // then the formats that can end up attempting to load with just 1 or 2
1165
   // bytes matching expectations; these are prone to false positives, so
1166
   // try them later
1167
2.92k
   #ifndef STBI_NO_JPEG
1168
2.92k
   if (stbi__jpeg_test(s)) return stbi__jpeg_load(s,x,y,comp,req_comp, ri);
1169
1.10k
   #endif
1170
1.10k
   #ifndef STBI_NO_PNM
1171
1.10k
   if (stbi__pnm_test(s))  return stbi__pnm_load(s,x,y,comp,req_comp, ri);
1172
792
   #endif
1173
1174
792
   #ifndef STBI_NO_HDR
1175
792
   if (stbi__hdr_test(s)) {
1176
547
      float *hdr = stbi__hdr_load(s, x,y,comp,req_comp, ri);
1177
547
      return stbi__hdr_to_ldr(hdr, *x, *y, req_comp ? req_comp : *comp);
1178
547
   }
1179
245
   #endif
1180
1181
245
   #ifndef STBI_NO_TGA
1182
   // test tga last because it's a crappy test!
1183
245
   if (stbi__tga_test(s))
1184
232
      return stbi__tga_load(s,x,y,comp,req_comp, ri);
1185
13
   #endif
1186
1187
13
   return stbi__errpuc("unknown image type", "Image not of any known type, or corrupt");
1188
245
}
stbi_read_fuzzer.c:stbi__load_main(stbi__context*, int*, int*, int*, int, stbi__result_info*, int)
Line
Count
Source
1138
267
{
1139
267
   memset(ri, 0, sizeof(*ri)); // make sure it's initialized if we add new fields
1140
267
   ri->bits_per_channel = 8; // default is 8 so most paths don't have to be changed
1141
267
   ri->channel_order = STBI_ORDER_RGB; // all current input & output are this, but this is here so we can add BGR order
1142
267
   ri->num_channels = 0;
1143
1144
   // test the formats with a very explicit header first (at least a FOURCC
1145
   // or distinctive magic number first)
1146
267
   #ifndef STBI_NO_PNG
1147
267
   if (stbi__png_test(s))  return stbi__png_load(s,x,y,comp,req_comp, ri);
1148
0
   #endif
1149
   #ifndef STBI_NO_BMP
1150
   if (stbi__bmp_test(s))  return stbi__bmp_load(s,x,y,comp,req_comp, ri);
1151
   #endif
1152
   #ifndef STBI_NO_GIF
1153
   if (stbi__gif_test(s))  return stbi__gif_load(s,x,y,comp,req_comp, ri);
1154
   #endif
1155
   #ifndef STBI_NO_PSD
1156
   if (stbi__psd_test(s))  return stbi__psd_load(s,x,y,comp,req_comp, ri, bpc);
1157
   #else
1158
0
   STBI_NOTUSED(bpc);
1159
0
   #endif
1160
   #ifndef STBI_NO_PIC
1161
   if (stbi__pic_test(s))  return stbi__pic_load(s,x,y,comp,req_comp, ri);
1162
   #endif
1163
1164
   // then the formats that can end up attempting to load with just 1 or 2
1165
   // bytes matching expectations; these are prone to false positives, so
1166
   // try them later
1167
   #ifndef STBI_NO_JPEG
1168
   if (stbi__jpeg_test(s)) return stbi__jpeg_load(s,x,y,comp,req_comp, ri);
1169
   #endif
1170
   #ifndef STBI_NO_PNM
1171
   if (stbi__pnm_test(s))  return stbi__pnm_load(s,x,y,comp,req_comp, ri);
1172
   #endif
1173
1174
   #ifndef STBI_NO_HDR
1175
   if (stbi__hdr_test(s)) {
1176
      float *hdr = stbi__hdr_load(s, x,y,comp,req_comp, ri);
1177
      return stbi__hdr_to_ldr(hdr, *x, *y, req_comp ? req_comp : *comp);
1178
   }
1179
   #endif
1180
1181
   #ifndef STBI_NO_TGA
1182
   // test tga last because it's a crappy test!
1183
   if (stbi__tga_test(s))
1184
      return stbi__tga_load(s,x,y,comp,req_comp, ri);
1185
   #endif
1186
1187
   return stbi__errpuc("unknown image type", "Image not of any known type, or corrupt");
1188
267
}
1189
1190
static stbi_uc *stbi__convert_16_to_8(stbi__uint16 *orig, int w, int h, int channels)
1191
177
{
1192
177
   int i;
1193
177
   int img_len = w * h * channels;
1194
177
   stbi_uc *reduced;
1195
1196
177
   reduced = (stbi_uc *) stbi__malloc(img_len);
1197
177
   if (reduced == NULL) return stbi__errpuc("outofmem", "Out of memory");
1198
1199
989M
   for (i = 0; i < img_len; ++i)
1200
989M
      reduced[i] = (stbi_uc)((orig[i] >> 8) & 0xFF); // top half of each byte is sufficient approx of 16->8 bit scaling
1201
1202
177
   STBI_FREE(orig);
1203
177
   return reduced;
1204
177
}
1205
1206
static stbi__uint16 *stbi__convert_8_to_16(stbi_uc *orig, int w, int h, int channels)
1207
0
{
1208
0
   int i;
1209
0
   int img_len = w * h * channels;
1210
0
   stbi__uint16 *enlarged;
1211
1212
0
   enlarged = (stbi__uint16 *) stbi__malloc(img_len*2);
1213
0
   if (enlarged == NULL) return (stbi__uint16 *) stbi__errpuc("outofmem", "Out of memory");
1214
1215
0
   for (i = 0; i < img_len; ++i)
1216
0
      enlarged[i] = (stbi__uint16)((orig[i] << 8) + orig[i]); // replicate to high and low byte, maps 0->0, 255->0xffff
1217
1218
0
   STBI_FREE(orig);
1219
0
   return enlarged;
1220
0
}
1221
1222
static void stbi__vertical_flip(void *image, int w, int h, int bytes_per_pixel)
1223
0
{
1224
0
   int row;
1225
0
   size_t bytes_per_row = (size_t)w * bytes_per_pixel;
1226
0
   stbi_uc temp[2048];
1227
0
   stbi_uc *bytes = (stbi_uc *)image;
1228
1229
0
   for (row = 0; row < (h>>1); row++) {
1230
0
      stbi_uc *row0 = bytes + row*bytes_per_row;
1231
0
      stbi_uc *row1 = bytes + (h - row - 1)*bytes_per_row;
1232
      // swap row0 with row1
1233
0
      size_t bytes_left = bytes_per_row;
1234
0
      while (bytes_left) {
1235
0
         size_t bytes_copy = (bytes_left < sizeof(temp)) ? bytes_left : sizeof(temp);
1236
0
         memcpy(temp, row0, bytes_copy);
1237
0
         memcpy(row0, row1, bytes_copy);
1238
0
         memcpy(row1, temp, bytes_copy);
1239
0
         row0 += bytes_copy;
1240
0
         row1 += bytes_copy;
1241
0
         bytes_left -= bytes_copy;
1242
0
      }
1243
0
   }
1244
0
}
1245
1246
#ifndef STBI_NO_GIF
1247
static void stbi__vertical_flip_slices(void *image, int w, int h, int z, int bytes_per_pixel)
1248
0
{
1249
0
   int slice;
1250
0
   int slice_size = w * h * bytes_per_pixel;
1251
1252
0
   stbi_uc *bytes = (stbi_uc *)image;
1253
0
   for (slice = 0; slice < z; ++slice) {
1254
0
      stbi__vertical_flip(bytes, w, h, bytes_per_pixel);
1255
0
      bytes += slice_size;
1256
0
   }
1257
0
}
1258
#endif
1259
1260
static unsigned char *stbi__load_and_postprocess_8bit(stbi__context *s, int *x, int *y, int *comp, int req_comp)
1261
6.48k
{
1262
6.48k
   stbi__result_info ri;
1263
6.48k
   void *result = stbi__load_main(s, x, y, comp, req_comp, &ri, 8);
1264
1265
6.48k
   if (result == NULL)
1266
3.22k
      return NULL;
1267
1268
   // it is the responsibility of the loaders to make sure we get either 8 or 16 bit.
1269
3.26k
   STBI_ASSERT(ri.bits_per_channel == 8 || ri.bits_per_channel == 16);
1270
1271
3.26k
   if (ri.bits_per_channel != 8) {
1272
177
      result = stbi__convert_16_to_8((stbi__uint16 *) result, *x, *y, req_comp == 0 ? *comp : req_comp);
1273
177
      ri.bits_per_channel = 8;
1274
177
   }
1275
1276
   // @TODO: move stbi__convert_format to here
1277
1278
3.26k
   if (stbi__vertically_flip_on_load) {
1279
0
      int channels = req_comp ? req_comp : *comp;
1280
0
      stbi__vertical_flip(result, *x, *y, channels * sizeof(stbi_uc));
1281
0
   }
1282
1283
3.26k
   return (unsigned char *) result;
1284
3.26k
}
1285
1286
static stbi__uint16 *stbi__load_and_postprocess_16bit(stbi__context *s, int *x, int *y, int *comp, int req_comp)
1287
0
{
1288
0
   stbi__result_info ri;
1289
0
   void *result = stbi__load_main(s, x, y, comp, req_comp, &ri, 16);
1290
1291
0
   if (result == NULL)
1292
0
      return NULL;
1293
1294
   // it is the responsibility of the loaders to make sure we get either 8 or 16 bit.
1295
0
   STBI_ASSERT(ri.bits_per_channel == 8 || ri.bits_per_channel == 16);
1296
1297
0
   if (ri.bits_per_channel != 16) {
1298
0
      result = stbi__convert_8_to_16((stbi_uc *) result, *x, *y, req_comp == 0 ? *comp : req_comp);
1299
0
      ri.bits_per_channel = 16;
1300
0
   }
1301
1302
   // @TODO: move stbi__convert_format16 to here
1303
   // @TODO: special case RGB-to-Y (and RGBA-to-YA) for 8-bit-to-16-bit case to keep more precision
1304
1305
0
   if (stbi__vertically_flip_on_load) {
1306
0
      int channels = req_comp ? req_comp : *comp;
1307
0
      stbi__vertical_flip(result, *x, *y, channels * sizeof(stbi__uint16));
1308
0
   }
1309
1310
0
   return (stbi__uint16 *) result;
1311
0
}
1312
1313
#if !defined(STBI_NO_HDR) && !defined(STBI_NO_LINEAR)
1314
static void stbi__float_postprocess(float *result, int *x, int *y, int *comp, int req_comp)
1315
0
{
1316
0
   if (stbi__vertically_flip_on_load && result != NULL) {
1317
0
      int channels = req_comp ? req_comp : *comp;
1318
0
      stbi__vertical_flip(result, *x, *y, channels * sizeof(float));
1319
0
   }
1320
0
}
1321
#endif
1322
1323
#ifndef STBI_NO_STDIO
1324
1325
#if defined(_WIN32) && defined(STBI_WINDOWS_UTF8)
1326
STBI_EXTERN __declspec(dllimport) int __stdcall MultiByteToWideChar(unsigned int cp, unsigned long flags, const char *str, int cbmb, wchar_t *widestr, int cchwide);
1327
STBI_EXTERN __declspec(dllimport) int __stdcall WideCharToMultiByte(unsigned int cp, unsigned long flags, const wchar_t *widestr, int cchwide, char *str, int cbmb, const char *defchar, int *used_default);
1328
#endif
1329
1330
#if defined(_WIN32) && defined(STBI_WINDOWS_UTF8)
1331
STBIDEF int stbi_convert_wchar_to_utf8(char *buffer, size_t bufferlen, const wchar_t* input)
1332
{
1333
  return WideCharToMultiByte(65001 /* UTF8 */, 0, input, -1, buffer, (int) bufferlen, NULL, NULL);
1334
}
1335
#endif
1336
1337
static FILE *stbi__fopen(char const *filename, char const *mode)
1338
0
{
1339
0
   FILE *f;
1340
#if defined(_WIN32) && defined(STBI_WINDOWS_UTF8)
1341
   wchar_t wMode[64];
1342
   wchar_t wFilename[1024];
1343
  if (0 == MultiByteToWideChar(65001 /* UTF8 */, 0, filename, -1, wFilename, sizeof(wFilename)/sizeof(*wFilename)))
1344
      return 0;
1345
1346
  if (0 == MultiByteToWideChar(65001 /* UTF8 */, 0, mode, -1, wMode, sizeof(wMode)/sizeof(*wMode)))
1347
      return 0;
1348
1349
#if defined(_MSC_VER) && _MSC_VER >= 1400
1350
  if (0 != _wfopen_s(&f, wFilename, wMode))
1351
    f = 0;
1352
#else
1353
   f = _wfopen(wFilename, wMode);
1354
#endif
1355
1356
#elif defined(_MSC_VER) && _MSC_VER >= 1400
1357
   if (0 != fopen_s(&f, filename, mode))
1358
      f=0;
1359
#else
1360
0
   f = fopen(filename, mode);
1361
0
#endif
1362
0
   return f;
1363
0
}
1364
1365
1366
STBIDEF stbi_uc *stbi_load(char const *filename, int *x, int *y, int *comp, int req_comp)
1367
0
{
1368
0
   FILE *f = stbi__fopen(filename, "rb");
1369
0
   unsigned char *result;
1370
0
   if (!f) return stbi__errpuc("can't fopen", "Unable to open file");
1371
0
   result = stbi_load_from_file(f,x,y,comp,req_comp);
1372
0
   fclose(f);
1373
0
   return result;
1374
0
}
1375
1376
STBIDEF stbi_uc *stbi_load_from_file(FILE *f, int *x, int *y, int *comp, int req_comp)
1377
0
{
1378
0
   unsigned char *result;
1379
0
   stbi__context s;
1380
0
   stbi__start_file(&s,f);
1381
0
   result = stbi__load_and_postprocess_8bit(&s,x,y,comp,req_comp);
1382
0
   if (result) {
1383
      // need to 'unget' all the characters in the IO buffer
1384
0
      fseek(f, - (int) (s.img_buffer_end - s.img_buffer), SEEK_CUR);
1385
0
   }
1386
0
   return result;
1387
0
}
1388
1389
STBIDEF stbi__uint16 *stbi_load_from_file_16(FILE *f, int *x, int *y, int *comp, int req_comp)
1390
0
{
1391
0
   stbi__uint16 *result;
1392
0
   stbi__context s;
1393
0
   stbi__start_file(&s,f);
1394
0
   result = stbi__load_and_postprocess_16bit(&s,x,y,comp,req_comp);
1395
0
   if (result) {
1396
      // need to 'unget' all the characters in the IO buffer
1397
0
      fseek(f, - (int) (s.img_buffer_end - s.img_buffer), SEEK_CUR);
1398
0
   }
1399
0
   return result;
1400
0
}
1401
1402
STBIDEF stbi_us *stbi_load_16(char const *filename, int *x, int *y, int *comp, int req_comp)
1403
0
{
1404
0
   FILE *f = stbi__fopen(filename, "rb");
1405
0
   stbi__uint16 *result;
1406
0
   if (!f) return (stbi_us *) stbi__errpuc("can't fopen", "Unable to open file");
1407
0
   result = stbi_load_from_file_16(f,x,y,comp,req_comp);
1408
0
   fclose(f);
1409
0
   return result;
1410
0
}
1411
1412
1413
#endif //!STBI_NO_STDIO
1414
1415
STBIDEF stbi_us *stbi_load_16_from_memory(stbi_uc const *buffer, int len, int *x, int *y, int *channels_in_file, int desired_channels)
1416
0
{
1417
0
   stbi__context s;
1418
0
   stbi__start_mem(&s,buffer,len);
1419
0
   return stbi__load_and_postprocess_16bit(&s,x,y,channels_in_file,desired_channels);
1420
0
}
1421
1422
STBIDEF stbi_us *stbi_load_16_from_callbacks(stbi_io_callbacks const *clbk, void *user, int *x, int *y, int *channels_in_file, int desired_channels)
1423
0
{
1424
0
   stbi__context s;
1425
0
   stbi__start_callbacks(&s, (stbi_io_callbacks *)clbk, user);
1426
0
   return stbi__load_and_postprocess_16bit(&s,x,y,channels_in_file,desired_channels);
1427
0
}
1428
1429
STBIDEF stbi_uc *stbi_load_from_memory(stbi_uc const *buffer, int len, int *x, int *y, int *comp, int req_comp)
1430
6.48k
{
1431
6.48k
   stbi__context s;
1432
6.48k
   stbi__start_mem(&s,buffer,len);
1433
6.48k
   return stbi__load_and_postprocess_8bit(&s,x,y,comp,req_comp);
1434
6.48k
}
1435
1436
STBIDEF stbi_uc *stbi_load_from_callbacks(stbi_io_callbacks const *clbk, void *user, int *x, int *y, int *comp, int req_comp)
1437
0
{
1438
0
   stbi__context s;
1439
0
   stbi__start_callbacks(&s, (stbi_io_callbacks *) clbk, user);
1440
0
   return stbi__load_and_postprocess_8bit(&s,x,y,comp,req_comp);
1441
0
}
1442
1443
#ifndef STBI_NO_GIF
1444
STBIDEF stbi_uc *stbi_load_gif_from_memory(stbi_uc const *buffer, int len, int **delays, int *x, int *y, int *z, int *comp, int req_comp)
1445
0
{
1446
0
   unsigned char *result;
1447
0
   stbi__context s;
1448
0
   stbi__start_mem(&s,buffer,len);
1449
1450
0
   result = (unsigned char*) stbi__load_gif_main(&s, delays, x, y, z, comp, req_comp);
1451
0
   if (stbi__vertically_flip_on_load) {
1452
0
      stbi__vertical_flip_slices( result, *x, *y, *z, *comp );
1453
0
   }
1454
1455
0
   return result;
1456
0
}
1457
#endif
1458
1459
#ifndef STBI_NO_LINEAR
1460
static float *stbi__loadf_main(stbi__context *s, int *x, int *y, int *comp, int req_comp)
1461
0
{
1462
0
   unsigned char *data;
1463
0
   #ifndef STBI_NO_HDR
1464
0
   if (stbi__hdr_test(s)) {
1465
0
      stbi__result_info ri;
1466
0
      float *hdr_data = stbi__hdr_load(s,x,y,comp,req_comp, &ri);
1467
0
      if (hdr_data)
1468
0
         stbi__float_postprocess(hdr_data,x,y,comp,req_comp);
1469
0
      return hdr_data;
1470
0
   }
1471
0
   #endif
1472
0
   data = stbi__load_and_postprocess_8bit(s, x, y, comp, req_comp);
1473
0
   if (data)
1474
0
      return stbi__ldr_to_hdr(data, *x, *y, req_comp ? req_comp : *comp);
1475
0
   return stbi__errpf("unknown image type", "Image not of any known type, or corrupt");
1476
0
}
1477
1478
STBIDEF float *stbi_loadf_from_memory(stbi_uc const *buffer, int len, int *x, int *y, int *comp, int req_comp)
1479
0
{
1480
0
   stbi__context s;
1481
0
   stbi__start_mem(&s,buffer,len);
1482
0
   return stbi__loadf_main(&s,x,y,comp,req_comp);
1483
0
}
1484
1485
STBIDEF float *stbi_loadf_from_callbacks(stbi_io_callbacks const *clbk, void *user, int *x, int *y, int *comp, int req_comp)
1486
0
{
1487
0
   stbi__context s;
1488
0
   stbi__start_callbacks(&s, (stbi_io_callbacks *) clbk, user);
1489
0
   return stbi__loadf_main(&s,x,y,comp,req_comp);
1490
0
}
1491
1492
#ifndef STBI_NO_STDIO
1493
STBIDEF float *stbi_loadf(char const *filename, int *x, int *y, int *comp, int req_comp)
1494
0
{
1495
0
   float *result;
1496
0
   FILE *f = stbi__fopen(filename, "rb");
1497
0
   if (!f) return stbi__errpf("can't fopen", "Unable to open file");
1498
0
   result = stbi_loadf_from_file(f,x,y,comp,req_comp);
1499
0
   fclose(f);
1500
0
   return result;
1501
0
}
1502
1503
STBIDEF float *stbi_loadf_from_file(FILE *f, int *x, int *y, int *comp, int req_comp)
1504
0
{
1505
0
   stbi__context s;
1506
0
   stbi__start_file(&s,f);
1507
0
   return stbi__loadf_main(&s,x,y,comp,req_comp);
1508
0
}
1509
#endif // !STBI_NO_STDIO
1510
1511
#endif // !STBI_NO_LINEAR
1512
1513
// these is-hdr-or-not is defined independent of whether STBI_NO_LINEAR is
1514
// defined, for API simplicity; if STBI_NO_LINEAR is defined, it always
1515
// reports false!
1516
1517
STBIDEF int stbi_is_hdr_from_memory(stbi_uc const *buffer, int len)
1518
0
{
1519
   #ifndef STBI_NO_HDR
1520
   stbi__context s;
1521
   stbi__start_mem(&s,buffer,len);
1522
   return stbi__hdr_test(&s);
1523
   #else
1524
0
   STBI_NOTUSED(buffer);
1525
0
   STBI_NOTUSED(len);
1526
   return 0;
1527
   #endif
1528
0
}
Unexecuted instantiation: stbi_is_hdr_from_memory
Unexecuted instantiation: stbi_is_hdr_from_memory
1529
1530
#ifndef STBI_NO_STDIO
1531
STBIDEF int      stbi_is_hdr          (char const *filename)
1532
0
{
1533
0
   FILE *f = stbi__fopen(filename, "rb");
1534
0
   int result=0;
1535
0
   if (f) {
1536
0
      result = stbi_is_hdr_from_file(f);
1537
0
      fclose(f);
1538
0
   }
1539
0
   return result;
1540
0
}
1541
1542
STBIDEF int stbi_is_hdr_from_file(FILE *f)
1543
0
{
1544
   #ifndef STBI_NO_HDR
1545
   long pos = ftell(f);
1546
   int res;
1547
   stbi__context s;
1548
   stbi__start_file(&s,f);
1549
   res = stbi__hdr_test(&s);
1550
   fseek(f, pos, SEEK_SET);
1551
   return res;
1552
   #else
1553
0
   STBI_NOTUSED(f);
1554
   return 0;
1555
   #endif
1556
0
}
Unexecuted instantiation: stbi_is_hdr_from_file
Unexecuted instantiation: stbi_is_hdr_from_file
1557
#endif // !STBI_NO_STDIO
1558
1559
STBIDEF int      stbi_is_hdr_from_callbacks(stbi_io_callbacks const *clbk, void *user)
1560
0
{
1561
   #ifndef STBI_NO_HDR
1562
   stbi__context s;
1563
   stbi__start_callbacks(&s, (stbi_io_callbacks *) clbk, user);
1564
   return stbi__hdr_test(&s);
1565
   #else
1566
0
   STBI_NOTUSED(clbk);
1567
0
   STBI_NOTUSED(user);
1568
   return 0;
1569
   #endif
1570
0
}
Unexecuted instantiation: stbi_is_hdr_from_callbacks
Unexecuted instantiation: stbi_is_hdr_from_callbacks
1571
1572
#ifndef STBI_NO_LINEAR
1573
static float stbi__l2h_gamma=2.2f, stbi__l2h_scale=1.0f;
1574
1575
0
STBIDEF void   stbi_ldr_to_hdr_gamma(float gamma) { stbi__l2h_gamma = gamma; }
1576
0
STBIDEF void   stbi_ldr_to_hdr_scale(float scale) { stbi__l2h_scale = scale; }
1577
#endif
1578
1579
static float stbi__h2l_gamma_i=1.0f/2.2f, stbi__h2l_scale_i=1.0f;
1580
1581
0
STBIDEF void   stbi_hdr_to_ldr_gamma(float gamma) { stbi__h2l_gamma_i = 1/gamma; }
1582
0
STBIDEF void   stbi_hdr_to_ldr_scale(float scale) { stbi__h2l_scale_i = 1/scale; }
1583
1584
1585
//////////////////////////////////////////////////////////////////////////////
1586
//
1587
// Common code used by all image loaders
1588
//
1589
1590
enum
1591
{
1592
   STBI__SCAN_load=0,
1593
   STBI__SCAN_type,
1594
   STBI__SCAN_header
1595
};
1596
1597
static void stbi__refill_buffer(stbi__context *s)
1598
0
{
1599
0
   int n = (s->io.read)(s->io_user_data,(char*)s->buffer_start,s->buflen);
1600
0
   s->callback_already_read += (int) (s->img_buffer - s->img_buffer_original);
1601
0
   if (n == 0) {
1602
      // at end of file, treat same as if from memory, but need to handle case
1603
      // where s->img_buffer isn't pointing to safe memory, e.g. 0-byte file
1604
0
      s->read_from_callbacks = 0;
1605
0
      s->img_buffer = s->buffer_start;
1606
0
      s->img_buffer_end = s->buffer_start+1;
1607
0
      *s->img_buffer = 0;
1608
0
   } else {
1609
0
      s->img_buffer = s->buffer_start;
1610
0
      s->img_buffer_end = s->buffer_start + n;
1611
0
   }
1612
0
}
1613
1614
stbi_inline static stbi_uc stbi__get8(stbi__context *s)
1615
7.57G
{
1616
7.57G
   if (s->img_buffer < s->img_buffer_end)
1617
1.81M
      return *s->img_buffer++;
1618
7.57G
   if (s->read_from_callbacks) {
1619
0
      stbi__refill_buffer(s);
1620
0
      return *s->img_buffer++;
1621
0
   }
1622
7.57G
   return 0;
1623
7.57G
}
1624
1625
#if defined(STBI_NO_JPEG) && defined(STBI_NO_HDR) && defined(STBI_NO_PIC) && defined(STBI_NO_PNM)
1626
// nothing
1627
#else
1628
stbi_inline static int stbi__at_eof(stbi__context *s)
1629
368k
{
1630
368k
   if (s->io.read) {
1631
0
      if (!(s->io.eof)(s->io_user_data)) return 0;
1632
      // if feof() is true, check if buffer = end
1633
      // special case: we've only got the special 0 character at the end
1634
0
      if (s->read_from_callbacks == 0) return 1;
1635
0
   }
1636
1637
368k
   return s->img_buffer >= s->img_buffer_end;
1638
368k
}
1639
#endif
1640
1641
#if defined(STBI_NO_JPEG) && defined(STBI_NO_PNG) && defined(STBI_NO_BMP) && defined(STBI_NO_PSD) && defined(STBI_NO_TGA) && defined(STBI_NO_GIF) && defined(STBI_NO_PIC)
1642
// nothing
1643
#else
1644
static void stbi__skip(stbi__context *s, int n)
1645
715M
{
1646
715M
   if (n == 0) return;  // already there!
1647
64.3M
   if (n < 0) {
1648
221
      s->img_buffer = s->img_buffer_end;
1649
221
      return;
1650
221
   }
1651
64.3M
   if (s->io.read) {
1652
0
      int blen = (int) (s->img_buffer_end - s->img_buffer);
1653
0
      if (blen < n) {
1654
0
         s->img_buffer = s->img_buffer_end;
1655
0
         (s->io.skip)(s->io_user_data, n - blen);
1656
0
         return;
1657
0
      }
1658
0
   }
1659
64.3M
   s->img_buffer += n;
1660
64.3M
}
1661
#endif
1662
1663
#if defined(STBI_NO_PNG) && defined(STBI_NO_TGA) && defined(STBI_NO_HDR) && defined(STBI_NO_PNM)
1664
// nothing
1665
#else
1666
static int stbi__getn(stbi__context *s, stbi_uc *buffer, int n)
1667
235M
{
1668
235M
   if (s->io.read) {
1669
0
      int blen = (int) (s->img_buffer_end - s->img_buffer);
1670
0
      if (blen < n) {
1671
0
         int res, count;
1672
1673
0
         memcpy(buffer, s->img_buffer, blen);
1674
1675
0
         count = (s->io.read)(s->io_user_data, (char*) buffer + blen, n - blen);
1676
0
         res = (count == (n-blen));
1677
0
         s->img_buffer = s->img_buffer_end;
1678
0
         return res;
1679
0
      }
1680
0
   }
1681
1682
235M
   if (s->img_buffer+n <= s->img_buffer_end) {
1683
4.73k
      memcpy(buffer, s->img_buffer, n);
1684
4.73k
      s->img_buffer += n;
1685
4.73k
      return 1;
1686
4.73k
   } else
1687
235M
      return 0;
1688
235M
}
1689
#endif
1690
1691
#if defined(STBI_NO_JPEG) && defined(STBI_NO_PNG) && defined(STBI_NO_PSD) && defined(STBI_NO_PIC)
1692
// nothing
1693
#else
1694
static int stbi__get16be(stbi__context *s)
1695
145M
{
1696
145M
   int z = stbi__get8(s);
1697
145M
   return (z << 8) + stbi__get8(s);
1698
145M
}
1699
#endif
1700
1701
#if defined(STBI_NO_PNG) && defined(STBI_NO_PSD) && defined(STBI_NO_PIC)
1702
// nothing
1703
#else
1704
static stbi__uint32 stbi__get32be(stbi__context *s)
1705
63.7k
{
1706
63.7k
   stbi__uint32 z = stbi__get16be(s);
1707
63.7k
   return (z << 16) + stbi__get16be(s);
1708
63.7k
}
1709
#endif
1710
1711
#if defined(STBI_NO_BMP) && defined(STBI_NO_TGA) && defined(STBI_NO_GIF)
1712
// nothing
1713
#else
1714
static int stbi__get16le(stbi__context *s)
1715
1.30G
{
1716
1.30G
   int z = stbi__get8(s);
1717
1.30G
   return z + (stbi__get8(s) << 8);
1718
1.30G
}
1719
#endif
1720
1721
#ifndef STBI_NO_BMP
1722
static stbi__uint32 stbi__get32le(stbi__context *s)
1723
323M
{
1724
323M
   stbi__uint32 z = stbi__get16le(s);
1725
323M
   z += (stbi__uint32)stbi__get16le(s) << 16;
1726
323M
   return z;
1727
323M
}
1728
#endif
1729
1730
3.11G
#define STBI__BYTECAST(x)  ((stbi_uc) ((x) & 255))  // truncate int to byte without warnings
1731
1732
#if defined(STBI_NO_JPEG) && defined(STBI_NO_PNG) && defined(STBI_NO_BMP) && defined(STBI_NO_PSD) && defined(STBI_NO_TGA) && defined(STBI_NO_GIF) && defined(STBI_NO_PIC) && defined(STBI_NO_PNM)
1733
// nothing
1734
#else
1735
//////////////////////////////////////////////////////////////////////////////
1736
//
1737
//  generic converter from built-in img_n to req_comp
1738
//    individual types do this automatically as much as possible (e.g. jpeg
1739
//    does all cases internally since it needs to colorspace convert anyway,
1740
//    and it never has alpha, so very few cases ). png can automatically
1741
//    interleave an alpha=255 channel, but falls back to this for other cases
1742
//
1743
//  assume data buffer is malloced, so malloc a new one and free that one
1744
//  only failure mode is malloc failing
1745
1746
static stbi_uc stbi__compute_y(int r, int g, int b)
1747
0
{
1748
0
   return (stbi_uc) (((r*77) + (g*150) +  (29*b)) >> 8);
1749
0
}
1750
#endif
1751
1752
#if defined(STBI_NO_PNG) && defined(STBI_NO_BMP) && defined(STBI_NO_PSD) && defined(STBI_NO_TGA) && defined(STBI_NO_GIF) && defined(STBI_NO_PIC) && defined(STBI_NO_PNM)
1753
// nothing
1754
#else
1755
static unsigned char *stbi__convert_format(unsigned char *data, int img_n, int req_comp, unsigned int x, unsigned int y)
1756
555
{
1757
555
   int i,j;
1758
555
   unsigned char *good;
1759
1760
555
   if (req_comp == img_n) return data;
1761
323
   STBI_ASSERT(req_comp >= 1 && req_comp <= 4);
1762
1763
323
   good = (unsigned char *) stbi__malloc_mad3(req_comp, x, y, 0);
1764
323
   if (good == NULL) {
1765
0
      STBI_FREE(data);
1766
0
      return stbi__errpuc("outofmem", "Out of memory");
1767
0
   }
1768
1769
2.37M
   for (j=0; j < (int) y; ++j) {
1770
2.37M
      unsigned char *src  = data + j * x * img_n   ;
1771
2.37M
      unsigned char *dest = good + j * x * req_comp;
1772
1773
4.74M
      #define STBI__COMBO(a,b)  ((a)*8+(b))
1774
1.15G
      #define STBI__CASE(a,b)   case STBI__COMBO(a,b): for(i=x-1; i >= 0; --i, src += a, dest += b)
1775
      // convert source image with img_n components to one with req_comp components;
1776
      // avoid switch per pixel, so use switch per scanline and massive macros
1777
2.37M
      switch (STBI__COMBO(img_n, req_comp)) {
1778
0
         STBI__CASE(1,2) { dest[0]=src[0]; dest[1]=255;                                     } break;
1779
0
         STBI__CASE(1,3) { dest[0]=dest[1]=dest[2]=src[0];                                  } break;
1780
373M
         STBI__CASE(1,4) { dest[0]=dest[1]=dest[2]=src[0]; dest[3]=255;                     } break;
1781
0
         STBI__CASE(2,1) { dest[0]=src[0];                                                  } break;
1782
0
         STBI__CASE(2,3) { dest[0]=dest[1]=dest[2]=src[0];                                  } break;
1783
180M
         STBI__CASE(2,4) { dest[0]=dest[1]=dest[2]=src[0]; dest[3]=src[1];                  } break;
1784
599M
         STBI__CASE(3,4) { dest[0]=src[0];dest[1]=src[1];dest[2]=src[2];dest[3]=255;        } break;
1785
0
         STBI__CASE(3,1) { dest[0]=stbi__compute_y(src[0],src[1],src[2]);                   } break;
1786
0
         STBI__CASE(3,2) { dest[0]=stbi__compute_y(src[0],src[1],src[2]); dest[1] = 255;    } break;
1787
0
         STBI__CASE(4,1) { dest[0]=stbi__compute_y(src[0],src[1],src[2]);                   } break;
1788
0
         STBI__CASE(4,2) { dest[0]=stbi__compute_y(src[0],src[1],src[2]); dest[1] = src[3]; } break;
1789
0
         STBI__CASE(4,3) { dest[0]=src[0];dest[1]=src[1];dest[2]=src[2];                    } break;
1790
0
         default: STBI_ASSERT(0); STBI_FREE(data); STBI_FREE(good); return stbi__errpuc("unsupported", "Unsupported format conversion");
1791
2.37M
      }
1792
2.37M
      #undef STBI__CASE
1793
2.37M
   }
1794
1795
323
   STBI_FREE(data);
1796
323
   return good;
1797
323
}
1798
#endif
1799
1800
#if defined(STBI_NO_PNG) && defined(STBI_NO_PSD)
1801
// nothing
1802
#else
1803
static stbi__uint16 stbi__compute_y_16(int r, int g, int b)
1804
0
{
1805
0
   return (stbi__uint16) (((r*77) + (g*150) +  (29*b)) >> 8);
1806
0
}
1807
#endif
1808
1809
#if defined(STBI_NO_PNG) && defined(STBI_NO_PSD)
1810
// nothing
1811
#else
1812
static stbi__uint16 *stbi__convert_format16(stbi__uint16 *data, int img_n, int req_comp, unsigned int x, unsigned int y)
1813
127
{
1814
127
   int i,j;
1815
127
   stbi__uint16 *good;
1816
1817
127
   if (req_comp == img_n) return data;
1818
127
   STBI_ASSERT(req_comp >= 1 && req_comp <= 4);
1819
1820
127
   good = (stbi__uint16 *) stbi__malloc(req_comp * x * y * 2);
1821
127
   if (good == NULL) {
1822
0
      STBI_FREE(data);
1823
0
      return (stbi__uint16 *) stbi__errpuc("outofmem", "Out of memory");
1824
0
   }
1825
1826
142k
   for (j=0; j < (int) y; ++j) {
1827
142k
      stbi__uint16 *src  = data + j * x * img_n   ;
1828
142k
      stbi__uint16 *dest = good + j * x * req_comp;
1829
1830
284k
      #define STBI__COMBO(a,b)  ((a)*8+(b))
1831
246M
      #define STBI__CASE(a,b)   case STBI__COMBO(a,b): for(i=x-1; i >= 0; --i, src += a, dest += b)
1832
      // convert source image with img_n components to one with req_comp components;
1833
      // avoid switch per pixel, so use switch per scanline and massive macros
1834
142k
      switch (STBI__COMBO(img_n, req_comp)) {
1835
0
         STBI__CASE(1,2) { dest[0]=src[0]; dest[1]=0xffff;                                     } break;
1836
0
         STBI__CASE(1,3) { dest[0]=dest[1]=dest[2]=src[0];                                     } break;
1837
245M
         STBI__CASE(1,4) { dest[0]=dest[1]=dest[2]=src[0]; dest[3]=0xffff;                     } break;
1838
0
         STBI__CASE(2,1) { dest[0]=src[0];                                                     } break;
1839
0
         STBI__CASE(2,3) { dest[0]=dest[1]=dest[2]=src[0];                                     } break;
1840
87.4k
         STBI__CASE(2,4) { dest[0]=dest[1]=dest[2]=src[0]; dest[3]=src[1];                     } break;
1841
414
         STBI__CASE(3,4) { dest[0]=src[0];dest[1]=src[1];dest[2]=src[2];dest[3]=0xffff;        } break;
1842
0
         STBI__CASE(3,1) { dest[0]=stbi__compute_y_16(src[0],src[1],src[2]);                   } break;
1843
0
         STBI__CASE(3,2) { dest[0]=stbi__compute_y_16(src[0],src[1],src[2]); dest[1] = 0xffff; } break;
1844
0
         STBI__CASE(4,1) { dest[0]=stbi__compute_y_16(src[0],src[1],src[2]);                   } break;
1845
0
         STBI__CASE(4,2) { dest[0]=stbi__compute_y_16(src[0],src[1],src[2]); dest[1] = src[3]; } break;
1846
0
         STBI__CASE(4,3) { dest[0]=src[0];dest[1]=src[1];dest[2]=src[2];                       } break;
1847
0
         default: STBI_ASSERT(0); STBI_FREE(data); STBI_FREE(good); return (stbi__uint16*) stbi__errpuc("unsupported", "Unsupported format conversion");
1848
142k
      }
1849
142k
      #undef STBI__CASE
1850
142k
   }
1851
1852
127
   STBI_FREE(data);
1853
127
   return good;
1854
127
}
1855
#endif
1856
1857
#ifndef STBI_NO_LINEAR
1858
static float   *stbi__ldr_to_hdr(stbi_uc *data, int x, int y, int comp)
1859
0
{
1860
0
   int i,k,n;
1861
0
   float *output;
1862
0
   if (!data) return NULL;
1863
0
   output = (float *) stbi__malloc_mad4(x, y, comp, sizeof(float), 0);
1864
0
   if (output == NULL) { STBI_FREE(data); return stbi__errpf("outofmem", "Out of memory"); }
1865
   // compute number of non-alpha components
1866
0
   if (comp & 1) n = comp; else n = comp-1;
1867
0
   for (i=0; i < x*y; ++i) {
1868
0
      for (k=0; k < n; ++k) {
1869
0
         output[i*comp + k] = (float) (pow(data[i*comp+k]/255.0f, stbi__l2h_gamma) * stbi__l2h_scale);
1870
0
      }
1871
0
   }
1872
0
   if (n < comp) {
1873
0
      for (i=0; i < x*y; ++i) {
1874
0
         output[i*comp + n] = data[i*comp + n]/255.0f;
1875
0
      }
1876
0
   }
1877
0
   STBI_FREE(data);
1878
0
   return output;
1879
0
}
1880
#endif
1881
1882
#ifndef STBI_NO_HDR
1883
938M
#define stbi__float2int(x)   ((int) (x))
1884
static stbi_uc *stbi__hdr_to_ldr(float   *data, int x, int y, int comp)
1885
547
{
1886
547
   int i,k,n;
1887
547
   stbi_uc *output;
1888
547
   if (!data) return NULL;
1889
382
   output = (stbi_uc *) stbi__malloc_mad3(x, y, comp, 0);
1890
382
   if (output == NULL) { STBI_FREE(data); return stbi__errpuc("outofmem", "Out of memory"); }
1891
   // compute number of non-alpha components
1892
382
   if (comp & 1) n = comp; else n = comp-1;
1893
234M
   for (i=0; i < x*y; ++i) {
1894
938M
      for (k=0; k < n; ++k) {
1895
703M
         float z = (float) pow(data[i*comp+k]*stbi__h2l_scale_i, stbi__h2l_gamma_i) * 255 + 0.5f;
1896
703M
         if (z < 0) z = 0;
1897
703M
         if (z > 255) z = 255;
1898
703M
         output[i*comp + k] = (stbi_uc) stbi__float2int(z);
1899
703M
      }
1900
234M
      if (k < comp) {
1901
234M
         float z = data[i*comp+k] * 255 + 0.5f;
1902
234M
         if (z < 0) z = 0;
1903
234M
         if (z > 255) z = 255;
1904
234M
         output[i*comp + k] = (stbi_uc) stbi__float2int(z);
1905
234M
      }
1906
234M
   }
1907
382
   STBI_FREE(data);
1908
382
   return output;
1909
382
}
1910
#endif
1911
1912
//////////////////////////////////////////////////////////////////////////////
1913
//
1914
//  "baseline" JPEG/JFIF decoder
1915
//
1916
//    simple implementation
1917
//      - doesn't support delayed output of y-dimension
1918
//      - simple interface (only one output format: 8-bit interleaved RGB)
1919
//      - doesn't try to recover corrupt jpegs
1920
//      - doesn't allow partial loading, loading multiple at once
1921
//      - still fast on x86 (copying globals into locals doesn't help x86)
1922
//      - allocates lots of intermediate memory (full size of all components)
1923
//        - non-interleaved case requires this anyway
1924
//        - allows good upsampling (see next)
1925
//    high-quality
1926
//      - upsampled channels are bilinearly interpolated, even across blocks
1927
//      - quality integer IDCT derived from IJG's 'slow'
1928
//    performance
1929
//      - fast huffman; reasonable integer IDCT
1930
//      - some SIMD kernels for common paths on targets with SSE2/NEON
1931
//      - uses a lot of intermediate memory, could cache poorly
1932
1933
#ifndef STBI_NO_JPEG
1934
1935
// huffman decoding acceleration
1936
13.8G
#define FAST_BITS   9  // larger handles more cases; smaller stomps less cache
1937
1938
typedef struct
1939
{
1940
   stbi_uc  fast[1 << FAST_BITS];
1941
   // weirdly, repacking this into AoS is a 10% speed loss, instead of a win
1942
   stbi__uint16 code[256];
1943
   stbi_uc  values[256];
1944
   stbi_uc  size[257];
1945
   unsigned int maxcode[18];
1946
   int    delta[17];   // old 'firstsymbol' - old 'firstcode'
1947
} stbi__huffman;
1948
1949
typedef struct
1950
{
1951
   stbi__context *s;
1952
   stbi__huffman huff_dc[4];
1953
   stbi__huffman huff_ac[4];
1954
   stbi__uint16 dequant[4][64];
1955
   stbi__int16 fast_ac[4][1 << FAST_BITS];
1956
1957
// sizes for components, interleaved MCUs
1958
   int img_h_max, img_v_max;
1959
   int img_mcu_x, img_mcu_y;
1960
   int img_mcu_w, img_mcu_h;
1961
1962
// definition of jpeg image component
1963
   struct
1964
   {
1965
      int id;
1966
      int h,v;
1967
      int tq;
1968
      int hd,ha;
1969
      int dc_pred;
1970
1971
      int x,y,w2,h2;
1972
      stbi_uc *data;
1973
      void *raw_data, *raw_coeff;
1974
      stbi_uc *linebuf;
1975
      short   *coeff;   // progressive only
1976
      int      coeff_w, coeff_h; // number of 8x8 coefficient blocks
1977
   } img_comp[4];
1978
1979
   stbi__uint32   code_buffer; // jpeg entropy-coded buffer
1980
   int            code_bits;   // number of valid bits
1981
   unsigned char  marker;      // marker seen while filling entropy buffer
1982
   int            nomore;      // flag if we saw a marker so must stop
1983
1984
   int            progressive;
1985
   int            spec_start;
1986
   int            spec_end;
1987
   int            succ_high;
1988
   int            succ_low;
1989
   int            eob_run;
1990
   int            jfif;
1991
   int            app14_color_transform; // Adobe APP14 tag
1992
   int            rgb;
1993
1994
   int scan_n, order[4];
1995
   int restart_interval, todo;
1996
1997
// kernels
1998
   void (*idct_block_kernel)(stbi_uc *out, int out_stride, short data[64]);
1999
   void (*YCbCr_to_RGB_kernel)(stbi_uc *out, const stbi_uc *y, const stbi_uc *pcb, const stbi_uc *pcr, int count, int step);
2000
   stbi_uc *(*resample_row_hv_2_kernel)(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs);
2001
} stbi__jpeg;
2002
2003
static int stbi__build_huffman(stbi__huffman *h, int *count)
2004
111k
{
2005
111k
   int i,j,k=0;
2006
111k
   unsigned int code;
2007
   // build size list for each symbol (from JPEG spec)
2008
1.89M
   for (i=0; i < 16; ++i) {
2009
1.81M
      for (j=0; j < count[i]; ++j) {
2010
29.2k
         h->size[k++] = (stbi_uc) (i+1);
2011
29.2k
         if(k >= 257) return stbi__err("bad size list","Corrupt JPEG");
2012
29.2k
      }
2013
1.78M
   }
2014
111k
   h->size[k] = 0;
2015
2016
   // compute actual symbols (from jpeg spec)
2017
111k
   code = 0;
2018
111k
   k = 0;
2019
1.89M
   for(j=1; j <= 16; ++j) {
2020
      // compute delta to add to code to compute symbol id
2021
1.78M
      h->delta[j] = k - code;
2022
1.78M
      if (h->size[k] == j) {
2023
39.3k
         while (h->size[k] == j)
2024
28.9k
            h->code[k++] = (stbi__uint16) (code++);
2025
10.3k
         if (code-1 >= (1u << j)) return stbi__err("bad code lengths","Corrupt JPEG");
2026
10.3k
      }
2027
      // compute largest code + 1 for this size, preshifted as needed later
2028
1.78M
      h->maxcode[j] = code << (16-j);
2029
1.78M
      code <<= 1;
2030
1.78M
   }
2031
111k
   h->maxcode[j] = 0xffffffff;
2032
2033
   // build non-spec acceleration table; 255 is flag for not-accelerated
2034
111k
   memset(h->fast, 255, 1 << FAST_BITS);
2035
136k
   for (i=0; i < k; ++i) {
2036
24.8k
      int s = h->size[i];
2037
24.8k
      if (s <= FAST_BITS) {
2038
17.0k
         int c = h->code[i] << (FAST_BITS-s);
2039
17.0k
         int m = 1 << (FAST_BITS-s);
2040
1.48M
         for (j=0; j < m; ++j) {
2041
1.46M
            h->fast[c+j] = (stbi_uc) i;
2042
1.46M
         }
2043
17.0k
      }
2044
24.8k
   }
2045
111k
   return 1;
2046
111k
}
2047
2048
// build a table that decodes both magnitude and value of small ACs in
2049
// one go.
2050
static void stbi__build_fast_ac(stbi__int16 *fast_ac, stbi__huffman *h)
2051
3.17k
{
2052
3.17k
   int i;
2053
1.62M
   for (i=0; i < (1 << FAST_BITS); ++i) {
2054
1.62M
      stbi_uc fast = h->fast[i];
2055
1.62M
      fast_ac[i] = 0;
2056
1.62M
      if (fast < 255) {
2057
1.31M
         int rs = h->values[fast];
2058
1.31M
         int run = (rs >> 4) & 15;
2059
1.31M
         int magbits = rs & 15;
2060
1.31M
         int len = h->size[fast];
2061
2062
1.31M
         if (magbits && len + magbits <= FAST_BITS) {
2063
            // magnitude code followed by receive_extend code
2064
1.18M
            int k = ((i << len) & ((1 << FAST_BITS) - 1)) >> (FAST_BITS - magbits);
2065
1.18M
            int m = 1 << (magbits - 1);
2066
1.18M
            if (k < m) k += (~0U << magbits) + 1;
2067
            // if the result is small enough, we can fit it in fast_ac table
2068
1.18M
            if (k >= -128 && k <= 127)
2069
548k
               fast_ac[i] = (stbi__int16) ((k * 256) + (run * 16) + (len + magbits));
2070
1.18M
         }
2071
1.31M
      }
2072
1.62M
   }
2073
3.17k
}
2074
2075
static void stbi__grow_buffer_unsafe(stbi__jpeg *j)
2076
1.27G
{
2077
2.60G
   do {
2078
2.60G
      unsigned int b = j->nomore ? 0 : stbi__get8(j->s);
2079
2.60G
      if (b == 0xff) {
2080
10.5k
         int c = stbi__get8(j->s);
2081
14.2k
         while (c == 0xff) c = stbi__get8(j->s); // consume fill bytes
2082
10.5k
         if (c != 0) {
2083
9.13k
            j->marker = (unsigned char) c;
2084
9.13k
            j->nomore = 1;
2085
9.13k
            return;
2086
9.13k
         }
2087
10.5k
      }
2088
2.60G
      j->code_buffer |= b << (24 - j->code_bits);
2089
2.60G
      j->code_bits += 8;
2090
2.60G
   } while (j->code_bits <= 24);
2091
1.27G
}
2092
2093
// (1 << n) - 1
2094
static const stbi__uint32 stbi__bmask[17]={0,1,3,7,15,31,63,127,255,511,1023,2047,4095,8191,16383,32767,65535};
2095
2096
// decode a jpeg huffman value from the bitstream
2097
stbi_inline static int stbi__jpeg_huff_decode(stbi__jpeg *j, stbi__huffman *h)
2098
333M
{
2099
333M
   unsigned int temp;
2100
333M
   int c,k;
2101
2102
333M
   if (j->code_bits < 16) stbi__grow_buffer_unsafe(j);
2103
2104
   // look at the top FAST_BITS and determine what symbol ID it is,
2105
   // if the code is <= FAST_BITS
2106
333M
   c = (j->code_buffer >> (32 - FAST_BITS)) & ((1 << FAST_BITS)-1);
2107
333M
   k = h->fast[c];
2108
333M
   if (k < 255) {
2109
308M
      int s = h->size[k];
2110
308M
      if (s > j->code_bits)
2111
1
         return -1;
2112
308M
      j->code_buffer <<= s;
2113
308M
      j->code_bits -= s;
2114
308M
      return h->values[k];
2115
308M
   }
2116
2117
   // naive test is to shift the code_buffer down so k bits are
2118
   // valid, then test against maxcode. To speed this up, we've
2119
   // preshifted maxcode left so that it has (16-k) 0s at the
2120
   // end; in other words, regardless of the number of bits, it
2121
   // wants to be compared against something shifted to have 16;
2122
   // that way we don't need to shift inside the loop.
2123
25.1M
   temp = j->code_buffer >> 16;
2124
128M
   for (k=FAST_BITS+1 ; ; ++k)
2125
153M
      if (temp < h->maxcode[k])
2126
25.1M
         break;
2127
25.1M
   if (k == 17) {
2128
      // error! code not found
2129
36
      j->code_bits -= 16;
2130
36
      return -1;
2131
36
   }
2132
2133
25.1M
   if (k > j->code_bits)
2134
4
      return -1;
2135
2136
   // convert the huffman code to the symbol id
2137
25.1M
   c = ((j->code_buffer >> (32 - k)) & stbi__bmask[k]) + h->delta[k];
2138
25.1M
   if(c < 0 || c >= 256) // symbol id out of bounds!
2139
0
       return -1;
2140
25.1M
   STBI_ASSERT((((j->code_buffer) >> (32 - h->size[c])) & stbi__bmask[h->size[c]]) == h->code[c]);
2141
2142
   // convert the id to a symbol
2143
25.1M
   j->code_bits -= k;
2144
25.1M
   j->code_buffer <<= k;
2145
25.1M
   return h->values[c];
2146
25.1M
}
2147
2148
// bias[n] = (-1<<n) + 1
2149
static const int stbi__jbias[16] = {0,-1,-3,-7,-15,-31,-63,-127,-255,-511,-1023,-2047,-4095,-8191,-16383,-32767};
2150
2151
// combined JPEG 'receive' and JPEG 'extend', since baseline
2152
// always extends everything it receives.
2153
stbi_inline static int stbi__extend_receive(stbi__jpeg *j, int n)
2154
82.5M
{
2155
82.5M
   unsigned int k;
2156
82.5M
   int sgn;
2157
82.5M
   if (j->code_bits < n) stbi__grow_buffer_unsafe(j);
2158
82.5M
   if (j->code_bits < n) return 0; // ran out of bits from stream, return 0s intead of continuing
2159
2160
82.5M
   sgn = j->code_buffer >> 31; // sign bit always in MSB; 0 if MSB clear (positive), 1 if MSB set (negative)
2161
82.5M
   k = stbi_lrot(j->code_buffer, n);
2162
82.5M
   j->code_buffer = k & ~stbi__bmask[n];
2163
82.5M
   k &= stbi__bmask[n];
2164
82.5M
   j->code_bits -= n;
2165
82.5M
   return k + (stbi__jbias[n] & (sgn - 1));
2166
82.5M
}
2167
2168
// get some unsigned bits
2169
stbi_inline static int stbi__jpeg_get_bits(stbi__jpeg *j, int n)
2170
1.15M
{
2171
1.15M
   unsigned int k;
2172
1.15M
   if (j->code_bits < n) stbi__grow_buffer_unsafe(j);
2173
1.15M
   if (j->code_bits < n) return 0; // ran out of bits from stream, return 0s intead of continuing
2174
1.14M
   k = stbi_lrot(j->code_buffer, n);
2175
1.14M
   j->code_buffer = k & ~stbi__bmask[n];
2176
1.14M
   k &= stbi__bmask[n];
2177
1.14M
   j->code_bits -= n;
2178
1.14M
   return k;
2179
1.15M
}
2180
2181
stbi_inline static int stbi__jpeg_get_bit(stbi__jpeg *j)
2182
237M
{
2183
237M
   unsigned int k;
2184
237M
   if (j->code_bits < 1) stbi__grow_buffer_unsafe(j);
2185
237M
   if (j->code_bits < 1) return 0; // ran out of bits from stream, return 0s intead of continuing
2186
237M
   k = j->code_buffer;
2187
237M
   j->code_buffer <<= 1;
2188
237M
   --j->code_bits;
2189
237M
   return k & 0x80000000;
2190
237M
}
2191
2192
// given a value that's at position X in the zigzag stream,
2193
// where does it appear in the 8x8 matrix coded as row-major?
2194
static const stbi_uc stbi__jpeg_dezigzag[64+15] =
2195
{
2196
    0,  1,  8, 16,  9,  2,  3, 10,
2197
   17, 24, 32, 25, 18, 11,  4,  5,
2198
   12, 19, 26, 33, 40, 48, 41, 34,
2199
   27, 20, 13,  6,  7, 14, 21, 28,
2200
   35, 42, 49, 56, 57, 50, 43, 36,
2201
   29, 22, 15, 23, 30, 37, 44, 51,
2202
   58, 59, 52, 45, 38, 31, 39, 46,
2203
   53, 60, 61, 54, 47, 55, 62, 63,
2204
   // let corrupt input sample past end
2205
   63, 63, 63, 63, 63, 63, 63, 63,
2206
   63, 63, 63, 63, 63, 63, 63
2207
};
2208
2209
// decode one 64-entry block--
2210
static int stbi__jpeg_decode_block(stbi__jpeg *j, short data[64], stbi__huffman *hdc, stbi__huffman *hac, stbi__int16 *fac, int b, stbi__uint16 *dequant)
2211
112M
{
2212
112M
   int diff,dc,k;
2213
112M
   int t;
2214
2215
112M
   if (j->code_bits < 16) stbi__grow_buffer_unsafe(j);
2216
112M
   t = stbi__jpeg_huff_decode(j, hdc);
2217
112M
   if (t < 0 || t > 15) return stbi__err("bad huffman code","Corrupt JPEG");
2218
2219
   // 0 all the ac values now so we can do it 32-bits at a time
2220
112M
   memset(data,0,64*sizeof(data[0]));
2221
2222
112M
   diff = t ? stbi__extend_receive(j, t) : 0;
2223
112M
   if (!stbi__addints_valid(j->img_comp[b].dc_pred, diff)) return stbi__err("bad delta","Corrupt JPEG");
2224
112M
   dc = j->img_comp[b].dc_pred + diff;
2225
112M
   j->img_comp[b].dc_pred = dc;
2226
112M
   if (!stbi__mul2shorts_valid(dc, dequant[0])) return stbi__err("can't merge dc and ac", "Corrupt JPEG");
2227
112M
   data[0] = (short) (dc * dequant[0]);
2228
2229
   // decode AC components, see JPEG spec
2230
112M
   k = 1;
2231
164M
   do {
2232
164M
      unsigned int zig;
2233
164M
      int c,r,s;
2234
164M
      if (j->code_bits < 16) stbi__grow_buffer_unsafe(j);
2235
164M
      c = (j->code_buffer >> (32 - FAST_BITS)) & ((1 << FAST_BITS)-1);
2236
164M
      r = fac[c];
2237
164M
      if (r) { // fast-AC path
2238
40.9M
         k += (r >> 4) & 15; // run
2239
40.9M
         s = r & 15; // combined length
2240
40.9M
         if (s > j->code_bits) return stbi__err("bad huffman code", "Combined length longer than code bits available");
2241
40.9M
         j->code_buffer <<= s;
2242
40.9M
         j->code_bits -= s;
2243
         // decode into unzigzag'd location
2244
40.9M
         zig = stbi__jpeg_dezigzag[k++];
2245
40.9M
         data[zig] = (short) ((r >> 8) * dequant[zig]);
2246
123M
      } else {
2247
123M
         int rs = stbi__jpeg_huff_decode(j, hac);
2248
123M
         if (rs < 0) return stbi__err("bad huffman code","Corrupt JPEG");
2249
123M
         s = rs & 15;
2250
123M
         r = rs >> 4;
2251
123M
         if (s == 0) {
2252
106M
            if (rs != 0xf0) break; // end block
2253
174k
            k += 16;
2254
16.7M
         } else {
2255
16.7M
            k += r;
2256
            // decode into unzigzag'd location
2257
16.7M
            zig = stbi__jpeg_dezigzag[k++];
2258
16.7M
            data[zig] = (short) (stbi__extend_receive(j,s) * dequant[zig]);
2259
16.7M
         }
2260
123M
      }
2261
164M
   } while (k < 64);
2262
112M
   return 1;
2263
112M
}
2264
2265
static int stbi__jpeg_decode_block_prog_dc(stbi__jpeg *j, short data[64], stbi__huffman *hdc, int b)
2266
31.0M
{
2267
31.0M
   int diff,dc;
2268
31.0M
   int t;
2269
31.0M
   if (j->spec_end != 0) return stbi__err("can't merge dc and ac", "Corrupt JPEG");
2270
2271
31.0M
   if (j->code_bits < 16) stbi__grow_buffer_unsafe(j);
2272
2273
31.0M
   if (j->succ_high == 0) {
2274
      // first scan for DC coefficient, must be first
2275
15.0M
      memset(data,0,64*sizeof(data[0])); // 0 all the ac values now
2276
15.0M
      t = stbi__jpeg_huff_decode(j, hdc);
2277
15.0M
      if (t < 0 || t > 15) return stbi__err("can't merge dc and ac", "Corrupt JPEG");
2278
15.0M
      diff = t ? stbi__extend_receive(j, t) : 0;
2279
2280
15.0M
      if (!stbi__addints_valid(j->img_comp[b].dc_pred, diff)) return stbi__err("bad delta", "Corrupt JPEG");
2281
15.0M
      dc = j->img_comp[b].dc_pred + diff;
2282
15.0M
      j->img_comp[b].dc_pred = dc;
2283
15.0M
      if (!stbi__mul2shorts_valid(dc, 1 << j->succ_low)) return stbi__err("can't merge dc and ac", "Corrupt JPEG");
2284
15.0M
      data[0] = (short) (dc * (1 << j->succ_low));
2285
16.0M
   } else {
2286
      // refinement scan for DC coefficient
2287
16.0M
      if (stbi__jpeg_get_bit(j))
2288
3.67k
         data[0] += (short) (1 << j->succ_low);
2289
16.0M
   }
2290
31.0M
   return 1;
2291
31.0M
}
2292
2293
// @OPTIMIZE: store non-zigzagged during the decode passes,
2294
// and only de-zigzag when dequantizing
2295
static int stbi__jpeg_decode_block_prog_ac(stbi__jpeg *j, short data[64], stbi__huffman *hac, stbi__int16 *fac)
2296
216M
{
2297
216M
   int k;
2298
216M
   if (j->spec_start == 0) return stbi__err("can't merge dc and ac", "Corrupt JPEG");
2299
2300
216M
   if (j->succ_high == 0) {
2301
196M
      int shift = j->succ_low;
2302
2303
196M
      if (j->eob_run) {
2304
85.4M
         --j->eob_run;
2305
85.4M
         return 1;
2306
85.4M
      }
2307
2308
110M
      k = j->spec_start;
2309
6.43G
      do {
2310
6.43G
         unsigned int zig;
2311
6.43G
         int c,r,s;
2312
6.43G
         if (j->code_bits < 16) stbi__grow_buffer_unsafe(j);
2313
6.43G
         c = (j->code_buffer >> (32 - FAST_BITS)) & ((1 << FAST_BITS)-1);
2314
6.43G
         r = fac[c];
2315
6.43G
         if (r) { // fast-AC path
2316
6.37G
            k += (r >> 4) & 15; // run
2317
6.37G
            s = r & 15; // combined length
2318
6.37G
            if (s > j->code_bits) return stbi__err("bad huffman code", "Combined length longer than code bits available");
2319
6.37G
            j->code_buffer <<= s;
2320
6.37G
            j->code_bits -= s;
2321
6.37G
            zig = stbi__jpeg_dezigzag[k++];
2322
6.37G
            data[zig] = (short) ((r >> 8) * (1 << shift));
2323
6.37G
         } else {
2324
59.0M
            int rs = stbi__jpeg_huff_decode(j, hac);
2325
59.0M
            if (rs < 0) return stbi__err("bad huffman code","Corrupt JPEG");
2326
59.0M
            s = rs & 15;
2327
59.0M
            r = rs >> 4;
2328
59.0M
            if (s == 0) {
2329
3.29M
               if (r < 15) {
2330
3.21M
                  j->eob_run = (1 << r);
2331
3.21M
                  if (r)
2332
31.7k
                     j->eob_run += stbi__jpeg_get_bits(j, r);
2333
3.21M
                  --j->eob_run;
2334
3.21M
                  break;
2335
3.21M
               }
2336
81.3k
               k += 16;
2337
55.7M
            } else {
2338
55.7M
               k += r;
2339
55.7M
               zig = stbi__jpeg_dezigzag[k++];
2340
55.7M
               data[zig] = (short) (stbi__extend_receive(j,s) * (1 << shift));
2341
55.7M
            }
2342
59.0M
         }
2343
6.43G
      } while (k <= j->spec_end);
2344
110M
   } else {
2345
      // refinement scan for these AC coefficients
2346
2347
20.4M
      short bit = (short) (1 << j->succ_low);
2348
2349
20.4M
      if (j->eob_run) {
2350
5.34M
         --j->eob_run;
2351
81.0M
         for (k = j->spec_start; k <= j->spec_end; ++k) {
2352
75.7M
            short *p = &data[stbi__jpeg_dezigzag[k]];
2353
75.7M
            if (*p != 0)
2354
24.3M
               if (stbi__jpeg_get_bit(j))
2355
1.12k
                  if ((*p & bit)==0) {
2356
561
                     if (*p > 0)
2357
246
                        *p += bit;
2358
315
                     else
2359
315
                        *p -= bit;
2360
561
                  }
2361
75.7M
         }
2362
15.1M
      } else {
2363
15.1M
         k = j->spec_start;
2364
23.0M
         do {
2365
23.0M
            int r,s;
2366
23.0M
            int rs = stbi__jpeg_huff_decode(j, hac); // @OPTIMIZE see if we can use the fast path here, advance-by-r is so slow, eh
2367
23.0M
            if (rs < 0) return stbi__err("bad huffman code","Corrupt JPEG");
2368
23.0M
            s = rs & 15;
2369
23.0M
            r = rs >> 4;
2370
23.0M
            if (s == 0) {
2371
7.42M
               if (r < 15) {
2372
7.21M
                  j->eob_run = (1 << r) - 1;
2373
7.21M
                  if (r)
2374
1.11M
                     j->eob_run += stbi__jpeg_get_bits(j, r);
2375
7.21M
                  r = 64; // force end of block
2376
7.21M
               } else {
2377
                  // r=15 s=0 should write 16 0s, so we just do
2378
                  // a run of 15 0s and then write s (which is 0),
2379
                  // so we don't have to do anything special here
2380
211k
               }
2381
15.6M
            } else {
2382
15.6M
               if (s != 1) return stbi__err("bad huffman code", "Corrupt JPEG");
2383
               // sign bit
2384
15.6M
               if (stbi__jpeg_get_bit(j))
2385
620
                  s = bit;
2386
15.6M
               else
2387
15.6M
                  s = -bit;
2388
15.6M
            }
2389
2390
            // advance by r
2391
302M
            while (k <= j->spec_end) {
2392
288M
               short *p = &data[stbi__jpeg_dezigzag[k++]];
2393
288M
               if (*p != 0) {
2394
181M
                  if (stbi__jpeg_get_bit(j))
2395
6.35k
                     if ((*p & bit)==0) {
2396
2.32k
                        if (*p > 0)
2397
427
                           *p += bit;
2398
1.89k
                        else
2399
1.89k
                           *p -= bit;
2400
2.32k
                     }
2401
181M
               } else {
2402
106M
                  if (r == 0) {
2403
8.50M
                     *p = (short) s;
2404
8.50M
                     break;
2405
8.50M
                  }
2406
98.1M
                  --r;
2407
98.1M
               }
2408
288M
            }
2409
23.0M
         } while (k <= j->spec_end);
2410
15.1M
      }
2411
20.4M
   }
2412
131M
   return 1;
2413
216M
}
2414
2415
// take a -128..127 value and stbi__clamp it and convert to 0..255
2416
stbi_inline static stbi_uc stbi__clamp(int x)
2417
0
{
2418
   // trick to use a single test to catch both cases
2419
0
   if ((unsigned int) x > 255) {
2420
0
      if (x < 0) return 0;
2421
0
      if (x > 255) return 255;
2422
0
   }
2423
0
   return (stbi_uc) x;
2424
0
}
2425
2426
0
#define stbi__f2f(x)  ((int) (((x) * 4096 + 0.5)))
2427
0
#define stbi__fsh(x)  ((x) * 4096)
2428
2429
// derived from jidctint -- DCT_ISLOW
2430
#define STBI__IDCT_1D(s0,s1,s2,s3,s4,s5,s6,s7) \
2431
0
   int t0,t1,t2,t3,p1,p2,p3,p4,p5,x0,x1,x2,x3; \
2432
0
   p2 = s2;                                    \
2433
0
   p3 = s6;                                    \
2434
0
   p1 = (p2+p3) * stbi__f2f(0.5411961f);       \
2435
0
   t2 = p1 + p3*stbi__f2f(-1.847759065f);      \
2436
0
   t3 = p1 + p2*stbi__f2f( 0.765366865f);      \
2437
0
   p2 = s0;                                    \
2438
0
   p3 = s4;                                    \
2439
0
   t0 = stbi__fsh(p2+p3);                      \
2440
0
   t1 = stbi__fsh(p2-p3);                      \
2441
0
   x0 = t0+t3;                                 \
2442
0
   x3 = t0-t3;                                 \
2443
0
   x1 = t1+t2;                                 \
2444
0
   x2 = t1-t2;                                 \
2445
0
   t0 = s7;                                    \
2446
0
   t1 = s5;                                    \
2447
0
   t2 = s3;                                    \
2448
0
   t3 = s1;                                    \
2449
0
   p3 = t0+t2;                                 \
2450
0
   p4 = t1+t3;                                 \
2451
0
   p1 = t0+t3;                                 \
2452
0
   p2 = t1+t2;                                 \
2453
0
   p5 = (p3+p4)*stbi__f2f( 1.175875602f);      \
2454
0
   t0 = t0*stbi__f2f( 0.298631336f);           \
2455
0
   t1 = t1*stbi__f2f( 2.053119869f);           \
2456
0
   t2 = t2*stbi__f2f( 3.072711026f);           \
2457
0
   t3 = t3*stbi__f2f( 1.501321110f);           \
2458
0
   p1 = p5 + p1*stbi__f2f(-0.899976223f);      \
2459
0
   p2 = p5 + p2*stbi__f2f(-2.562915447f);      \
2460
0
   p3 = p3*stbi__f2f(-1.961570560f);           \
2461
0
   p4 = p4*stbi__f2f(-0.390180644f);           \
2462
0
   t3 += p1+p4;                                \
2463
0
   t2 += p2+p3;                                \
2464
0
   t1 += p2+p4;                                \
2465
0
   t0 += p1+p3;
2466
2467
static void stbi__idct_block(stbi_uc *out, int out_stride, short data[64])
2468
0
{
2469
0
   int i,val[64],*v=val;
2470
0
   stbi_uc *o;
2471
0
   short *d = data;
2472
2473
   // columns
2474
0
   for (i=0; i < 8; ++i,++d, ++v) {
2475
      // if all zeroes, shortcut -- this avoids dequantizing 0s and IDCTing
2476
0
      if (d[ 8]==0 && d[16]==0 && d[24]==0 && d[32]==0
2477
0
           && d[40]==0 && d[48]==0 && d[56]==0) {
2478
         //    no shortcut                 0     seconds
2479
         //    (1|2|3|4|5|6|7)==0          0     seconds
2480
         //    all separate               -0.047 seconds
2481
         //    1 && 2|3 && 4|5 && 6|7:    -0.047 seconds
2482
0
         int dcterm = d[0]*4;
2483
0
         v[0] = v[8] = v[16] = v[24] = v[32] = v[40] = v[48] = v[56] = dcterm;
2484
0
      } else {
2485
0
         STBI__IDCT_1D(d[ 0],d[ 8],d[16],d[24],d[32],d[40],d[48],d[56])
2486
         // constants scaled things up by 1<<12; let's bring them back
2487
         // down, but keep 2 extra bits of precision
2488
0
         x0 += 512; x1 += 512; x2 += 512; x3 += 512;
2489
0
         v[ 0] = (x0+t3) >> 10;
2490
0
         v[56] = (x0-t3) >> 10;
2491
0
         v[ 8] = (x1+t2) >> 10;
2492
0
         v[48] = (x1-t2) >> 10;
2493
0
         v[16] = (x2+t1) >> 10;
2494
0
         v[40] = (x2-t1) >> 10;
2495
0
         v[24] = (x3+t0) >> 10;
2496
0
         v[32] = (x3-t0) >> 10;
2497
0
      }
2498
0
   }
2499
2500
0
   for (i=0, v=val, o=out; i < 8; ++i,v+=8,o+=out_stride) {
2501
      // no fast case since the first 1D IDCT spread components out
2502
0
      STBI__IDCT_1D(v[0],v[1],v[2],v[3],v[4],v[5],v[6],v[7])
2503
      // constants scaled things up by 1<<12, plus we had 1<<2 from first
2504
      // loop, plus horizontal and vertical each scale by sqrt(8) so together
2505
      // we've got an extra 1<<3, so 1<<17 total we need to remove.
2506
      // so we want to round that, which means adding 0.5 * 1<<17,
2507
      // aka 65536. Also, we'll end up with -128 to 127 that we want
2508
      // to encode as 0..255 by adding 128, so we'll add that before the shift
2509
0
      x0 += 65536 + (128<<17);
2510
0
      x1 += 65536 + (128<<17);
2511
0
      x2 += 65536 + (128<<17);
2512
0
      x3 += 65536 + (128<<17);
2513
      // tried computing the shifts into temps, or'ing the temps to see
2514
      // if any were out of range, but that was slower
2515
0
      o[0] = stbi__clamp((x0+t3) >> 17);
2516
0
      o[7] = stbi__clamp((x0-t3) >> 17);
2517
0
      o[1] = stbi__clamp((x1+t2) >> 17);
2518
0
      o[6] = stbi__clamp((x1-t2) >> 17);
2519
0
      o[2] = stbi__clamp((x2+t1) >> 17);
2520
0
      o[5] = stbi__clamp((x2-t1) >> 17);
2521
0
      o[3] = stbi__clamp((x3+t0) >> 17);
2522
0
      o[4] = stbi__clamp((x3-t0) >> 17);
2523
0
   }
2524
0
}
2525
2526
#ifdef STBI_SSE2
2527
// sse2 integer IDCT. not the fastest possible implementation but it
2528
// produces bit-identical results to the generic C version so it's
2529
// fully "transparent".
2530
static void stbi__idct_simd(stbi_uc *out, int out_stride, short data[64])
2531
115M
{
2532
   // This is constructed to match our regular (generic) integer IDCT exactly.
2533
115M
   __m128i row0, row1, row2, row3, row4, row5, row6, row7;
2534
115M
   __m128i tmp;
2535
2536
   // dot product constant: even elems=x, odd elems=y
2537
921M
   #define dct_const(x,y)  _mm_setr_epi16((x),(y),(x),(y),(x),(y),(x),(y))
2538
2539
   // out(0) = c0[even]*x + c0[odd]*y   (c0, x, y 16-bit, out 32-bit)
2540
   // out(1) = c1[even]*x + c1[odd]*y
2541
115M
   #define dct_rot(out0,out1, x,y,c0,c1) \
2542
921M
      __m128i c0##lo = _mm_unpacklo_epi16((x),(y)); \
2543
921M
      __m128i c0##hi = _mm_unpackhi_epi16((x),(y)); \
2544
921M
      __m128i out0##_l = _mm_madd_epi16(c0##lo, c0); \
2545
921M
      __m128i out0##_h = _mm_madd_epi16(c0##hi, c0); \
2546
921M
      __m128i out1##_l = _mm_madd_epi16(c0##lo, c1); \
2547
921M
      __m128i out1##_h = _mm_madd_epi16(c0##hi, c1)
2548
2549
   // out = in << 12  (in 16-bit, out 32-bit)
2550
115M
   #define dct_widen(out, in) \
2551
460M
      __m128i out##_l = _mm_srai_epi32(_mm_unpacklo_epi16(_mm_setzero_si128(), (in)), 4); \
2552
460M
      __m128i out##_h = _mm_srai_epi32(_mm_unpackhi_epi16(_mm_setzero_si128(), (in)), 4)
2553
2554
   // wide add
2555
115M
   #define dct_wadd(out, a, b) \
2556
2.30G
      __m128i out##_l = _mm_add_epi32(a##_l, b##_l); \
2557
2.30G
      __m128i out##_h = _mm_add_epi32(a##_h, b##_h)
2558
2559
   // wide sub
2560
115M
   #define dct_wsub(out, a, b) \
2561
1.38G
      __m128i out##_l = _mm_sub_epi32(a##_l, b##_l); \
2562
1.38G
      __m128i out##_h = _mm_sub_epi32(a##_h, b##_h)
2563
2564
   // butterfly a/b, add bias, then shift by "s" and pack
2565
115M
   #define dct_bfly32o(out0, out1, a,b,bias,s) \
2566
921M
      { \
2567
921M
         __m128i abiased_l = _mm_add_epi32(a##_l, bias); \
2568
921M
         __m128i abiased_h = _mm_add_epi32(a##_h, bias); \
2569
921M
         dct_wadd(sum, abiased, b); \
2570
921M
         dct_wsub(dif, abiased, b); \
2571
921M
         out0 = _mm_packs_epi32(_mm_srai_epi32(sum_l, s), _mm_srai_epi32(sum_h, s)); \
2572
921M
         out1 = _mm_packs_epi32(_mm_srai_epi32(dif_l, s), _mm_srai_epi32(dif_h, s)); \
2573
921M
      }
2574
2575
   // 8-bit interleave step (for transposes)
2576
115M
   #define dct_interleave8(a, b) \
2577
690M
      tmp = a; \
2578
690M
      a = _mm_unpacklo_epi8(a, b); \
2579
690M
      b = _mm_unpackhi_epi8(tmp, b)
2580
2581
   // 16-bit interleave step (for transposes)
2582
115M
   #define dct_interleave16(a, b) \
2583
1.38G
      tmp = a; \
2584
1.38G
      a = _mm_unpacklo_epi16(a, b); \
2585
1.38G
      b = _mm_unpackhi_epi16(tmp, b)
2586
2587
115M
   #define dct_pass(bias,shift) \
2588
230M
      { \
2589
         /* even part */ \
2590
230M
         dct_rot(t2e,t3e, row2,row6, rot0_0,rot0_1); \
2591
230M
         __m128i sum04 = _mm_add_epi16(row0, row4); \
2592
230M
         __m128i dif04 = _mm_sub_epi16(row0, row4); \
2593
230M
         dct_widen(t0e, sum04); \
2594
230M
         dct_widen(t1e, dif04); \
2595
230M
         dct_wadd(x0, t0e, t3e); \
2596
230M
         dct_wsub(x3, t0e, t3e); \
2597
230M
         dct_wadd(x1, t1e, t2e); \
2598
230M
         dct_wsub(x2, t1e, t2e); \
2599
         /* odd part */ \
2600
230M
         dct_rot(y0o,y2o, row7,row3, rot2_0,rot2_1); \
2601
230M
         dct_rot(y1o,y3o, row5,row1, rot3_0,rot3_1); \
2602
230M
         __m128i sum17 = _mm_add_epi16(row1, row7); \
2603
230M
         __m128i sum35 = _mm_add_epi16(row3, row5); \
2604
230M
         dct_rot(y4o,y5o, sum17,sum35, rot1_0,rot1_1); \
2605
230M
         dct_wadd(x4, y0o, y4o); \
2606
230M
         dct_wadd(x5, y1o, y5o); \
2607
230M
         dct_wadd(x6, y2o, y5o); \
2608
230M
         dct_wadd(x7, y3o, y4o); \
2609
230M
         dct_bfly32o(row0,row7, x0,x7,bias,shift); \
2610
230M
         dct_bfly32o(row1,row6, x1,x6,bias,shift); \
2611
230M
         dct_bfly32o(row2,row5, x2,x5,bias,shift); \
2612
230M
         dct_bfly32o(row3,row4, x3,x4,bias,shift); \
2613
230M
      }
2614
2615
115M
   __m128i rot0_0 = dct_const(stbi__f2f(0.5411961f), stbi__f2f(0.5411961f) + stbi__f2f(-1.847759065f));
2616
115M
   __m128i rot0_1 = dct_const(stbi__f2f(0.5411961f) + stbi__f2f( 0.765366865f), stbi__f2f(0.5411961f));
2617
115M
   __m128i rot1_0 = dct_const(stbi__f2f(1.175875602f) + stbi__f2f(-0.899976223f), stbi__f2f(1.175875602f));
2618
115M
   __m128i rot1_1 = dct_const(stbi__f2f(1.175875602f), stbi__f2f(1.175875602f) + stbi__f2f(-2.562915447f));
2619
115M
   __m128i rot2_0 = dct_const(stbi__f2f(-1.961570560f) + stbi__f2f( 0.298631336f), stbi__f2f(-1.961570560f));
2620
115M
   __m128i rot2_1 = dct_const(stbi__f2f(-1.961570560f), stbi__f2f(-1.961570560f) + stbi__f2f( 3.072711026f));
2621
115M
   __m128i rot3_0 = dct_const(stbi__f2f(-0.390180644f) + stbi__f2f( 2.053119869f), stbi__f2f(-0.390180644f));
2622
115M
   __m128i rot3_1 = dct_const(stbi__f2f(-0.390180644f), stbi__f2f(-0.390180644f) + stbi__f2f( 1.501321110f));
2623
2624
   // rounding biases in column/row passes, see stbi__idct_block for explanation.
2625
115M
   __m128i bias_0 = _mm_set1_epi32(512);
2626
115M
   __m128i bias_1 = _mm_set1_epi32(65536 + (128<<17));
2627
2628
   // load
2629
115M
   row0 = _mm_load_si128((const __m128i *) (data + 0*8));
2630
115M
   row1 = _mm_load_si128((const __m128i *) (data + 1*8));
2631
115M
   row2 = _mm_load_si128((const __m128i *) (data + 2*8));
2632
115M
   row3 = _mm_load_si128((const __m128i *) (data + 3*8));
2633
115M
   row4 = _mm_load_si128((const __m128i *) (data + 4*8));
2634
115M
   row5 = _mm_load_si128((const __m128i *) (data + 5*8));
2635
115M
   row6 = _mm_load_si128((const __m128i *) (data + 6*8));
2636
115M
   row7 = _mm_load_si128((const __m128i *) (data + 7*8));
2637
2638
   // column pass
2639
115M
   dct_pass(bias_0, 10);
2640
2641
115M
   {
2642
      // 16bit 8x8 transpose pass 1
2643
115M
      dct_interleave16(row0, row4);
2644
115M
      dct_interleave16(row1, row5);
2645
115M
      dct_interleave16(row2, row6);
2646
115M
      dct_interleave16(row3, row7);
2647
2648
      // transpose pass 2
2649
115M
      dct_interleave16(row0, row2);
2650
115M
      dct_interleave16(row1, row3);
2651
115M
      dct_interleave16(row4, row6);
2652
115M
      dct_interleave16(row5, row7);
2653
2654
      // transpose pass 3
2655
115M
      dct_interleave16(row0, row1);
2656
115M
      dct_interleave16(row2, row3);
2657
115M
      dct_interleave16(row4, row5);
2658
115M
      dct_interleave16(row6, row7);
2659
115M
   }
2660
2661
   // row pass
2662
115M
   dct_pass(bias_1, 17);
2663
2664
115M
   {
2665
      // pack
2666
115M
      __m128i p0 = _mm_packus_epi16(row0, row1); // a0a1a2a3...a7b0b1b2b3...b7
2667
115M
      __m128i p1 = _mm_packus_epi16(row2, row3);
2668
115M
      __m128i p2 = _mm_packus_epi16(row4, row5);
2669
115M
      __m128i p3 = _mm_packus_epi16(row6, row7);
2670
2671
      // 8bit 8x8 transpose pass 1
2672
115M
      dct_interleave8(p0, p2); // a0e0a1e1...
2673
115M
      dct_interleave8(p1, p3); // c0g0c1g1...
2674
2675
      // transpose pass 2
2676
115M
      dct_interleave8(p0, p1); // a0c0e0g0...
2677
115M
      dct_interleave8(p2, p3); // b0d0f0h0...
2678
2679
      // transpose pass 3
2680
115M
      dct_interleave8(p0, p2); // a0b0c0d0...
2681
115M
      dct_interleave8(p1, p3); // a4b4c4d4...
2682
2683
      // store
2684
115M
      _mm_storel_epi64((__m128i *) out, p0); out += out_stride;
2685
115M
      _mm_storel_epi64((__m128i *) out, _mm_shuffle_epi32(p0, 0x4e)); out += out_stride;
2686
115M
      _mm_storel_epi64((__m128i *) out, p2); out += out_stride;
2687
115M
      _mm_storel_epi64((__m128i *) out, _mm_shuffle_epi32(p2, 0x4e)); out += out_stride;
2688
115M
      _mm_storel_epi64((__m128i *) out, p1); out += out_stride;
2689
115M
      _mm_storel_epi64((__m128i *) out, _mm_shuffle_epi32(p1, 0x4e)); out += out_stride;
2690
115M
      _mm_storel_epi64((__m128i *) out, p3); out += out_stride;
2691
115M
      _mm_storel_epi64((__m128i *) out, _mm_shuffle_epi32(p3, 0x4e));
2692
115M
   }
2693
2694
115M
#undef dct_const
2695
115M
#undef dct_rot
2696
115M
#undef dct_widen
2697
115M
#undef dct_wadd
2698
115M
#undef dct_wsub
2699
115M
#undef dct_bfly32o
2700
115M
#undef dct_interleave8
2701
115M
#undef dct_interleave16
2702
115M
#undef dct_pass
2703
115M
}
2704
2705
#endif // STBI_SSE2
2706
2707
#ifdef STBI_NEON
2708
2709
// NEON integer IDCT. should produce bit-identical
2710
// results to the generic C version.
2711
static void stbi__idct_simd(stbi_uc *out, int out_stride, short data[64])
2712
{
2713
   int16x8_t row0, row1, row2, row3, row4, row5, row6, row7;
2714
2715
   int16x4_t rot0_0 = vdup_n_s16(stbi__f2f(0.5411961f));
2716
   int16x4_t rot0_1 = vdup_n_s16(stbi__f2f(-1.847759065f));
2717
   int16x4_t rot0_2 = vdup_n_s16(stbi__f2f( 0.765366865f));
2718
   int16x4_t rot1_0 = vdup_n_s16(stbi__f2f( 1.175875602f));
2719
   int16x4_t rot1_1 = vdup_n_s16(stbi__f2f(-0.899976223f));
2720
   int16x4_t rot1_2 = vdup_n_s16(stbi__f2f(-2.562915447f));
2721
   int16x4_t rot2_0 = vdup_n_s16(stbi__f2f(-1.961570560f));
2722
   int16x4_t rot2_1 = vdup_n_s16(stbi__f2f(-0.390180644f));
2723
   int16x4_t rot3_0 = vdup_n_s16(stbi__f2f( 0.298631336f));
2724
   int16x4_t rot3_1 = vdup_n_s16(stbi__f2f( 2.053119869f));
2725
   int16x4_t rot3_2 = vdup_n_s16(stbi__f2f( 3.072711026f));
2726
   int16x4_t rot3_3 = vdup_n_s16(stbi__f2f( 1.501321110f));
2727
2728
#define dct_long_mul(out, inq, coeff) \
2729
   int32x4_t out##_l = vmull_s16(vget_low_s16(inq), coeff); \
2730
   int32x4_t out##_h = vmull_s16(vget_high_s16(inq), coeff)
2731
2732
#define dct_long_mac(out, acc, inq, coeff) \
2733
   int32x4_t out##_l = vmlal_s16(acc##_l, vget_low_s16(inq), coeff); \
2734
   int32x4_t out##_h = vmlal_s16(acc##_h, vget_high_s16(inq), coeff)
2735
2736
#define dct_widen(out, inq) \
2737
   int32x4_t out##_l = vshll_n_s16(vget_low_s16(inq), 12); \
2738
   int32x4_t out##_h = vshll_n_s16(vget_high_s16(inq), 12)
2739
2740
// wide add
2741
#define dct_wadd(out, a, b) \
2742
   int32x4_t out##_l = vaddq_s32(a##_l, b##_l); \
2743
   int32x4_t out##_h = vaddq_s32(a##_h, b##_h)
2744
2745
// wide sub
2746
#define dct_wsub(out, a, b) \
2747
   int32x4_t out##_l = vsubq_s32(a##_l, b##_l); \
2748
   int32x4_t out##_h = vsubq_s32(a##_h, b##_h)
2749
2750
// butterfly a/b, then shift using "shiftop" by "s" and pack
2751
#define dct_bfly32o(out0,out1, a,b,shiftop,s) \
2752
   { \
2753
      dct_wadd(sum, a, b); \
2754
      dct_wsub(dif, a, b); \
2755
      out0 = vcombine_s16(shiftop(sum_l, s), shiftop(sum_h, s)); \
2756
      out1 = vcombine_s16(shiftop(dif_l, s), shiftop(dif_h, s)); \
2757
   }
2758
2759
#define dct_pass(shiftop, shift) \
2760
   { \
2761
      /* even part */ \
2762
      int16x8_t sum26 = vaddq_s16(row2, row6); \
2763
      dct_long_mul(p1e, sum26, rot0_0); \
2764
      dct_long_mac(t2e, p1e, row6, rot0_1); \
2765
      dct_long_mac(t3e, p1e, row2, rot0_2); \
2766
      int16x8_t sum04 = vaddq_s16(row0, row4); \
2767
      int16x8_t dif04 = vsubq_s16(row0, row4); \
2768
      dct_widen(t0e, sum04); \
2769
      dct_widen(t1e, dif04); \
2770
      dct_wadd(x0, t0e, t3e); \
2771
      dct_wsub(x3, t0e, t3e); \
2772
      dct_wadd(x1, t1e, t2e); \
2773
      dct_wsub(x2, t1e, t2e); \
2774
      /* odd part */ \
2775
      int16x8_t sum15 = vaddq_s16(row1, row5); \
2776
      int16x8_t sum17 = vaddq_s16(row1, row7); \
2777
      int16x8_t sum35 = vaddq_s16(row3, row5); \
2778
      int16x8_t sum37 = vaddq_s16(row3, row7); \
2779
      int16x8_t sumodd = vaddq_s16(sum17, sum35); \
2780
      dct_long_mul(p5o, sumodd, rot1_0); \
2781
      dct_long_mac(p1o, p5o, sum17, rot1_1); \
2782
      dct_long_mac(p2o, p5o, sum35, rot1_2); \
2783
      dct_long_mul(p3o, sum37, rot2_0); \
2784
      dct_long_mul(p4o, sum15, rot2_1); \
2785
      dct_wadd(sump13o, p1o, p3o); \
2786
      dct_wadd(sump24o, p2o, p4o); \
2787
      dct_wadd(sump23o, p2o, p3o); \
2788
      dct_wadd(sump14o, p1o, p4o); \
2789
      dct_long_mac(x4, sump13o, row7, rot3_0); \
2790
      dct_long_mac(x5, sump24o, row5, rot3_1); \
2791
      dct_long_mac(x6, sump23o, row3, rot3_2); \
2792
      dct_long_mac(x7, sump14o, row1, rot3_3); \
2793
      dct_bfly32o(row0,row7, x0,x7,shiftop,shift); \
2794
      dct_bfly32o(row1,row6, x1,x6,shiftop,shift); \
2795
      dct_bfly32o(row2,row5, x2,x5,shiftop,shift); \
2796
      dct_bfly32o(row3,row4, x3,x4,shiftop,shift); \
2797
   }
2798
2799
   // load
2800
   row0 = vld1q_s16(data + 0*8);
2801
   row1 = vld1q_s16(data + 1*8);
2802
   row2 = vld1q_s16(data + 2*8);
2803
   row3 = vld1q_s16(data + 3*8);
2804
   row4 = vld1q_s16(data + 4*8);
2805
   row5 = vld1q_s16(data + 5*8);
2806
   row6 = vld1q_s16(data + 6*8);
2807
   row7 = vld1q_s16(data + 7*8);
2808
2809
   // add DC bias
2810
   row0 = vaddq_s16(row0, vsetq_lane_s16(1024, vdupq_n_s16(0), 0));
2811
2812
   // column pass
2813
   dct_pass(vrshrn_n_s32, 10);
2814
2815
   // 16bit 8x8 transpose
2816
   {
2817
// these three map to a single VTRN.16, VTRN.32, and VSWP, respectively.
2818
// whether compilers actually get this is another story, sadly.
2819
#define dct_trn16(x, y) { int16x8x2_t t = vtrnq_s16(x, y); x = t.val[0]; y = t.val[1]; }
2820
#define dct_trn32(x, y) { int32x4x2_t t = vtrnq_s32(vreinterpretq_s32_s16(x), vreinterpretq_s32_s16(y)); x = vreinterpretq_s16_s32(t.val[0]); y = vreinterpretq_s16_s32(t.val[1]); }
2821
#define dct_trn64(x, y) { int16x8_t x0 = x; int16x8_t y0 = y; x = vcombine_s16(vget_low_s16(x0), vget_low_s16(y0)); y = vcombine_s16(vget_high_s16(x0), vget_high_s16(y0)); }
2822
2823
      // pass 1
2824
      dct_trn16(row0, row1); // a0b0a2b2a4b4a6b6
2825
      dct_trn16(row2, row3);
2826
      dct_trn16(row4, row5);
2827
      dct_trn16(row6, row7);
2828
2829
      // pass 2
2830
      dct_trn32(row0, row2); // a0b0c0d0a4b4c4d4
2831
      dct_trn32(row1, row3);
2832
      dct_trn32(row4, row6);
2833
      dct_trn32(row5, row7);
2834
2835
      // pass 3
2836
      dct_trn64(row0, row4); // a0b0c0d0e0f0g0h0
2837
      dct_trn64(row1, row5);
2838
      dct_trn64(row2, row6);
2839
      dct_trn64(row3, row7);
2840
2841
#undef dct_trn16
2842
#undef dct_trn32
2843
#undef dct_trn64
2844
   }
2845
2846
   // row pass
2847
   // vrshrn_n_s32 only supports shifts up to 16, we need
2848
   // 17. so do a non-rounding shift of 16 first then follow
2849
   // up with a rounding shift by 1.
2850
   dct_pass(vshrn_n_s32, 16);
2851
2852
   {
2853
      // pack and round
2854
      uint8x8_t p0 = vqrshrun_n_s16(row0, 1);
2855
      uint8x8_t p1 = vqrshrun_n_s16(row1, 1);
2856
      uint8x8_t p2 = vqrshrun_n_s16(row2, 1);
2857
      uint8x8_t p3 = vqrshrun_n_s16(row3, 1);
2858
      uint8x8_t p4 = vqrshrun_n_s16(row4, 1);
2859
      uint8x8_t p5 = vqrshrun_n_s16(row5, 1);
2860
      uint8x8_t p6 = vqrshrun_n_s16(row6, 1);
2861
      uint8x8_t p7 = vqrshrun_n_s16(row7, 1);
2862
2863
      // again, these can translate into one instruction, but often don't.
2864
#define dct_trn8_8(x, y) { uint8x8x2_t t = vtrn_u8(x, y); x = t.val[0]; y = t.val[1]; }
2865
#define dct_trn8_16(x, y) { uint16x4x2_t t = vtrn_u16(vreinterpret_u16_u8(x), vreinterpret_u16_u8(y)); x = vreinterpret_u8_u16(t.val[0]); y = vreinterpret_u8_u16(t.val[1]); }
2866
#define dct_trn8_32(x, y) { uint32x2x2_t t = vtrn_u32(vreinterpret_u32_u8(x), vreinterpret_u32_u8(y)); x = vreinterpret_u8_u32(t.val[0]); y = vreinterpret_u8_u32(t.val[1]); }
2867
2868
      // sadly can't use interleaved stores here since we only write
2869
      // 8 bytes to each scan line!
2870
2871
      // 8x8 8-bit transpose pass 1
2872
      dct_trn8_8(p0, p1);
2873
      dct_trn8_8(p2, p3);
2874
      dct_trn8_8(p4, p5);
2875
      dct_trn8_8(p6, p7);
2876
2877
      // pass 2
2878
      dct_trn8_16(p0, p2);
2879
      dct_trn8_16(p1, p3);
2880
      dct_trn8_16(p4, p6);
2881
      dct_trn8_16(p5, p7);
2882
2883
      // pass 3
2884
      dct_trn8_32(p0, p4);
2885
      dct_trn8_32(p1, p5);
2886
      dct_trn8_32(p2, p6);
2887
      dct_trn8_32(p3, p7);
2888
2889
      // store
2890
      vst1_u8(out, p0); out += out_stride;
2891
      vst1_u8(out, p1); out += out_stride;
2892
      vst1_u8(out, p2); out += out_stride;
2893
      vst1_u8(out, p3); out += out_stride;
2894
      vst1_u8(out, p4); out += out_stride;
2895
      vst1_u8(out, p5); out += out_stride;
2896
      vst1_u8(out, p6); out += out_stride;
2897
      vst1_u8(out, p7);
2898
2899
#undef dct_trn8_8
2900
#undef dct_trn8_16
2901
#undef dct_trn8_32
2902
   }
2903
2904
#undef dct_long_mul
2905
#undef dct_long_mac
2906
#undef dct_widen
2907
#undef dct_wadd
2908
#undef dct_wsub
2909
#undef dct_bfly32o
2910
#undef dct_pass
2911
}
2912
2913
#endif // STBI_NEON
2914
2915
143k
#define STBI__MARKER_none  0xff
2916
// if there's a pending marker from the entropy stream, return that
2917
// otherwise, fetch from the stream and get a marker. if there's no
2918
// marker, return 0xff, which is never a valid marker value
2919
static stbi_uc stbi__get_marker(stbi__jpeg *j)
2920
53.5k
{
2921
53.5k
   stbi_uc x;
2922
53.5k
   if (j->marker != STBI__MARKER_none) { x = j->marker; j->marker = STBI__MARKER_none; return x; }
2923
46.5k
   x = stbi__get8(j->s);
2924
46.5k
   if (x != 0xff) return STBI__MARKER_none;
2925
44.8k
   while (x == 0xff)
2926
22.8k
      x = stbi__get8(j->s); // consume repeated 0xff fill bytes
2927
21.9k
   return x;
2928
46.5k
}
2929
2930
// in each scan, we'll have scan_n components, and the order
2931
// of the components is specified by order[]
2932
13.6k
#define STBI__RESTART(x)     ((x) >= 0xd0 && (x) <= 0xd7)
2933
2934
// after a restart interval, stbi__jpeg_reset the entropy decoder and
2935
// the dc prediction
2936
static void stbi__jpeg_reset(stbi__jpeg *j)
2937
11.1k
{
2938
11.1k
   j->code_bits = 0;
2939
11.1k
   j->code_buffer = 0;
2940
11.1k
   j->nomore = 0;
2941
11.1k
   j->img_comp[0].dc_pred = j->img_comp[1].dc_pred = j->img_comp[2].dc_pred = j->img_comp[3].dc_pred = 0;
2942
11.1k
   j->marker = STBI__MARKER_none;
2943
11.1k
   j->todo = j->restart_interval ? j->restart_interval : 0x7fffffff;
2944
11.1k
   j->eob_run = 0;
2945
   // no more than 1<<31 MCUs if no restart_interal? that's plenty safe,
2946
   // since we don't even allow 1<<30 pixels
2947
11.1k
}
2948
2949
static int stbi__parse_entropy_coded_data(stbi__jpeg *z)
2950
8.04k
{
2951
8.04k
   stbi__jpeg_reset(z);
2952
8.04k
   if (!z->progressive) {
2953
2.66k
      if (z->scan_n == 1) {
2954
2.07k
         int i,j;
2955
2.07k
         STBI_SIMD_ALIGN(short, data[64]);
2956
2.07k
         int n = z->order[0];
2957
         // non-interleaved data, we just need to process one block at a time,
2958
         // in trivial scanline order
2959
         // number of blocks to do just depends on how many actual "pixels" this
2960
         // component has, independent of interleaved MCU blocking and such
2961
2.07k
         int w = (z->img_comp[n].x+7) >> 3;
2962
2.07k
         int h = (z->img_comp[n].y+7) >> 3;
2963
853k
         for (j=0; j < h; ++j) {
2964
86.8M
            for (i=0; i < w; ++i) {
2965
86.0M
               int ha = z->img_comp[n].ha;
2966
86.0M
               if (!stbi__jpeg_decode_block(z, data, z->huff_dc+z->img_comp[n].hd, z->huff_ac+ha, z->fast_ac[ha], n, z->dequant[z->img_comp[n].tq])) return 0;
2967
86.0M
               z->idct_block_kernel(z->img_comp[n].data+z->img_comp[n].w2*j*8+i*8, z->img_comp[n].w2, data);
2968
               // every data block is an MCU, so countdown the restart interval
2969
86.0M
               if (--z->todo <= 0) {
2970
1.27k
                  if (z->code_bits < 24) stbi__grow_buffer_unsafe(z);
2971
                  // if it's NOT a restart, then just bail, so we get corrupt data
2972
                  // rather than no data
2973
1.27k
                  if (!STBI__RESTART(z->marker)) return 1;
2974
462
                  stbi__jpeg_reset(z);
2975
462
               }
2976
86.0M
            }
2977
852k
         }
2978
1.19k
         return 1;
2979
2.07k
      } else { // interleaved
2980
597
         int i,j,k,x,y;
2981
597
         STBI_SIMD_ALIGN(short, data[64]);
2982
34.9k
         for (j=0; j < z->img_mcu_y; ++j) {
2983
3.25M
            for (i=0; i < z->img_mcu_x; ++i) {
2984
               // scan an interleaved mcu... process scan_n components in order
2985
12.8M
               for (k=0; k < z->scan_n; ++k) {
2986
9.59M
                  int n = z->order[k];
2987
                  // scan out an mcu's worth of this component; that's just determined
2988
                  // by the basic H and V specified for the component
2989
24.0M
                  for (y=0; y < z->img_comp[n].v; ++y) {
2990
41.2M
                     for (x=0; x < z->img_comp[n].h; ++x) {
2991
26.7M
                        int x2 = (i*z->img_comp[n].h + x)*8;
2992
26.7M
                        int y2 = (j*z->img_comp[n].v + y)*8;
2993
26.7M
                        int ha = z->img_comp[n].ha;
2994
26.7M
                        if (!stbi__jpeg_decode_block(z, data, z->huff_dc+z->img_comp[n].hd, z->huff_ac+ha, z->fast_ac[ha], n, z->dequant[z->img_comp[n].tq])) return 0;
2995
26.7M
                        z->idct_block_kernel(z->img_comp[n].data+z->img_comp[n].w2*y2+x2, z->img_comp[n].w2, data);
2996
26.7M
                     }
2997
14.4M
                  }
2998
9.59M
               }
2999
               // after all interleaved components, that's an interleaved MCU,
3000
               // so now count down the restart interval
3001
3.21M
               if (--z->todo <= 0) {
3002
1.01k
                  if (z->code_bits < 24) stbi__grow_buffer_unsafe(z);
3003
1.01k
                  if (!STBI__RESTART(z->marker)) return 1;
3004
676
                  stbi__jpeg_reset(z);
3005
676
               }
3006
3.21M
            }
3007
34.7k
         }
3008
233
         return 1;
3009
597
      }
3010
5.37k
   } else {
3011
5.37k
      if (z->scan_n == 1) {
3012
4.67k
         int i,j;
3013
4.67k
         int n = z->order[0];
3014
         // non-interleaved data, we just need to process one block at a time,
3015
         // in trivial scanline order
3016
         // number of blocks to do just depends on how many actual "pixels" this
3017
         // component has, independent of interleaved MCU blocking and such
3018
4.67k
         int w = (z->img_comp[n].x+7) >> 3;
3019
4.67k
         int h = (z->img_comp[n].y+7) >> 3;
3020
1.01M
         for (j=0; j < h; ++j) {
3021
229M
            for (i=0; i < w; ++i) {
3022
228M
               short *data = z->img_comp[n].coeff + 64 * (i + j * z->img_comp[n].coeff_w);
3023
228M
               if (z->spec_start == 0) {
3024
11.5M
                  if (!stbi__jpeg_decode_block_prog_dc(z, data, &z->huff_dc[z->img_comp[n].hd], n))
3025
83
                     return 0;
3026
216M
               } else {
3027
216M
                  int ha = z->img_comp[n].ha;
3028
216M
                  if (!stbi__jpeg_decode_block_prog_ac(z, data, &z->huff_ac[ha], z->fast_ac[ha]))
3029
45
                     return 0;
3030
216M
               }
3031
               // every data block is an MCU, so countdown the restart interval
3032
228M
               if (--z->todo <= 0) {
3033
2.32k
                  if (z->code_bits < 24) stbi__grow_buffer_unsafe(z);
3034
2.32k
                  if (!STBI__RESTART(z->marker)) return 1;
3035
1.13k
                  stbi__jpeg_reset(z);
3036
1.13k
               }
3037
228M
            }
3038
1.00M
         }
3039
3.35k
         return 1;
3040
4.67k
      } else { // interleaved
3041
700
         int i,j,k,x,y;
3042
13.6k
         for (j=0; j < z->img_mcu_y; ++j) {
3043
2.59M
            for (i=0; i < z->img_mcu_x; ++i) {
3044
               // scan an interleaved mcu... process scan_n components in order
3045
10.1M
               for (k=0; k < z->scan_n; ++k) {
3046
7.55M
                  int n = z->order[k];
3047
                  // scan out an mcu's worth of this component; that's just determined
3048
                  // by the basic H and V specified for the component
3049
23.4M
                  for (y=0; y < z->img_comp[n].v; ++y) {
3050
35.3M
                     for (x=0; x < z->img_comp[n].h; ++x) {
3051
19.4M
                        int x2 = (i*z->img_comp[n].h + x);
3052
19.4M
                        int y2 = (j*z->img_comp[n].v + y);
3053
19.4M
                        short *data = z->img_comp[n].coeff + 64 * (x2 + y2 * z->img_comp[n].coeff_w);
3054
19.4M
                        if (!stbi__jpeg_decode_block_prog_dc(z, data, &z->huff_dc[z->img_comp[n].hd], n))
3055
5
                           return 0;
3056
19.4M
                     }
3057
15.9M
                  }
3058
7.55M
               }
3059
               // after all interleaved components, that's an interleaved MCU,
3060
               // so now count down the restart interval
3061
2.57M
               if (--z->todo <= 0) {
3062
1.23k
                  if (z->code_bits < 24) stbi__grow_buffer_unsafe(z);
3063
1.23k
                  if (!STBI__RESTART(z->marker)) return 1;
3064
872
                  stbi__jpeg_reset(z);
3065
872
               }
3066
2.57M
            }
3067
13.2k
         }
3068
334
         return 1;
3069
700
      }
3070
5.37k
   }
3071
8.04k
}
3072
3073
static void stbi__jpeg_dequantize(short *data, stbi__uint16 *dequant)
3074
2.30M
{
3075
2.30M
   int i;
3076
149M
   for (i=0; i < 64; ++i)
3077
147M
      data[i] *= dequant[i];
3078
2.30M
}
3079
3080
static void stbi__jpeg_finish(stbi__jpeg *z)
3081
41
{
3082
41
   if (z->progressive) {
3083
      // dequantize and idct the data
3084
41
      int i,j,n;
3085
107
      for (n=0; n < z->s->img_n; ++n) {
3086
66
         int w = (z->img_comp[n].x+7) >> 3;
3087
66
         int h = (z->img_comp[n].y+7) >> 3;
3088
118k
         for (j=0; j < h; ++j) {
3089
2.42M
            for (i=0; i < w; ++i) {
3090
2.30M
               short *data = z->img_comp[n].coeff + 64 * (i + j * z->img_comp[n].coeff_w);
3091
2.30M
               stbi__jpeg_dequantize(data, z->dequant[z->img_comp[n].tq]);
3092
2.30M
               z->idct_block_kernel(z->img_comp[n].data+z->img_comp[n].w2*j*8+i*8, z->img_comp[n].w2, data);
3093
2.30M
            }
3094
118k
         }
3095
66
      }
3096
41
   }
3097
41
}
3098
3099
static int stbi__process_marker(stbi__jpeg *z, int m)
3100
11.5k
{
3101
11.5k
   int L;
3102
11.5k
   switch (m) {
3103
1.17k
      case STBI__MARKER_none: // no marker found
3104
1.17k
         return stbi__err("expected marker","Corrupt JPEG");
3105
3106
1.18k
      case 0xDD: // DRI - specify restart interval
3107
1.18k
         if (stbi__get16be(z->s) != 4) return stbi__err("bad DRI len","Corrupt JPEG");
3108
1.16k
         z->restart_interval = stbi__get16be(z->s);
3109
1.16k
         return 1;
3110
3111
953
      case 0xDB: // DQT - define quantization table
3112
953
         L = stbi__get16be(z->s)-2;
3113
14.6k
         while (L > 0) {
3114
13.7k
            int q = stbi__get8(z->s);
3115
13.7k
            int p = q >> 4, sixteen = (p != 0);
3116
13.7k
            int t = q & 15,i;
3117
13.7k
            if (p != 0 && p != 1) return stbi__err("bad DQT type","Corrupt JPEG");
3118
13.7k
            if (t > 3) return stbi__err("bad DQT table","Corrupt JPEG");
3119
3120
892k
            for (i=0; i < 64; ++i)
3121
879k
               z->dequant[t][stbi__jpeg_dezigzag[i]] = (stbi__uint16)(sixteen ? stbi__get16be(z->s) : stbi__get8(z->s));
3122
13.7k
            L -= (sixteen ? 129 : 65);
3123
13.7k
         }
3124
944
         return L==0;
3125
3126
4.37k
      case 0xC4: // DHT - define huffman table
3127
4.37k
         L = stbi__get16be(z->s)-2;
3128
115k
         while (L > 0) {
3129
111k
            stbi_uc *v;
3130
111k
            int sizes[16],i,n=0;
3131
111k
            int q = stbi__get8(z->s);
3132
111k
            int tc = q >> 4;
3133
111k
            int th = q & 15;
3134
111k
            if (tc > 1 || th > 3) return stbi__err("bad DHT header","Corrupt JPEG");
3135
1.89M
            for (i=0; i < 16; ++i) {
3136
1.78M
               sizes[i] = stbi__get8(z->s);
3137
1.78M
               n += sizes[i];
3138
1.78M
            }
3139
111k
            if(n > 256) return stbi__err("bad DHT header","Corrupt JPEG"); // Loop over i < n would write past end of values!
3140
111k
            L -= 17;
3141
111k
            if (tc == 0) {
3142
108k
               if (!stbi__build_huffman(z->huff_dc+th, sizes)) return 0;
3143
108k
               v = z->huff_dc[th].values;
3144
108k
            } else {
3145
3.18k
               if (!stbi__build_huffman(z->huff_ac+th, sizes)) return 0;
3146
3.17k
               v = z->huff_ac[th].values;
3147
3.17k
            }
3148
136k
            for (i=0; i < n; ++i)
3149
24.8k
               v[i] = stbi__get8(z->s);
3150
111k
            if (tc != 0)
3151
3.17k
               stbi__build_fast_ac(z->fast_ac[th], z->huff_ac + th);
3152
111k
            L -= n;
3153
111k
         }
3154
4.32k
         return L==0;
3155
11.5k
   }
3156
3157
   // check for comment block or APP blocks
3158
3.88k
   if ((m >= 0xE0 && m <= 0xEF) || m == 0xFE) {
3159
3.69k
      L = stbi__get16be(z->s);
3160
3.69k
      if (L < 2) {
3161
18
         if (m == 0xFE)
3162
5
            return stbi__err("bad COM len","Corrupt JPEG");
3163
13
         else
3164
13
            return stbi__err("bad APP len","Corrupt JPEG");
3165
18
      }
3166
3.67k
      L -= 2;
3167
3168
3.67k
      if (m == 0xE0 && L >= 5) { // JFIF APP0 segment
3169
1.14k
         static const unsigned char tag[5] = {'J','F','I','F','\0'};
3170
1.14k
         int ok = 1;
3171
1.14k
         int i;
3172
6.89k
         for (i=0; i < 5; ++i)
3173
5.74k
            if (stbi__get8(z->s) != tag[i])
3174
4.37k
               ok = 0;
3175
1.14k
         L -= 5;
3176
1.14k
         if (ok)
3177
232
            z->jfif = 1;
3178
2.53k
      } else if (m == 0xEE && L >= 12) { // Adobe APP14 segment
3179
1.10k
         static const unsigned char tag[6] = {'A','d','o','b','e','\0'};
3180
1.10k
         int ok = 1;
3181
1.10k
         int i;
3182
7.72k
         for (i=0; i < 6; ++i)
3183
6.61k
            if (stbi__get8(z->s) != tag[i])
3184
1.74k
               ok = 0;
3185
1.10k
         L -= 6;
3186
1.10k
         if (ok) {
3187
708
            stbi__get8(z->s); // version
3188
708
            stbi__get16be(z->s); // flags0
3189
708
            stbi__get16be(z->s); // flags1
3190
708
            z->app14_color_transform = stbi__get8(z->s); // color transform
3191
708
            L -= 6;
3192
708
         }
3193
1.10k
      }
3194
3195
3.67k
      stbi__skip(z->s, L);
3196
3.67k
      return 1;
3197
3.69k
   }
3198
3199
183
   return stbi__err("unknown marker","Corrupt JPEG");
3200
3.88k
}
3201
3202
// after we see SOS
3203
static int stbi__process_scan_header(stbi__jpeg *z)
3204
8.19k
{
3205
8.19k
   int i;
3206
8.19k
   int Ls = stbi__get16be(z->s);
3207
8.19k
   z->scan_n = stbi__get8(z->s);
3208
8.19k
   if (z->scan_n < 1 || z->scan_n > 4 || z->scan_n > (int) z->s->img_n) return stbi__err("bad SOS component count","Corrupt JPEG");
3209
8.14k
   if (Ls != 6+2*z->scan_n) return stbi__err("bad SOS len","Corrupt JPEG");
3210
18.7k
   for (i=0; i < z->scan_n; ++i) {
3211
10.6k
      int id = stbi__get8(z->s), which;
3212
10.6k
      int q = stbi__get8(z->s);
3213
18.6k
      for (which = 0; which < z->s->img_n; ++which)
3214
18.6k
         if (z->img_comp[which].id == id)
3215
10.6k
            break;
3216
10.6k
      if (which == z->s->img_n) return 0; // no match
3217
10.6k
      z->img_comp[which].hd = q >> 4;   if (z->img_comp[which].hd > 3) return stbi__err("bad DC huff","Corrupt JPEG");
3218
10.6k
      z->img_comp[which].ha = q & 15;   if (z->img_comp[which].ha > 3) return stbi__err("bad AC huff","Corrupt JPEG");
3219
10.6k
      z->order[i] = which;
3220
10.6k
   }
3221
3222
8.08k
   {
3223
8.08k
      int aa;
3224
8.08k
      z->spec_start = stbi__get8(z->s);
3225
8.08k
      z->spec_end   = stbi__get8(z->s); // should be 63, but might be 0
3226
8.08k
      aa = stbi__get8(z->s);
3227
8.08k
      z->succ_high = (aa >> 4);
3228
8.08k
      z->succ_low  = (aa & 15);
3229
8.08k
      if (z->progressive) {
3230
5.40k
         if (z->spec_start > 63 || z->spec_end > 63  || z->spec_start > z->spec_end || z->succ_high > 13 || z->succ_low > 13)
3231
23
            return stbi__err("bad SOS", "Corrupt JPEG");
3232
5.40k
      } else {
3233
2.68k
         if (z->spec_start != 0) return stbi__err("bad SOS","Corrupt JPEG");
3234
2.67k
         if (z->succ_high != 0 || z->succ_low != 0) return stbi__err("bad SOS","Corrupt JPEG");
3235
2.66k
         z->spec_end = 63;
3236
2.66k
      }
3237
8.08k
   }
3238
3239
8.04k
   return 1;
3240
8.08k
}
3241
3242
static int stbi__free_jpeg_components(stbi__jpeg *z, int ncomp, int why)
3243
1.82k
{
3244
1.82k
   int i;
3245
4.88k
   for (i=0; i < ncomp; ++i) {
3246
3.05k
      if (z->img_comp[i].raw_data) {
3247
3.00k
         STBI_FREE(z->img_comp[i].raw_data);
3248
3.00k
         z->img_comp[i].raw_data = NULL;
3249
3.00k
         z->img_comp[i].data = NULL;
3250
3.00k
      }
3251
3.05k
      if (z->img_comp[i].raw_coeff) {
3252
1.55k
         STBI_FREE(z->img_comp[i].raw_coeff);
3253
1.55k
         z->img_comp[i].raw_coeff = 0;
3254
1.55k
         z->img_comp[i].coeff = 0;
3255
1.55k
      }
3256
3.05k
      if (z->img_comp[i].linebuf) {
3257
2.38k
         STBI_FREE(z->img_comp[i].linebuf);
3258
2.38k
         z->img_comp[i].linebuf = NULL;
3259
2.38k
      }
3260
3.05k
   }
3261
1.82k
   return why;
3262
1.82k
}
3263
3264
static int stbi__process_frame_header(stbi__jpeg *z, int scan)
3265
3.78k
{
3266
3.78k
   stbi__context *s = z->s;
3267
3.78k
   int Lf,p,i,q, h_max=1,v_max=1,c;
3268
3.78k
   Lf = stbi__get16be(s);         if (Lf < 11) return stbi__err("bad SOF len","Corrupt JPEG"); // JPEG
3269
3.78k
   p  = stbi__get8(s);            if (p != 8) return stbi__err("only 8-bit","JPEG format not supported: 8-bit only"); // JPEG baseline
3270
3.75k
   s->img_y = stbi__get16be(s);   if (s->img_y == 0) return stbi__err("no header height", "JPEG format not supported: delayed height"); // Legal, but we don't handle it--but neither does IJG
3271
3.75k
   s->img_x = stbi__get16be(s);   if (s->img_x == 0) return stbi__err("0 width","Corrupt JPEG"); // JPEG requires
3272
3.74k
   if (s->img_y > STBI_MAX_DIMENSIONS) return stbi__err("too large","Very large image (corrupt?)");
3273
3.74k
   if (s->img_x > STBI_MAX_DIMENSIONS) return stbi__err("too large","Very large image (corrupt?)");
3274
3.74k
   c = stbi__get8(s);
3275
3.74k
   if (c != 3 && c != 1 && c != 4) return stbi__err("bad component count","Corrupt JPEG");
3276
3.72k
   s->img_n = c;
3277
9.97k
   for (i=0; i < c; ++i) {
3278
6.25k
      z->img_comp[i].data = NULL;
3279
6.25k
      z->img_comp[i].linebuf = NULL;
3280
6.25k
   }
3281
3282
3.72k
   if (Lf != 8+3*s->img_n) return stbi__err("bad SOF len","Corrupt JPEG");
3283
3284
3.69k
   z->rgb = 0;
3285
9.82k
   for (i=0; i < s->img_n; ++i) {
3286
6.16k
      static const unsigned char rgb[3] = { 'R', 'G', 'B' };
3287
6.16k
      z->img_comp[i].id = stbi__get8(s);
3288
6.16k
      if (s->img_n == 3 && z->img_comp[i].id == rgb[i])
3289
259
         ++z->rgb;
3290
6.16k
      q = stbi__get8(s);
3291
6.16k
      z->img_comp[i].h = (q >> 4);  if (!z->img_comp[i].h || z->img_comp[i].h > 4) return stbi__err("bad H","Corrupt JPEG");
3292
6.14k
      z->img_comp[i].v = q & 15;    if (!z->img_comp[i].v || z->img_comp[i].v > 4) return stbi__err("bad V","Corrupt JPEG");
3293
6.13k
      z->img_comp[i].tq = stbi__get8(s);  if (z->img_comp[i].tq > 3) return stbi__err("bad TQ","Corrupt JPEG");
3294
6.13k
   }
3295
3296
3.65k
   if (scan != STBI__SCAN_load) return 1;
3297
3298
1.82k
   if (!stbi__mad3sizes_valid(s->img_x, s->img_y, s->img_n, 0)) return stbi__err("too large", "Image too large to decode");
3299
3300
4.88k
   for (i=0; i < s->img_n; ++i) {
3301
3.05k
      if (z->img_comp[i].h > h_max) h_max = z->img_comp[i].h;
3302
3.05k
      if (z->img_comp[i].v > v_max) v_max = z->img_comp[i].v;
3303
3.05k
   }
3304
3305
   // check that plane subsampling factors are integer ratios; our resamplers can't deal with fractional ratios
3306
   // and I've never seen a non-corrupted JPEG file actually use them
3307
4.83k
   for (i=0; i < s->img_n; ++i) {
3308
3.02k
      if (h_max % z->img_comp[i].h != 0) return stbi__err("bad H","Corrupt JPEG");
3309
3.02k
      if (v_max % z->img_comp[i].v != 0) return stbi__err("bad V","Corrupt JPEG");
3310
3.02k
   }
3311
3312
   // compute interleaved mcu info
3313
1.81k
   z->img_h_max = h_max;
3314
1.81k
   z->img_v_max = v_max;
3315
1.81k
   z->img_mcu_w = h_max * 8;
3316
1.81k
   z->img_mcu_h = v_max * 8;
3317
   // these sizes can't be more than 17 bits
3318
1.81k
   z->img_mcu_x = (s->img_x + z->img_mcu_w-1) / z->img_mcu_w;
3319
1.81k
   z->img_mcu_y = (s->img_y + z->img_mcu_h-1) / z->img_mcu_h;
3320
3321
4.81k
   for (i=0; i < s->img_n; ++i) {
3322
      // number of effective pixels (e.g. for non-interleaved MCU)
3323
3.00k
      z->img_comp[i].x = (s->img_x * z->img_comp[i].h + h_max-1) / h_max;
3324
3.00k
      z->img_comp[i].y = (s->img_y * z->img_comp[i].v + v_max-1) / v_max;
3325
      // to simplify generation, we'll allocate enough memory to decode
3326
      // the bogus oversized data from using interleaved MCUs and their
3327
      // big blocks (e.g. a 16x16 iMCU on an image of width 33); we won't
3328
      // discard the extra data until colorspace conversion
3329
      //
3330
      // img_mcu_x, img_mcu_y: <=17 bits; comp[i].h and .v are <=4 (checked earlier)
3331
      // so these muls can't overflow with 32-bit ints (which we require)
3332
3.00k
      z->img_comp[i].w2 = z->img_mcu_x * z->img_comp[i].h * 8;
3333
3.00k
      z->img_comp[i].h2 = z->img_mcu_y * z->img_comp[i].v * 8;
3334
3.00k
      z->img_comp[i].coeff = 0;
3335
3.00k
      z->img_comp[i].raw_coeff = 0;
3336
3.00k
      z->img_comp[i].linebuf = NULL;
3337
3.00k
      z->img_comp[i].raw_data = stbi__malloc_mad2(z->img_comp[i].w2, z->img_comp[i].h2, 15);
3338
3.00k
      if (z->img_comp[i].raw_data == NULL)
3339
0
         return stbi__free_jpeg_components(z, i+1, stbi__err("outofmem", "Out of memory"));
3340
      // align blocks for idct using mmx/sse
3341
3.00k
      z->img_comp[i].data = (stbi_uc*) (((size_t) z->img_comp[i].raw_data + 15) & ~15);
3342
3.00k
      if (z->progressive) {
3343
         // w2, h2 are multiples of 8 (see above)
3344
1.55k
         z->img_comp[i].coeff_w = z->img_comp[i].w2 / 8;
3345
1.55k
         z->img_comp[i].coeff_h = z->img_comp[i].h2 / 8;
3346
1.55k
         z->img_comp[i].raw_coeff = stbi__malloc_mad3(z->img_comp[i].w2, z->img_comp[i].h2, sizeof(short), 15);
3347
1.55k
         if (z->img_comp[i].raw_coeff == NULL)
3348
0
            return stbi__free_jpeg_components(z, i+1, stbi__err("outofmem", "Out of memory"));
3349
1.55k
         z->img_comp[i].coeff = (short*) (((size_t) z->img_comp[i].raw_coeff + 15) & ~15);
3350
1.55k
      }
3351
3.00k
   }
3352
3353
1.81k
   return 1;
3354
1.81k
}
3355
3356
// use comparisons since in some cases we handle more than one case (e.g. SOF)
3357
3.86k
#define stbi__DNL(x)         ((x) == 0xdc)
3358
13.5k
#define stbi__SOI(x)         ((x) == 0xd8)
3359
12.1k
#define stbi__EOI(x)         ((x) == 0xd9)
3360
11.7k
#define stbi__SOF(x)         ((x) == 0xc0 || (x) == 0xc1 || (x) == 0xc2)
3361
12.0k
#define stbi__SOS(x)         ((x) == 0xda)
3362
3363
3.78k
#define stbi__SOF_progressive(x)   ((x) == 0xc2)
3364
3365
static int stbi__decode_jpeg_header(stbi__jpeg *z, int scan)
3366
13.5k
{
3367
13.5k
   int m;
3368
13.5k
   z->jfif = 0;
3369
13.5k
   z->app14_color_transform = -1; // valid values are 0,1,2
3370
13.5k
   z->marker = STBI__MARKER_none; // initialize cached marker to empty
3371
13.5k
   m = stbi__get_marker(z);
3372
13.5k
   if (!stbi__SOI(m)) return stbi__err("no SOI","Corrupt JPEG");
3373
6.03k
   if (scan == STBI__SCAN_type) return 1;
3374
4.20k
   m = stbi__get_marker(z);
3375
11.7k
   while (!stbi__SOF(m)) {
3376
7.94k
      if (!stbi__process_marker(z,m)) return 0;
3377
7.67k
      m = stbi__get_marker(z);
3378
23.5k
      while (m == STBI__MARKER_none) {
3379
         // some files have extra padding after their blocks, so ok, we'll scan
3380
16.0k
         if (stbi__at_eof(z->s)) return stbi__err("no SOF", "Corrupt JPEG");
3381
15.8k
         m = stbi__get_marker(z);
3382
15.8k
      }
3383
7.67k
   }
3384
3.78k
   z->progressive = stbi__SOF_progressive(m);
3385
3.78k
   if (!stbi__process_frame_header(z, scan)) return 0;
3386
3.63k
   return 1;
3387
3.78k
}
3388
3389
static stbi_uc stbi__skip_jpeg_junk_at_end(stbi__jpeg *j)
3390
1.86k
{
3391
   // some JPEGs have junk at end, skip over it but if we find what looks
3392
   // like a valid marker, resume there
3393
6.50k
   while (!stbi__at_eof(j->s)) {
3394
5.71k
      stbi_uc x = stbi__get8(j->s);
3395
6.30k
      while (x == 0xff) { // might be a marker
3396
1.65k
         if (stbi__at_eof(j->s)) return STBI__MARKER_none;
3397
1.63k
         x = stbi__get8(j->s);
3398
1.63k
         if (x != 0x00 && x != 0xff) {
3399
            // not a stuffed zero or lead-in to another marker, looks
3400
            // like an actual marker, return it
3401
1.05k
            return x;
3402
1.05k
         }
3403
         // stuffed zero has x=0 now which ends the loop, meaning we go
3404
         // back to regular scan loop.
3405
         // repeated 0xff keeps trying to read the next byte of the marker.
3406
1.63k
      }
3407
5.71k
   }
3408
795
   return STBI__MARKER_none;
3409
1.86k
}
3410
3411
// decode image to YCbCr format
3412
static int stbi__decode_jpeg_image(stbi__jpeg *j)
3413
1.82k
{
3414
1.82k
   int m;
3415
9.13k
   for (m = 0; m < 4; m++) {
3416
7.30k
      j->img_comp[m].raw_data = NULL;
3417
7.30k
      j->img_comp[m].raw_coeff = NULL;
3418
7.30k
   }
3419
1.82k
   j->restart_interval = 0;
3420
1.82k
   if (!stbi__decode_jpeg_header(j, STBI__SCAN_load)) return 0;
3421
1.81k
   m = stbi__get_marker(j);
3422
12.1k
   while (!stbi__EOI(m)) {
3423
12.0k
      if (stbi__SOS(m)) {
3424
8.19k
         if (!stbi__process_scan_header(j)) return 0;
3425
8.04k
         if (!stbi__parse_entropy_coded_data(j)) return 0;
3426
7.82k
         if (j->marker == STBI__MARKER_none ) {
3427
1.86k
         j->marker = stbi__skip_jpeg_junk_at_end(j);
3428
            // if we reach eof without hitting a marker, stbi__get_marker() below will fail and we'll eventually return 0
3429
1.86k
         }
3430
7.82k
         m = stbi__get_marker(j);
3431
7.82k
         if (STBI__RESTART(m))
3432
199
            m = stbi__get_marker(j);
3433
7.82k
      } else if (stbi__DNL(m)) {
3434
245
         int Ld = stbi__get16be(j->s);
3435
245
         stbi__uint32 NL = stbi__get16be(j->s);
3436
245
         if (Ld != 4) return stbi__err("bad DNL len", "Corrupt JPEG");
3437
222
         if (NL != j->s->img_y) return stbi__err("bad DNL height", "Corrupt JPEG");
3438
195
         m = stbi__get_marker(j);
3439
3.61k
      } else {
3440
3.61k
         if (!stbi__process_marker(j, m)) return 1;
3441
2.27k
         m = stbi__get_marker(j);
3442
2.27k
      }
3443
12.0k
   }
3444
44
   if (j->progressive)
3445
41
      stbi__jpeg_finish(j);
3446
44
   return 1;
3447
1.81k
}
3448
3449
// static jfif-centered resampling (across block boundaries)
3450
3451
typedef stbi_uc *(*resample_row_func)(stbi_uc *out, stbi_uc *in0, stbi_uc *in1,
3452
                                    int w, int hs);
3453
3454
818M
#define stbi__div4(x) ((stbi_uc) ((x) >> 2))
3455
3456
static stbi_uc *resample_row_1(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs)
3457
11.2M
{
3458
11.2M
   STBI_NOTUSED(out);
3459
11.2M
   STBI_NOTUSED(in_far);
3460
11.2M
   STBI_NOTUSED(w);
3461
11.2M
   STBI_NOTUSED(hs);
3462
11.2M
   return in_near;
3463
11.2M
}
3464
3465
static stbi_uc* stbi__resample_row_v_2(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs)
3466
960k
{
3467
   // need to generate two samples vertically for every one in input
3468
960k
   int i;
3469
960k
   STBI_NOTUSED(hs);
3470
367M
   for (i=0; i < w; ++i)
3471
366M
      out[i] = stbi__div4(3*in_near[i] + in_far[i] + 2);
3472
960k
   return out;
3473
960k
}
3474
3475
static stbi_uc*  stbi__resample_row_h_2(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs)
3476
1.07M
{
3477
   // need to generate two samples horizontally for every one in input
3478
1.07M
   int i;
3479
1.07M
   stbi_uc *input = in_near;
3480
3481
1.07M
   if (w == 1) {
3482
      // if only one sample, can't do any interpolation
3483
150k
      out[0] = out[1] = input[0];
3484
150k
      return out;
3485
150k
   }
3486
3487
922k
   out[0] = input[0];
3488
922k
   out[1] = stbi__div4(input[0]*3 + input[1] + 2);
3489
225M
   for (i=1; i < w-1; ++i) {
3490
224M
      int n = 3*input[i]+2;
3491
224M
      out[i*2+0] = stbi__div4(n+input[i-1]);
3492
224M
      out[i*2+1] = stbi__div4(n+input[i+1]);
3493
224M
   }
3494
922k
   out[i*2+0] = stbi__div4(input[w-2]*3 + input[w-1] + 2);
3495
922k
   out[i*2+1] = input[w-1];
3496
3497
922k
   STBI_NOTUSED(in_far);
3498
922k
   STBI_NOTUSED(hs);
3499
3500
922k
   return out;
3501
1.07M
}
3502
3503
9.60M
#define stbi__div16(x) ((stbi_uc) ((x) >> 4))
3504
3505
static stbi_uc *stbi__resample_row_hv_2(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs)
3506
0
{
3507
   // need to generate 2x2 samples for every one in input
3508
0
   int i,t0,t1;
3509
0
   if (w == 1) {
3510
0
      out[0] = out[1] = stbi__div4(3*in_near[0] + in_far[0] + 2);
3511
0
      return out;
3512
0
   }
3513
3514
0
   t1 = 3*in_near[0] + in_far[0];
3515
0
   out[0] = stbi__div4(t1+2);
3516
0
   for (i=1; i < w; ++i) {
3517
0
      t0 = t1;
3518
0
      t1 = 3*in_near[i]+in_far[i];
3519
0
      out[i*2-1] = stbi__div16(3*t0 + t1 + 8);
3520
0
      out[i*2  ] = stbi__div16(3*t1 + t0 + 8);
3521
0
   }
3522
0
   out[w*2-1] = stbi__div4(t1+2);
3523
3524
0
   STBI_NOTUSED(hs);
3525
3526
0
   return out;
3527
0
}
3528
3529
#if defined(STBI_SSE2) || defined(STBI_NEON)
3530
static stbi_uc *stbi__resample_row_hv_2_simd(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs)
3531
1.48M
{
3532
   // need to generate 2x2 samples for every one in input
3533
1.48M
   int i=0,t0,t1;
3534
3535
1.48M
   if (w == 1) {
3536
158k
      out[0] = out[1] = stbi__div4(3*in_near[0] + in_far[0] + 2);
3537
158k
      return out;
3538
158k
   }
3539
3540
1.32M
   t1 = 3*in_near[0] + in_far[0];
3541
   // process groups of 8 pixels for as long as we can.
3542
   // note we can't handle the last pixel in a row in this loop
3543
   // because we need to handle the filter boundary conditions.
3544
67.5M
   for (; i < ((w-1) & ~7); i += 8) {
3545
66.1M
#if defined(STBI_SSE2)
3546
      // load and perform the vertical filtering pass
3547
      // this uses 3*x + y = 4*x + (y - x)
3548
66.1M
      __m128i zero  = _mm_setzero_si128();
3549
66.1M
      __m128i farb  = _mm_loadl_epi64((__m128i *) (in_far + i));
3550
66.1M
      __m128i nearb = _mm_loadl_epi64((__m128i *) (in_near + i));
3551
66.1M
      __m128i farw  = _mm_unpacklo_epi8(farb, zero);
3552
66.1M
      __m128i nearw = _mm_unpacklo_epi8(nearb, zero);
3553
66.1M
      __m128i diff  = _mm_sub_epi16(farw, nearw);
3554
66.1M
      __m128i nears = _mm_slli_epi16(nearw, 2);
3555
66.1M
      __m128i curr  = _mm_add_epi16(nears, diff); // current row
3556
3557
      // horizontal filter works the same based on shifted vers of current
3558
      // row. "prev" is current row shifted right by 1 pixel; we need to
3559
      // insert the previous pixel value (from t1).
3560
      // "next" is current row shifted left by 1 pixel, with first pixel
3561
      // of next block of 8 pixels added in.
3562
66.1M
      __m128i prv0 = _mm_slli_si128(curr, 2);
3563
66.1M
      __m128i nxt0 = _mm_srli_si128(curr, 2);
3564
66.1M
      __m128i prev = _mm_insert_epi16(prv0, t1, 0);
3565
66.1M
      __m128i next = _mm_insert_epi16(nxt0, 3*in_near[i+8] + in_far[i+8], 7);
3566
3567
      // horizontal filter, polyphase implementation since it's convenient:
3568
      // even pixels = 3*cur + prev = cur*4 + (prev - cur)
3569
      // odd  pixels = 3*cur + next = cur*4 + (next - cur)
3570
      // note the shared term.
3571
66.1M
      __m128i bias  = _mm_set1_epi16(8);
3572
66.1M
      __m128i curs = _mm_slli_epi16(curr, 2);
3573
66.1M
      __m128i prvd = _mm_sub_epi16(prev, curr);
3574
66.1M
      __m128i nxtd = _mm_sub_epi16(next, curr);
3575
66.1M
      __m128i curb = _mm_add_epi16(curs, bias);
3576
66.1M
      __m128i even = _mm_add_epi16(prvd, curb);
3577
66.1M
      __m128i odd  = _mm_add_epi16(nxtd, curb);
3578
3579
      // interleave even and odd pixels, then undo scaling.
3580
66.1M
      __m128i int0 = _mm_unpacklo_epi16(even, odd);
3581
66.1M
      __m128i int1 = _mm_unpackhi_epi16(even, odd);
3582
66.1M
      __m128i de0  = _mm_srli_epi16(int0, 4);
3583
66.1M
      __m128i de1  = _mm_srli_epi16(int1, 4);
3584
3585
      // pack and write output
3586
66.1M
      __m128i outv = _mm_packus_epi16(de0, de1);
3587
66.1M
      _mm_storeu_si128((__m128i *) (out + i*2), outv);
3588
#elif defined(STBI_NEON)
3589
      // load and perform the vertical filtering pass
3590
      // this uses 3*x + y = 4*x + (y - x)
3591
      uint8x8_t farb  = vld1_u8(in_far + i);
3592
      uint8x8_t nearb = vld1_u8(in_near + i);
3593
      int16x8_t diff  = vreinterpretq_s16_u16(vsubl_u8(farb, nearb));
3594
      int16x8_t nears = vreinterpretq_s16_u16(vshll_n_u8(nearb, 2));
3595
      int16x8_t curr  = vaddq_s16(nears, diff); // current row
3596
3597
      // horizontal filter works the same based on shifted vers of current
3598
      // row. "prev" is current row shifted right by 1 pixel; we need to
3599
      // insert the previous pixel value (from t1).
3600
      // "next" is current row shifted left by 1 pixel, with first pixel
3601
      // of next block of 8 pixels added in.
3602
      int16x8_t prv0 = vextq_s16(curr, curr, 7);
3603
      int16x8_t nxt0 = vextq_s16(curr, curr, 1);
3604
      int16x8_t prev = vsetq_lane_s16(t1, prv0, 0);
3605
      int16x8_t next = vsetq_lane_s16(3*in_near[i+8] + in_far[i+8], nxt0, 7);
3606
3607
      // horizontal filter, polyphase implementation since it's convenient:
3608
      // even pixels = 3*cur + prev = cur*4 + (prev - cur)
3609
      // odd  pixels = 3*cur + next = cur*4 + (next - cur)
3610
      // note the shared term.
3611
      int16x8_t curs = vshlq_n_s16(curr, 2);
3612
      int16x8_t prvd = vsubq_s16(prev, curr);
3613
      int16x8_t nxtd = vsubq_s16(next, curr);
3614
      int16x8_t even = vaddq_s16(curs, prvd);
3615
      int16x8_t odd  = vaddq_s16(curs, nxtd);
3616
3617
      // undo scaling and round, then store with even/odd phases interleaved
3618
      uint8x8x2_t o;
3619
      o.val[0] = vqrshrun_n_s16(even, 4);
3620
      o.val[1] = vqrshrun_n_s16(odd,  4);
3621
      vst2_u8(out + i*2, o);
3622
#endif
3623
3624
      // "previous" value for next iter
3625
66.1M
      t1 = 3*in_near[i+7] + in_far[i+7];
3626
66.1M
   }
3627
3628
1.32M
   t0 = t1;
3629
1.32M
   t1 = 3*in_near[i] + in_far[i];
3630
1.32M
   out[i*2] = stbi__div16(3*t1 + t0 + 8);
3631
3632
5.46M
   for (++i; i < w; ++i) {
3633
4.13M
      t0 = t1;
3634
4.13M
      t1 = 3*in_near[i]+in_far[i];
3635
4.13M
      out[i*2-1] = stbi__div16(3*t0 + t1 + 8);
3636
4.13M
      out[i*2  ] = stbi__div16(3*t1 + t0 + 8);
3637
4.13M
   }
3638
1.32M
   out[w*2-1] = stbi__div4(t1+2);
3639
3640
1.32M
   STBI_NOTUSED(hs);
3641
3642
1.32M
   return out;
3643
1.48M
}
3644
#endif
3645
3646
static stbi_uc *stbi__resample_row_generic(stbi_uc *out, stbi_uc *in_near, stbi_uc *in_far, int w, int hs)
3647
2.16M
{
3648
   // resample with nearest-neighbor
3649
2.16M
   int i,j;
3650
2.16M
   STBI_NOTUSED(in_far);
3651
865M
   for (i=0; i < w; ++i)
3652
2.57G
      for (j=0; j < hs; ++j)
3653
1.71G
         out[i*hs+j] = in_near[i];
3654
2.16M
   return out;
3655
2.16M
}
3656
3657
// this is a reduced-precision calculation of YCbCr-to-RGB introduced
3658
// to make sure the code produces the same results in both SIMD and scalar
3659
23.9M
#define stbi__float2fixed(x)  (((int) ((x) * 4096.0f + 0.5f)) << 8)
3660
static void stbi__YCbCr_to_RGB_row(stbi_uc *out, const stbi_uc *y, const stbi_uc *pcb, const stbi_uc *pcr, int count, int step)
3661
0
{
3662
0
   int i;
3663
0
   for (i=0; i < count; ++i) {
3664
0
      int y_fixed = (y[i] << 20) + (1<<19); // rounding
3665
0
      int r,g,b;
3666
0
      int cr = pcr[i] - 128;
3667
0
      int cb = pcb[i] - 128;
3668
0
      r = y_fixed +  cr* stbi__float2fixed(1.40200f);
3669
0
      g = y_fixed + (cr*-stbi__float2fixed(0.71414f)) + ((cb*-stbi__float2fixed(0.34414f)) & 0xffff0000);
3670
0
      b = y_fixed                                     +   cb* stbi__float2fixed(1.77200f);
3671
0
      r >>= 20;
3672
0
      g >>= 20;
3673
0
      b >>= 20;
3674
0
      if ((unsigned) r > 255) { if (r < 0) r = 0; else r = 255; }
3675
0
      if ((unsigned) g > 255) { if (g < 0) g = 0; else g = 255; }
3676
0
      if ((unsigned) b > 255) { if (b < 0) b = 0; else b = 255; }
3677
0
      out[0] = (stbi_uc)r;
3678
0
      out[1] = (stbi_uc)g;
3679
0
      out[2] = (stbi_uc)b;
3680
0
      out[3] = 255;
3681
0
      out += step;
3682
0
   }
3683
0
}
3684
3685
#if defined(STBI_SSE2) || defined(STBI_NEON)
3686
static void stbi__YCbCr_to_RGB_simd(stbi_uc *out, stbi_uc const *y, stbi_uc const *pcb, stbi_uc const *pcr, int count, int step)
3687
2.16M
{
3688
2.16M
   int i = 0;
3689
3690
2.16M
#ifdef STBI_SSE2
3691
   // step == 3 is pretty ugly on the final interleave, and i'm not convinced
3692
   // it's useful in practice (you wouldn't use it for textures, for example).
3693
   // so just accelerate step == 4 case.
3694
2.16M
   if (step == 4) {
3695
      // this is a fairly straightforward implementation and not super-optimized.
3696
2.16M
      __m128i signflip  = _mm_set1_epi8(-0x80);
3697
2.16M
      __m128i cr_const0 = _mm_set1_epi16(   (short) ( 1.40200f*4096.0f+0.5f));
3698
2.16M
      __m128i cr_const1 = _mm_set1_epi16( - (short) ( 0.71414f*4096.0f+0.5f));
3699
2.16M
      __m128i cb_const0 = _mm_set1_epi16( - (short) ( 0.34414f*4096.0f+0.5f));
3700
2.16M
      __m128i cb_const1 = _mm_set1_epi16(   (short) ( 1.77200f*4096.0f+0.5f));
3701
2.16M
      __m128i y_bias = _mm_set1_epi8((char) (unsigned char) 128);
3702
2.16M
      __m128i xw = _mm_set1_epi16(255); // alpha channel
3703
3704
195M
      for (; i+7 < count; i += 8) {
3705
         // load
3706
193M
         __m128i y_bytes = _mm_loadl_epi64((__m128i *) (y+i));
3707
193M
         __m128i cr_bytes = _mm_loadl_epi64((__m128i *) (pcr+i));
3708
193M
         __m128i cb_bytes = _mm_loadl_epi64((__m128i *) (pcb+i));
3709
193M
         __m128i cr_biased = _mm_xor_si128(cr_bytes, signflip); // -128
3710
193M
         __m128i cb_biased = _mm_xor_si128(cb_bytes, signflip); // -128
3711
3712
         // unpack to short (and left-shift cr, cb by 8)
3713
193M
         __m128i yw  = _mm_unpacklo_epi8(y_bias, y_bytes);
3714
193M
         __m128i crw = _mm_unpacklo_epi8(_mm_setzero_si128(), cr_biased);
3715
193M
         __m128i cbw = _mm_unpacklo_epi8(_mm_setzero_si128(), cb_biased);
3716
3717
         // color transform
3718
193M
         __m128i yws = _mm_srli_epi16(yw, 4);
3719
193M
         __m128i cr0 = _mm_mulhi_epi16(cr_const0, crw);
3720
193M
         __m128i cb0 = _mm_mulhi_epi16(cb_const0, cbw);
3721
193M
         __m128i cb1 = _mm_mulhi_epi16(cbw, cb_const1);
3722
193M
         __m128i cr1 = _mm_mulhi_epi16(crw, cr_const1);
3723
193M
         __m128i rws = _mm_add_epi16(cr0, yws);
3724
193M
         __m128i gwt = _mm_add_epi16(cb0, yws);
3725
193M
         __m128i bws = _mm_add_epi16(yws, cb1);
3726
193M
         __m128i gws = _mm_add_epi16(gwt, cr1);
3727
3728
         // descale
3729
193M
         __m128i rw = _mm_srai_epi16(rws, 4);
3730
193M
         __m128i bw = _mm_srai_epi16(bws, 4);
3731
193M
         __m128i gw = _mm_srai_epi16(gws, 4);
3732
3733
         // back to byte, set up for transpose
3734
193M
         __m128i brb = _mm_packus_epi16(rw, bw);
3735
193M
         __m128i gxb = _mm_packus_epi16(gw, xw);
3736
3737
         // transpose to interleave channels
3738
193M
         __m128i t0 = _mm_unpacklo_epi8(brb, gxb);
3739
193M
         __m128i t1 = _mm_unpackhi_epi8(brb, gxb);
3740
193M
         __m128i o0 = _mm_unpacklo_epi16(t0, t1);
3741
193M
         __m128i o1 = _mm_unpackhi_epi16(t0, t1);
3742
3743
         // store
3744
193M
         _mm_storeu_si128((__m128i *) (out + 0), o0);
3745
193M
         _mm_storeu_si128((__m128i *) (out + 16), o1);
3746
193M
         out += 32;
3747
193M
      }
3748
2.16M
   }
3749
2.16M
#endif
3750
3751
#ifdef STBI_NEON
3752
   // in this version, step=3 support would be easy to add. but is there demand?
3753
   if (step == 4) {
3754
      // this is a fairly straightforward implementation and not super-optimized.
3755
      uint8x8_t signflip = vdup_n_u8(0x80);
3756
      int16x8_t cr_const0 = vdupq_n_s16(   (short) ( 1.40200f*4096.0f+0.5f));
3757
      int16x8_t cr_const1 = vdupq_n_s16( - (short) ( 0.71414f*4096.0f+0.5f));
3758
      int16x8_t cb_const0 = vdupq_n_s16( - (short) ( 0.34414f*4096.0f+0.5f));
3759
      int16x8_t cb_const1 = vdupq_n_s16(   (short) ( 1.77200f*4096.0f+0.5f));
3760
3761
      for (; i+7 < count; i += 8) {
3762
         // load
3763
         uint8x8_t y_bytes  = vld1_u8(y + i);
3764
         uint8x8_t cr_bytes = vld1_u8(pcr + i);
3765
         uint8x8_t cb_bytes = vld1_u8(pcb + i);
3766
         int8x8_t cr_biased = vreinterpret_s8_u8(vsub_u8(cr_bytes, signflip));
3767
         int8x8_t cb_biased = vreinterpret_s8_u8(vsub_u8(cb_bytes, signflip));
3768
3769
         // expand to s16
3770
         int16x8_t yws = vreinterpretq_s16_u16(vshll_n_u8(y_bytes, 4));
3771
         int16x8_t crw = vshll_n_s8(cr_biased, 7);
3772
         int16x8_t cbw = vshll_n_s8(cb_biased, 7);
3773
3774
         // color transform
3775
         int16x8_t cr0 = vqdmulhq_s16(crw, cr_const0);
3776
         int16x8_t cb0 = vqdmulhq_s16(cbw, cb_const0);
3777
         int16x8_t cr1 = vqdmulhq_s16(crw, cr_const1);
3778
         int16x8_t cb1 = vqdmulhq_s16(cbw, cb_const1);
3779
         int16x8_t rws = vaddq_s16(yws, cr0);
3780
         int16x8_t gws = vaddq_s16(vaddq_s16(yws, cb0), cr1);
3781
         int16x8_t bws = vaddq_s16(yws, cb1);
3782
3783
         // undo scaling, round, convert to byte
3784
         uint8x8x4_t o;
3785
         o.val[0] = vqrshrun_n_s16(rws, 4);
3786
         o.val[1] = vqrshrun_n_s16(gws, 4);
3787
         o.val[2] = vqrshrun_n_s16(bws, 4);
3788
         o.val[3] = vdup_n_u8(255);
3789
3790
         // store, interleaving r/g/b/a
3791
         vst4_u8(out, o);
3792
         out += 8*4;
3793
      }
3794
   }
3795
#endif
3796
3797
8.15M
   for (; i < count; ++i) {
3798
5.98M
      int y_fixed = (y[i] << 20) + (1<<19); // rounding
3799
5.98M
      int r,g,b;
3800
5.98M
      int cr = pcr[i] - 128;
3801
5.98M
      int cb = pcb[i] - 128;
3802
5.98M
      r = y_fixed + cr* stbi__float2fixed(1.40200f);
3803
5.98M
      g = y_fixed + cr*-stbi__float2fixed(0.71414f) + ((cb*-stbi__float2fixed(0.34414f)) & 0xffff0000);
3804
5.98M
      b = y_fixed                                   +   cb* stbi__float2fixed(1.77200f);
3805
5.98M
      r >>= 20;
3806
5.98M
      g >>= 20;
3807
5.98M
      b >>= 20;
3808
5.98M
      if ((unsigned) r > 255) { if (r < 0) r = 0; else r = 255; }
3809
5.98M
      if ((unsigned) g > 255) { if (g < 0) g = 0; else g = 255; }
3810
5.98M
      if ((unsigned) b > 255) { if (b < 0) b = 0; else b = 255; }
3811
5.98M
      out[0] = (stbi_uc)r;
3812
5.98M
      out[1] = (stbi_uc)g;
3813
5.98M
      out[2] = (stbi_uc)b;
3814
5.98M
      out[3] = 255;
3815
5.98M
      out += step;
3816
5.98M
   }
3817
2.16M
}
3818
#endif
3819
3820
// set up the kernels
3821
static void stbi__setup_jpeg(stbi__jpeg *j)
3822
4.75k
{
3823
4.75k
   j->idct_block_kernel = stbi__idct_block;
3824
4.75k
   j->YCbCr_to_RGB_kernel = stbi__YCbCr_to_RGB_row;
3825
4.75k
   j->resample_row_hv_2_kernel = stbi__resample_row_hv_2;
3826
3827
4.75k
#ifdef STBI_SSE2
3828
4.75k
   if (stbi__sse2_available()) {
3829
4.75k
      j->idct_block_kernel = stbi__idct_simd;
3830
4.75k
      j->YCbCr_to_RGB_kernel = stbi__YCbCr_to_RGB_simd;
3831
4.75k
      j->resample_row_hv_2_kernel = stbi__resample_row_hv_2_simd;
3832
4.75k
   }
3833
4.75k
#endif
3834
3835
#ifdef STBI_NEON
3836
   j->idct_block_kernel = stbi__idct_simd;
3837
   j->YCbCr_to_RGB_kernel = stbi__YCbCr_to_RGB_simd;
3838
   j->resample_row_hv_2_kernel = stbi__resample_row_hv_2_simd;
3839
#endif
3840
4.75k
}
3841
3842
// clean up the temporary component buffers
3843
static void stbi__cleanup_jpeg(stbi__jpeg *j)
3844
1.82k
{
3845
1.82k
   stbi__free_jpeg_components(j, j->s->img_n, 0);
3846
1.82k
}
3847
3848
typedef struct
3849
{
3850
   resample_row_func resample;
3851
   stbi_uc *line0,*line1;
3852
   int hs,vs;   // expansion factor in each axis
3853
   int w_lores; // horizontal pixels pre-expansion
3854
   int ystep;   // how far through vertical expansion we are
3855
   int ypos;    // which pre-expansion row we're on
3856
} stbi__resample;
3857
3858
// fast 0..255 * 0..255 => 0..255 rounded multiplication
3859
static stbi_uc stbi__blinn_8x8(stbi_uc x, stbi_uc y)
3860
394M
{
3861
394M
   unsigned int t = x*y + 128;
3862
394M
   return (stbi_uc) ((t + (t >>8)) >> 8);
3863
394M
}
3864
3865
static stbi_uc *load_jpeg_image(stbi__jpeg *z, int *out_x, int *out_y, int *comp, int req_comp)
3866
1.82k
{
3867
1.82k
   int n, decode_n, is_rgb;
3868
1.82k
   z->s->img_n = 0; // make stbi__cleanup_jpeg safe
3869
3870
   // validate req_comp
3871
1.82k
   if (req_comp < 0 || req_comp > 4) return stbi__errpuc("bad req_comp", "Internal error");
3872
3873
   // load a jpeg image from whichever source, but leave in YCbCr format
3874
1.82k
   if (!stbi__decode_jpeg_image(z)) { stbi__cleanup_jpeg(z); return NULL; }
3875
3876
   // determine actual number of components to generate
3877
1.38k
   n = req_comp ? req_comp : z->s->img_n >= 3 ? 3 : 1;
3878
3879
1.38k
   is_rgb = z->s->img_n == 3 && (z->rgb == 3 || (z->app14_color_transform == 0 && !z->jfif));
3880
3881
1.38k
   if (z->s->img_n == 3 && n < 3 && !is_rgb)
3882
0
      decode_n = 1;
3883
1.38k
   else
3884
1.38k
      decode_n = z->s->img_n;
3885
3886
   // nothing to do if no components requested; check this now to avoid
3887
   // accessing uninitialized coutput[0] later
3888
1.38k
   if (decode_n <= 0) { stbi__cleanup_jpeg(z); return NULL; }
3889
3890
   // resample and color-convert
3891
1.38k
   {
3892
1.38k
      int k;
3893
1.38k
      unsigned int i,j;
3894
1.38k
      stbi_uc *output;
3895
1.38k
      stbi_uc *coutput[4] = { NULL, NULL, NULL, NULL };
3896
3897
1.38k
      stbi__resample res_comp[4];
3898
3899
3.77k
      for (k=0; k < decode_n; ++k) {
3900
2.38k
         stbi__resample *r = &res_comp[k];
3901
3902
         // allocate line buffer big enough for upsampling off the edges
3903
         // with upsample factor of 4
3904
2.38k
         z->img_comp[k].linebuf = (stbi_uc *) stbi__malloc(z->s->img_x + 3);
3905
2.38k
         if (!z->img_comp[k].linebuf) { stbi__cleanup_jpeg(z); return stbi__errpuc("outofmem", "Out of memory"); }
3906
3907
2.38k
         r->hs      = z->img_h_max / z->img_comp[k].h;
3908
2.38k
         r->vs      = z->img_v_max / z->img_comp[k].v;
3909
2.38k
         r->ystep   = r->vs >> 1;
3910
2.38k
         r->w_lores = (z->s->img_x + r->hs-1) / r->hs;
3911
2.38k
         r->ypos    = 0;
3912
2.38k
         r->line0   = r->line1 = z->img_comp[k].data;
3913
3914
2.38k
         if      (r->hs == 1 && r->vs == 1) r->resample = resample_row_1;
3915
1.12k
         else if (r->hs == 1 && r->vs == 2) r->resample = stbi__resample_row_v_2;
3916
967
         else if (r->hs == 2 && r->vs == 1) r->resample = stbi__resample_row_h_2;
3917
792
         else if (r->hs == 2 && r->vs == 2) r->resample = z->resample_row_hv_2_kernel;
3918
480
         else                               r->resample = stbi__resample_row_generic;
3919
2.38k
      }
3920
3921
      // can't error after this so, this is safe
3922
1.38k
      output = (stbi_uc *) stbi__malloc_mad3(n, z->s->img_x, z->s->img_y, 1);
3923
1.38k
      if (!output) { stbi__cleanup_jpeg(z); return stbi__errpuc("outofmem", "Out of memory"); }
3924
3925
      // now go ahead and resample
3926
11.8M
      for (j=0; j < z->s->img_y; ++j) {
3927
11.8M
         stbi_uc *out = output + n * z->s->img_x * j;
3928
28.7M
         for (k=0; k < decode_n; ++k) {
3929
16.9M
            stbi__resample *r = &res_comp[k];
3930
16.9M
            int y_bot = r->ystep >= (r->vs >> 1);
3931
16.9M
            coutput[k] = r->resample(z->img_comp[k].linebuf,
3932
16.9M
                                     y_bot ? r->line1 : r->line0,
3933
16.9M
                                     y_bot ? r->line0 : r->line1,
3934
16.9M
                                     r->w_lores, r->hs);
3935
16.9M
            if (++r->ystep >= r->vs) {
3936
14.5M
               r->ystep = 0;
3937
14.5M
               r->line0 = r->line1;
3938
14.5M
               if (++r->ypos < z->img_comp[k].y)
3939
14.5M
                  r->line1 += z->img_comp[k].w2;
3940
14.5M
            }
3941
16.9M
         }
3942
11.8M
         if (n >= 3) {
3943
11.8M
            stbi_uc *y = coutput[0];
3944
11.8M
            if (z->s->img_n == 3) {
3945
1.91M
               if (is_rgb) {
3946
51.2M
                  for (i=0; i < z->s->img_x; ++i) {
3947
51.1M
                     out[0] = y[i];
3948
51.1M
                     out[1] = coutput[1][i];
3949
51.1M
                     out[2] = coutput[2][i];
3950
51.1M
                     out[3] = 255;
3951
51.1M
                     out += n;
3952
51.1M
                  }
3953
1.81M
               } else {
3954
1.81M
                  z->YCbCr_to_RGB_kernel(out, y, coutput[1], coutput[2], z->s->img_x, n);
3955
1.81M
               }
3956
9.93M
            } else if (z->s->img_n == 4) {
3957
414k
               if (z->app14_color_transform == 0) { // CMYK
3958
83.2M
                  for (i=0; i < z->s->img_x; ++i) {
3959
83.1M
                     stbi_uc m = coutput[3][i];
3960
83.1M
                     out[0] = stbi__blinn_8x8(coutput[0][i], m);
3961
83.1M
                     out[1] = stbi__blinn_8x8(coutput[1][i], m);
3962
83.1M
                     out[2] = stbi__blinn_8x8(coutput[2][i], m);
3963
83.1M
                     out[3] = 255;
3964
83.1M
                     out += n;
3965
83.1M
                  }
3966
347k
               } else if (z->app14_color_transform == 2) { // YCCK
3967
124k
                  z->YCbCr_to_RGB_kernel(out, y, coutput[1], coutput[2], z->s->img_x, n);
3968
48.4M
                  for (i=0; i < z->s->img_x; ++i) {
3969
48.3M
                     stbi_uc m = coutput[3][i];
3970
48.3M
                     out[0] = stbi__blinn_8x8(255 - out[0], m);
3971
48.3M
                     out[1] = stbi__blinn_8x8(255 - out[1], m);
3972
48.3M
                     out[2] = stbi__blinn_8x8(255 - out[2], m);
3973
48.3M
                     out += n;
3974
48.3M
                  }
3975
222k
               } else { // YCbCr + alpha?  Ignore the fourth channel for now
3976
222k
                  z->YCbCr_to_RGB_kernel(out, y, coutput[1], coutput[2], z->s->img_x, n);
3977
222k
               }
3978
414k
            } else
3979
4.50G
               for (i=0; i < z->s->img_x; ++i) {
3980
4.49G
                  out[0] = out[1] = out[2] = y[i];
3981
4.49G
                  out[3] = 255; // not used if n==3
3982
4.49G
                  out += n;
3983
4.49G
               }
3984
11.8M
         } else {
3985
0
            if (is_rgb) {
3986
0
               if (n == 1)
3987
0
                  for (i=0; i < z->s->img_x; ++i)
3988
0
                     *out++ = stbi__compute_y(coutput[0][i], coutput[1][i], coutput[2][i]);
3989
0
               else {
3990
0
                  for (i=0; i < z->s->img_x; ++i, out += 2) {
3991
0
                     out[0] = stbi__compute_y(coutput[0][i], coutput[1][i], coutput[2][i]);
3992
0
                     out[1] = 255;
3993
0
                  }
3994
0
               }
3995
0
            } else if (z->s->img_n == 4 && z->app14_color_transform == 0) {
3996
0
               for (i=0; i < z->s->img_x; ++i) {
3997
0
                  stbi_uc m = coutput[3][i];
3998
0
                  stbi_uc r = stbi__blinn_8x8(coutput[0][i], m);
3999
0
                  stbi_uc g = stbi__blinn_8x8(coutput[1][i], m);
4000
0
                  stbi_uc b = stbi__blinn_8x8(coutput[2][i], m);
4001
0
                  out[0] = stbi__compute_y(r, g, b);
4002
0
                  out[1] = 255;
4003
0
                  out += n;
4004
0
               }
4005
0
            } else if (z->s->img_n == 4 && z->app14_color_transform == 2) {
4006
0
               for (i=0; i < z->s->img_x; ++i) {
4007
0
                  out[0] = stbi__blinn_8x8(255 - coutput[0][i], coutput[3][i]);
4008
0
                  out[1] = 255;
4009
0
                  out += n;
4010
0
               }
4011
0
            } else {
4012
0
               stbi_uc *y = coutput[0];
4013
0
               if (n == 1)
4014
0
                  for (i=0; i < z->s->img_x; ++i) out[i] = y[i];
4015
0
               else
4016
0
                  for (i=0; i < z->s->img_x; ++i) { *out++ = y[i]; *out++ = 255; }
4017
0
            }
4018
0
         }
4019
11.8M
      }
4020
1.38k
      stbi__cleanup_jpeg(z);
4021
1.38k
      *out_x = z->s->img_x;
4022
1.38k
      *out_y = z->s->img_y;
4023
1.38k
      if (comp) *comp = z->s->img_n >= 3 ? 3 : 1; // report original components, not output
4024
1.38k
      return output;
4025
1.38k
   }
4026
1.38k
}
4027
4028
static void *stbi__jpeg_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
4029
1.82k
{
4030
1.82k
   unsigned char* result;
4031
1.82k
   stbi__jpeg* j = (stbi__jpeg*) stbi__malloc(sizeof(stbi__jpeg));
4032
1.82k
   if (!j) return stbi__errpuc("outofmem", "Out of memory");
4033
1.82k
   memset(j, 0, sizeof(stbi__jpeg));
4034
1.82k
   STBI_NOTUSED(ri);
4035
1.82k
   j->s = s;
4036
1.82k
   stbi__setup_jpeg(j);
4037
1.82k
   result = load_jpeg_image(j, x,y,comp,req_comp);
4038
1.82k
   STBI_FREE(j);
4039
1.82k
   return result;
4040
1.82k
}
4041
4042
static int stbi__jpeg_test(stbi__context *s)
4043
2.92k
{
4044
2.92k
   int r;
4045
2.92k
   stbi__jpeg* j = (stbi__jpeg*)stbi__malloc(sizeof(stbi__jpeg));
4046
2.92k
   if (!j) return stbi__err("outofmem", "Out of memory");
4047
2.92k
   memset(j, 0, sizeof(stbi__jpeg));
4048
2.92k
   j->s = s;
4049
2.92k
   stbi__setup_jpeg(j);
4050
2.92k
   r = stbi__decode_jpeg_header(j, STBI__SCAN_type);
4051
2.92k
   stbi__rewind(s);
4052
2.92k
   STBI_FREE(j);
4053
2.92k
   return r;
4054
2.92k
}
4055
4056
static int stbi__jpeg_info_raw(stbi__jpeg *j, int *x, int *y, int *comp)
4057
8.78k
{
4058
8.78k
   if (!stbi__decode_jpeg_header(j, STBI__SCAN_header)) {
4059
6.95k
      stbi__rewind( j->s );
4060
6.95k
      return 0;
4061
6.95k
   }
4062
1.82k
   if (x) *x = j->s->img_x;
4063
1.82k
   if (y) *y = j->s->img_y;
4064
1.82k
   if (comp) *comp = j->s->img_n >= 3 ? 3 : 1;
4065
1.82k
   return 1;
4066
8.78k
}
4067
4068
static int stbi__jpeg_info(stbi__context *s, int *x, int *y, int *comp)
4069
8.78k
{
4070
8.78k
   int result;
4071
8.78k
   stbi__jpeg* j = (stbi__jpeg*) (stbi__malloc(sizeof(stbi__jpeg)));
4072
8.78k
   if (!j) return stbi__err("outofmem", "Out of memory");
4073
8.78k
   memset(j, 0, sizeof(stbi__jpeg));
4074
8.78k
   j->s = s;
4075
8.78k
   result = stbi__jpeg_info_raw(j, x, y, comp);
4076
8.78k
   STBI_FREE(j);
4077
8.78k
   return result;
4078
8.78k
}
4079
#endif
4080
4081
// public domain zlib decode    v0.2  Sean Barrett 2006-11-18
4082
//    simple implementation
4083
//      - all input must be provided in an upfront buffer
4084
//      - all output is written to a single output buffer (can malloc/realloc)
4085
//    performance
4086
//      - fast huffman
4087
4088
#ifndef STBI_NO_ZLIB
4089
4090
// fast-way is faster to check than jpeg huffman, but slow way is slower
4091
21.7M
#define STBI__ZFAST_BITS  9 // accelerate all cases in default tables
4092
8.55M
#define STBI__ZFAST_MASK  ((1 << STBI__ZFAST_BITS) - 1)
4093
76.1k
#define STBI__ZNSYMS 288 // number of symbols in literal/length alphabet
4094
4095
// zlib-style huffman encoding
4096
// (jpegs packs from left, zlib from right, so can't share code)
4097
typedef struct
4098
{
4099
   stbi__uint16 fast[1 << STBI__ZFAST_BITS];
4100
   stbi__uint16 firstcode[16];
4101
   int maxcode[17];
4102
   stbi__uint16 firstsymbol[16];
4103
   stbi_uc  size[STBI__ZNSYMS];
4104
   stbi__uint16 value[STBI__ZNSYMS];
4105
} stbi__zhuffman;
4106
4107
stbi_inline static int stbi__bitreverse16(int n)
4108
2.05M
{
4109
2.05M
  n = ((n & 0xAAAA) >>  1) | ((n & 0x5555) << 1);
4110
2.05M
  n = ((n & 0xCCCC) >>  2) | ((n & 0x3333) << 2);
4111
2.05M
  n = ((n & 0xF0F0) >>  4) | ((n & 0x0F0F) << 4);
4112
2.05M
  n = ((n & 0xFF00) >>  8) | ((n & 0x00FF) << 8);
4113
2.05M
  return n;
4114
2.05M
}
4115
4116
stbi_inline static int stbi__bit_reverse(int v, int bits)
4117
2.05M
{
4118
2.05M
   STBI_ASSERT(bits <= 16);
4119
   // to bit reverse n bits, reverse 16 and shift
4120
   // e.g. 11 bits, bit reverse and shift away 5
4121
2.05M
   return stbi__bitreverse16(v) >> (16-bits);
4122
2.05M
}
4123
4124
static int stbi__zbuild_huffman(stbi__zhuffman *z, const stbi_uc *sizelist, int num)
4125
24.7k
{
4126
24.7k
   int i,k=0;
4127
24.7k
   int code, next_code[16], sizes[17];
4128
4129
   // DEFLATE spec for generating codes
4130
24.7k
   memset(sizes, 0, sizeof(sizes));
4131
24.7k
   memset(z->fast, 0, sizeof(z->fast));
4132
3.12M
   for (i=0; i < num; ++i)
4133
3.10M
      ++sizes[sizelist[i]];
4134
24.7k
   sizes[0] = 0;
4135
395k
   for (i=1; i < 16; ++i)
4136
370k
      if (sizes[i] > (1 << i))
4137
14
         return stbi__err("bad sizes", "Corrupt PNG");
4138
24.7k
   code = 0;
4139
395k
   for (i=1; i < 16; ++i) {
4140
370k
      next_code[i] = code;
4141
370k
      z->firstcode[i] = (stbi__uint16) code;
4142
370k
      z->firstsymbol[i] = (stbi__uint16) k;
4143
370k
      code = (code + sizes[i]);
4144
370k
      if (sizes[i])
4145
70.2k
         if (code-1 >= (1 << i)) return stbi__err("bad codelengths","Corrupt PNG");
4146
370k
      z->maxcode[i] = code << (16-i); // preshift for inner loop
4147
370k
      code <<= 1;
4148
370k
      k += sizes[i];
4149
370k
   }
4150
24.6k
   z->maxcode[16] = 0x10000; // sentinel
4151
3.12M
   for (i=0; i < num; ++i) {
4152
3.09M
      int s = sizelist[i];
4153
3.09M
      if (s) {
4154
2.05M
         int c = next_code[s] - z->firstcode[s] + z->firstsymbol[s];
4155
2.05M
         stbi__uint16 fastv = (stbi__uint16) ((s << 9) | i);
4156
2.05M
         z->size [c] = (stbi_uc     ) s;
4157
2.05M
         z->value[c] = (stbi__uint16) i;
4158
2.05M
         if (s <= STBI__ZFAST_BITS) {
4159
1.98M
            int j = stbi__bit_reverse(next_code[s],s);
4160
11.0M
            while (j < (1 << STBI__ZFAST_BITS)) {
4161
9.08M
               z->fast[j] = fastv;
4162
9.08M
               j += (1 << s);
4163
9.08M
            }
4164
1.98M
         }
4165
2.05M
         ++next_code[s];
4166
2.05M
      }
4167
3.09M
   }
4168
24.6k
   return 1;
4169
24.7k
}
4170
4171
// zlib-from-memory implementation for PNG reading
4172
//    because PNG allows splitting the zlib stream arbitrarily,
4173
//    and it's annoying structurally to have PNG call ZLIB call PNG,
4174
//    we require PNG read all the IDATs and combine them into a single
4175
//    memory buffer
4176
4177
typedef struct
4178
{
4179
   stbi_uc *zbuffer, *zbuffer_end;
4180
   int num_bits;
4181
   int hit_zeof_once;
4182
   stbi__uint32 code_buffer;
4183
4184
   char *zout;
4185
   char *zout_start;
4186
   char *zout_end;
4187
   int   z_expandable;
4188
4189
   stbi__zhuffman z_length, z_distance;
4190
} stbi__zbuf;
4191
4192
stbi_inline static int stbi__zeof(stbi__zbuf *z)
4193
3.90M
{
4194
3.90M
   return (z->zbuffer >= z->zbuffer_end);
4195
3.90M
}
4196
4197
stbi_inline static stbi_uc stbi__zget8(stbi__zbuf *z)
4198
2.66M
{
4199
2.66M
   return stbi__zeof(z) ? 0 : *z->zbuffer++;
4200
2.66M
}
4201
4202
static void stbi__fill_bits(stbi__zbuf *z)
4203
1.25M
{
4204
2.65M
   do {
4205
2.65M
      if (z->code_buffer >= (1U << z->num_bits)) {
4206
0
        z->zbuffer = z->zbuffer_end;  /* treat this as EOF so we fail. */
4207
0
        return;
4208
0
      }
4209
2.65M
      z->code_buffer |= (unsigned int) stbi__zget8(z) << z->num_bits;
4210
2.65M
      z->num_bits += 8;
4211
2.65M
   } while (z->num_bits <= 24);
4212
1.25M
}
4213
4214
stbi_inline static unsigned int stbi__zreceive(stbi__zbuf *z, int n)
4215
449k
{
4216
449k
   unsigned int k;
4217
449k
   if (z->num_bits < n) stbi__fill_bits(z);
4218
449k
   k = z->code_buffer & ((1 << n) - 1);
4219
449k
   z->code_buffer >>= n;
4220
449k
   z->num_bits -= n;
4221
449k
   return k;
4222
449k
}
4223
4224
static int stbi__zhuffman_decode_slowpath(stbi__zbuf *a, stbi__zhuffman *z)
4225
70.4k
{
4226
70.4k
   int b,s,k;
4227
   // not resolved by fast table, so compute it the slow way
4228
   // use jpeg approach, which requires MSbits at top
4229
70.4k
   k = stbi__bit_reverse(a->code_buffer, 16);
4230
70.4k
   for (s=STBI__ZFAST_BITS+1; ; ++s)
4231
123k
      if (k < z->maxcode[s])
4232
70.4k
         break;
4233
70.4k
   if (s >= 16) return -1; // invalid code!
4234
   // code size is s, so:
4235
70.3k
   b = (k >> (16-s)) - z->firstcode[s] + z->firstsymbol[s];
4236
70.3k
   if (b >= STBI__ZNSYMS) return -1; // some data was corrupt somewhere!
4237
70.3k
   if (z->size[b] != s) return -1;  // was originally an assert, but report failure instead.
4238
70.3k
   a->code_buffer >>= s;
4239
70.3k
   a->num_bits -= s;
4240
70.3k
   return z->value[b];
4241
70.3k
}
4242
4243
stbi_inline static int stbi__zhuffman_decode(stbi__zbuf *a, stbi__zhuffman *z)
4244
8.55M
{
4245
8.55M
   int b,s;
4246
8.55M
   if (a->num_bits < 16) {
4247
1.23M
      if (stbi__zeof(a)) {
4248
703
         if (!a->hit_zeof_once) {
4249
            // This is the first time we hit eof, insert 16 extra padding btis
4250
            // to allow us to keep going; if we actually consume any of them
4251
            // though, that is invalid data. This is caught later.
4252
563
            a->hit_zeof_once = 1;
4253
563
            a->num_bits += 16; // add 16 implicit zero bits
4254
563
         } else {
4255
            // We already inserted our extra 16 padding bits and are again
4256
            // out, this stream is actually prematurely terminated.
4257
140
            return -1;
4258
140
         }
4259
1.23M
      } else {
4260
1.23M
         stbi__fill_bits(a);
4261
1.23M
      }
4262
1.23M
   }
4263
8.55M
   b = z->fast[a->code_buffer & STBI__ZFAST_MASK];
4264
8.55M
   if (b) {
4265
8.48M
      s = b >> 9;
4266
8.48M
      a->code_buffer >>= s;
4267
8.48M
      a->num_bits -= s;
4268
8.48M
      return b & 511;
4269
8.48M
   }
4270
70.4k
   return stbi__zhuffman_decode_slowpath(a, z);
4271
8.55M
}
4272
4273
static int stbi__zexpand(stbi__zbuf *z, char *zout, int n)  // need to make room for n bytes
4274
954
{
4275
954
   char *q;
4276
954
   unsigned int cur, limit, old_limit;
4277
954
   z->zout = zout;
4278
954
   if (!z->z_expandable) return stbi__err("output buffer limit","Corrupt PNG");
4279
954
   cur   = (unsigned int) (z->zout - z->zout_start);
4280
954
   limit = old_limit = (unsigned) (z->zout_end - z->zout_start);
4281
954
   if (UINT_MAX - cur < (unsigned) n) return stbi__err("outofmem", "Out of memory");
4282
2.17k
   while (cur + n > limit) {
4283
1.22k
      if(limit > UINT_MAX / 2) return stbi__err("outofmem", "Out of memory");
4284
1.22k
      limit *= 2;
4285
1.22k
   }
4286
954
   q = (char *) STBI_REALLOC_SIZED(z->zout_start, old_limit, limit);
4287
954
   STBI_NOTUSED(old_limit);
4288
954
   if (q == NULL) return stbi__err("outofmem", "Out of memory");
4289
954
   z->zout_start = q;
4290
954
   z->zout       = q + cur;
4291
954
   z->zout_end   = q + limit;
4292
954
   return 1;
4293
954
}
4294
4295
static const int stbi__zlength_base[31] = {
4296
   3,4,5,6,7,8,9,10,11,13,
4297
   15,17,19,23,27,31,35,43,51,59,
4298
   67,83,99,115,131,163,195,227,258,0,0 };
4299
4300
static const int stbi__zlength_extra[31]=
4301
{ 0,0,0,0,0,0,0,0,1,1,1,1,2,2,2,2,3,3,3,3,4,4,4,4,5,5,5,5,0,0,0 };
4302
4303
static const int stbi__zdist_base[32] = { 1,2,3,4,5,7,9,13,17,25,33,49,65,97,129,193,
4304
257,385,513,769,1025,1537,2049,3073,4097,6145,8193,12289,16385,24577,0,0};
4305
4306
static const int stbi__zdist_extra[32] =
4307
{ 0,0,0,0,1,1,2,2,3,3,4,4,5,5,6,6,7,7,8,8,9,9,10,10,11,11,12,12,13,13};
4308
4309
static int stbi__parse_huffman_block(stbi__zbuf *a)
4310
10.0k
{
4311
10.0k
   char *zout = a->zout;
4312
4.84M
   for(;;) {
4313
4.84M
      int z = stbi__zhuffman_decode(a, &a->z_length);
4314
4.84M
      if (z < 256) {
4315
1.31M
         if (z < 0) return stbi__err("bad huffman code","Corrupt PNG"); // error in huffman codes
4316
1.31M
         if (zout >= a->zout_end) {
4317
231
            if (!stbi__zexpand(a, zout, 1)) return 0;
4318
231
            zout = a->zout;
4319
231
         }
4320
1.31M
         *zout++ = (char) z;
4321
3.52M
      } else {
4322
3.52M
         stbi_uc *p;
4323
3.52M
         int len,dist;
4324
3.52M
         if (z == 256) {
4325
9.92k
            a->zout = zout;
4326
9.92k
            if (a->hit_zeof_once && a->num_bits < 16) {
4327
               // The first time we hit zeof, we inserted 16 extra zero bits into our bit
4328
               // buffer so the decoder can just do its speculative decoding. But if we
4329
               // actually consumed any of those bits (which is the case when num_bits < 16),
4330
               // the stream actually read past the end so it is malformed.
4331
29
               return stbi__err("unexpected end","Corrupt PNG");
4332
29
            }
4333
9.89k
            return 1;
4334
9.92k
         }
4335
3.51M
         if (z >= 286) return stbi__err("bad huffman code","Corrupt PNG"); // per DEFLATE, length codes 286 and 287 must not appear in compressed data
4336
3.51M
         z -= 257;
4337
3.51M
         len = stbi__zlength_base[z];
4338
3.51M
         if (stbi__zlength_extra[z]) len += stbi__zreceive(a, stbi__zlength_extra[z]);
4339
3.51M
         z = stbi__zhuffman_decode(a, &a->z_distance);
4340
3.51M
         if (z < 0 || z >= 30) return stbi__err("bad huffman code","Corrupt PNG"); // per DEFLATE, distance codes 30 and 31 must not appear in compressed data
4341
3.51M
         dist = stbi__zdist_base[z];
4342
3.51M
         if (stbi__zdist_extra[z]) dist += stbi__zreceive(a, stbi__zdist_extra[z]);
4343
3.51M
         if (zout - a->zout_start < dist) return stbi__err("bad dist","Corrupt PNG");
4344
3.51M
         if (len > a->zout_end - zout) {
4345
650
            if (!stbi__zexpand(a, zout, len)) return 0;
4346
650
            zout = a->zout;
4347
650
         }
4348
3.51M
         p = (stbi_uc *) (zout - dist);
4349
3.51M
         if (dist == 1) { // run of one byte; common in images.
4350
3.14M
            stbi_uc v = *p;
4351
809M
            if (len) { do *zout++ = v; while (--len); }
4352
3.14M
         } else {
4353
47.4M
            if (len) { do *zout++ = *p++; while (--len); }
4354
370k
         }
4355
3.51M
      }
4356
4.84M
   }
4357
10.0k
}
4358
4359
static int stbi__compute_huffman_codes(stbi__zbuf *a)
4360
4.54k
{
4361
4.54k
   static const stbi_uc length_dezigzag[19] = { 16,17,18,0,8,7,9,6,10,5,11,4,12,3,13,2,14,1,15 };
4362
4.54k
   stbi__zhuffman z_codelength;
4363
4.54k
   stbi_uc lencodes[286+32+137];//padding for maximum single op
4364
4.54k
   stbi_uc codelength_sizes[19];
4365
4.54k
   int i,n;
4366
4367
4.54k
   int hlit  = stbi__zreceive(a,5) + 257;
4368
4.54k
   int hdist = stbi__zreceive(a,5) + 1;
4369
4.54k
   int hclen = stbi__zreceive(a,4) + 4;
4370
4.54k
   int ntot  = hlit + hdist;
4371
4372
4.54k
   memset(codelength_sizes, 0, sizeof(codelength_sizes));
4373
78.6k
   for (i=0; i < hclen; ++i) {
4374
74.1k
      int s = stbi__zreceive(a,3);
4375
74.1k
      codelength_sizes[length_dezigzag[i]] = (stbi_uc) s;
4376
74.1k
   }
4377
4.54k
   if (!stbi__zbuild_huffman(&z_codelength, codelength_sizes, 19)) return 0;
4378
4379
4.51k
   n = 0;
4380
192k
   while (n < ntot) {
4381
188k
      int c = stbi__zhuffman_decode(a, &z_codelength);
4382
188k
      if (c < 0 || c >= 19) return stbi__err("bad codelengths", "Corrupt PNG");
4383
188k
      if (c < 16)
4384
164k
         lencodes[n++] = (stbi_uc) c;
4385
23.5k
      else {
4386
23.5k
         stbi_uc fill = 0;
4387
23.5k
         if (c == 16) {
4388
5.52k
            c = stbi__zreceive(a,2)+3;
4389
5.52k
            if (n == 0) return stbi__err("bad codelengths", "Corrupt PNG");
4390
5.50k
            fill = lencodes[n-1];
4391
17.9k
         } else if (c == 17) {
4392
7.85k
            c = stbi__zreceive(a,3)+3;
4393
10.1k
         } else if (c == 18) {
4394
10.1k
            c = stbi__zreceive(a,7)+11;
4395
10.1k
         } else {
4396
0
            return stbi__err("bad codelengths", "Corrupt PNG");
4397
0
         }
4398
23.4k
         if (ntot - n < c) return stbi__err("bad codelengths", "Corrupt PNG");
4399
23.4k
         memset(lencodes+n, fill, c);
4400
23.4k
         n += c;
4401
23.4k
      }
4402
188k
   }
4403
4.36k
   if (n != ntot) return stbi__err("bad codelengths","Corrupt PNG");
4404
4.36k
   if (!stbi__zbuild_huffman(&a->z_length, lencodes, hlit)) return 0;
4405
4.34k
   if (!stbi__zbuild_huffman(&a->z_distance, lencodes+hlit, hdist)) return 0;
4406
4.34k
   return 1;
4407
4.34k
}
4408
4409
static int stbi__parse_uncompressed_block(stbi__zbuf *a)
4410
3.12k
{
4411
3.12k
   stbi_uc header[4];
4412
3.12k
   int len,nlen,k;
4413
3.12k
   if (a->num_bits & 7)
4414
2.06k
      stbi__zreceive(a, a->num_bits & 7); // discard
4415
   // drain the bit-packed data into header
4416
3.12k
   k = 0;
4417
8.22k
   while (a->num_bits > 0) {
4418
5.10k
      header[k++] = (stbi_uc) (a->code_buffer & 255); // suppress MSVC run-time check
4419
5.10k
      a->code_buffer >>= 8;
4420
5.10k
      a->num_bits -= 8;
4421
5.10k
   }
4422
3.12k
   if (a->num_bits < 0) return stbi__err("zlib corrupt","Corrupt PNG");
4423
   // now fill header the normal way
4424
10.5k
   while (k < 4)
4425
7.38k
      header[k++] = stbi__zget8(a);
4426
3.12k
   len  = header[1] * 256 + header[0];
4427
3.12k
   nlen = header[3] * 256 + header[2];
4428
3.12k
   if (nlen != (len ^ 0xffff)) return stbi__err("zlib corrupt","Corrupt PNG");
4429
2.98k
   if (a->zbuffer + len > a->zbuffer_end) return stbi__err("read past buffer","Corrupt PNG");
4430
2.98k
   if (a->zout + len > a->zout_end)
4431
73
      if (!stbi__zexpand(a, a->zout, len)) return 0;
4432
2.98k
   memcpy(a->zout, a->zbuffer, len);
4433
2.98k
   a->zbuffer += len;
4434
2.98k
   a->zout += len;
4435
2.98k
   return 1;
4436
2.98k
}
4437
4438
static int stbi__parse_zlib_header(stbi__zbuf *a)
4439
1.14k
{
4440
1.14k
   int cmf   = stbi__zget8(a);
4441
1.14k
   int cm    = cmf & 15;
4442
   /* int cinfo = cmf >> 4; */
4443
1.14k
   int flg   = stbi__zget8(a);
4444
1.14k
   if (stbi__zeof(a)) return stbi__err("bad zlib header","Corrupt PNG"); // zlib spec
4445
1.14k
   if ((cmf*256+flg) % 31 != 0) return stbi__err("bad zlib header","Corrupt PNG"); // zlib spec
4446
1.14k
   if (flg & 32) return stbi__err("no preset dict","Corrupt PNG"); // preset dictionary not allowed in png
4447
1.14k
   if (cm != 8) return stbi__err("bad compression","Corrupt PNG"); // DEFLATE required for png
4448
   // window = 1 << (8 + cinfo)... but who cares, we fully buffer output
4449
1.13k
   return 1;
4450
1.14k
}
4451
4452
static const stbi_uc stbi__zdefault_length[STBI__ZNSYMS] =
4453
{
4454
   8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8, 8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,
4455
   8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8, 8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,
4456
   8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8, 8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,
4457
   8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8, 8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,
4458
   8,8,8,8,8,8,8,8,8,8,8,8,8,8,8,8, 9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,
4459
   9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9, 9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,
4460
   9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9, 9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,
4461
   9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9, 9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,9,
4462
   7,7,7,7,7,7,7,7,7,7,7,7,7,7,7,7, 7,7,7,7,7,7,7,7,8,8,8,8,8,8,8,8
4463
};
4464
static const stbi_uc stbi__zdefault_distance[32] =
4465
{
4466
   5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5,5
4467
};
4468
/*
4469
Init algorithm:
4470
{
4471
   int i;   // use <= to match clearly with spec
4472
   for (i=0; i <= 143; ++i)     stbi__zdefault_length[i]   = 8;
4473
   for (   ; i <= 255; ++i)     stbi__zdefault_length[i]   = 9;
4474
   for (   ; i <= 279; ++i)     stbi__zdefault_length[i]   = 7;
4475
   for (   ; i <= 287; ++i)     stbi__zdefault_length[i]   = 8;
4476
4477
   for (i=0; i <=  31; ++i)     stbi__zdefault_distance[i] = 5;
4478
}
4479
*/
4480
4481
static int stbi__parse_zlib(stbi__zbuf *a, int parse_header)
4482
1.14k
{
4483
1.14k
   int final, type;
4484
1.14k
   if (parse_header)
4485
1.14k
      if (!stbi__parse_zlib_header(a)) return 0;
4486
1.14k
   a->num_bits = 0;
4487
1.14k
   a->code_buffer = 0;
4488
1.14k
   a->hit_zeof_once = 0;
4489
13.4k
   do {
4490
13.4k
      final = stbi__zreceive(a,1);
4491
13.4k
      type = stbi__zreceive(a,2);
4492
13.4k
      if (type == 0) {
4493
3.12k
         if (!stbi__parse_uncompressed_block(a)) return 0;
4494
10.2k
      } else if (type == 3) {
4495
9
         return 0;
4496
10.2k
      } else {
4497
10.2k
         if (type == 1) {
4498
            // use fixed code lengths
4499
5.74k
            if (!stbi__zbuild_huffman(&a->z_length  , stbi__zdefault_length  , STBI__ZNSYMS)) return 0;
4500
5.74k
            if (!stbi__zbuild_huffman(&a->z_distance, stbi__zdefault_distance,  32)) return 0;
4501
5.74k
         } else {
4502
4.54k
            if (!stbi__compute_huffman_codes(a)) return 0;
4503
4.54k
         }
4504
10.0k
         if (!stbi__parse_huffman_block(a)) return 0;
4505
10.0k
      }
4506
13.4k
   } while (!final);
4507
603
   return 1;
4508
1.14k
}
4509
4510
static int stbi__do_zlib(stbi__zbuf *a, char *obuf, int olen, int exp, int parse_header)
4511
1.14k
{
4512
1.14k
   a->zout_start = obuf;
4513
1.14k
   a->zout       = obuf;
4514
1.14k
   a->zout_end   = obuf + olen;
4515
1.14k
   a->z_expandable = exp;
4516
4517
1.14k
   return stbi__parse_zlib(a, parse_header);
4518
1.14k
}
4519
4520
STBIDEF char *stbi_zlib_decode_malloc_guesssize(const char *buffer, int len, int initial_size, int *outlen)
4521
0
{
4522
0
   stbi__zbuf a;
4523
0
   char *p = (char *) stbi__malloc(initial_size);
4524
0
   if (p == NULL) return NULL;
4525
0
   a.zbuffer = (stbi_uc *) buffer;
4526
0
   a.zbuffer_end = (stbi_uc *) buffer + len;
4527
0
   if (stbi__do_zlib(&a, p, initial_size, 1, 1)) {
4528
0
      if (outlen) *outlen = (int) (a.zout - a.zout_start);
4529
0
      return a.zout_start;
4530
0
   } else {
4531
0
      STBI_FREE(a.zout_start);
4532
0
      return NULL;
4533
0
   }
4534
0
}
4535
4536
STBIDEF char *stbi_zlib_decode_malloc(char const *buffer, int len, int *outlen)
4537
0
{
4538
0
   return stbi_zlib_decode_malloc_guesssize(buffer, len, 16384, outlen);
4539
0
}
4540
4541
STBIDEF char *stbi_zlib_decode_malloc_guesssize_headerflag(const char *buffer, int len, int initial_size, int *outlen, int parse_header)
4542
1.14k
{
4543
1.14k
   stbi__zbuf a;
4544
1.14k
   char *p = (char *) stbi__malloc(initial_size);
4545
1.14k
   if (p == NULL) return NULL;
4546
1.14k
   a.zbuffer = (stbi_uc *) buffer;
4547
1.14k
   a.zbuffer_end = (stbi_uc *) buffer + len;
4548
1.14k
   if (stbi__do_zlib(&a, p, initial_size, 1, parse_header)) {
4549
603
      if (outlen) *outlen = (int) (a.zout - a.zout_start);
4550
603
      return a.zout_start;
4551
603
   } else {
4552
545
      STBI_FREE(a.zout_start);
4553
545
      return NULL;
4554
545
   }
4555
1.14k
}
4556
4557
STBIDEF int stbi_zlib_decode_buffer(char *obuffer, int olen, char const *ibuffer, int ilen)
4558
0
{
4559
0
   stbi__zbuf a;
4560
0
   a.zbuffer = (stbi_uc *) ibuffer;
4561
0
   a.zbuffer_end = (stbi_uc *) ibuffer + ilen;
4562
0
   if (stbi__do_zlib(&a, obuffer, olen, 0, 1))
4563
0
      return (int) (a.zout - a.zout_start);
4564
0
   else
4565
0
      return -1;
4566
0
}
4567
4568
STBIDEF char *stbi_zlib_decode_noheader_malloc(char const *buffer, int len, int *outlen)
4569
0
{
4570
0
   stbi__zbuf a;
4571
0
   char *p = (char *) stbi__malloc(16384);
4572
0
   if (p == NULL) return NULL;
4573
0
   a.zbuffer = (stbi_uc *) buffer;
4574
0
   a.zbuffer_end = (stbi_uc *) buffer+len;
4575
0
   if (stbi__do_zlib(&a, p, 16384, 1, 0)) {
4576
0
      if (outlen) *outlen = (int) (a.zout - a.zout_start);
4577
0
      return a.zout_start;
4578
0
   } else {
4579
0
      STBI_FREE(a.zout_start);
4580
0
      return NULL;
4581
0
   }
4582
0
}
4583
4584
STBIDEF int stbi_zlib_decode_noheader_buffer(char *obuffer, int olen, const char *ibuffer, int ilen)
4585
0
{
4586
0
   stbi__zbuf a;
4587
0
   a.zbuffer = (stbi_uc *) ibuffer;
4588
0
   a.zbuffer_end = (stbi_uc *) ibuffer + ilen;
4589
0
   if (stbi__do_zlib(&a, obuffer, olen, 0, 0))
4590
0
      return (int) (a.zout - a.zout_start);
4591
0
   else
4592
0
      return -1;
4593
0
}
4594
#endif
4595
4596
// public domain "baseline" PNG decoder   v0.10  Sean Barrett 2006-11-18
4597
//    simple implementation
4598
//      - only 8-bit samples
4599
//      - no CRC checking
4600
//      - allocates lots of intermediate memory
4601
//        - avoids problem of streaming data between subsystems
4602
//        - avoids explicit window management
4603
//    performance
4604
//      - uses stb_zlib, a PD zlib implementation with fast huffman decoding
4605
4606
#ifndef STBI_NO_PNG
4607
typedef struct
4608
{
4609
   stbi__uint32 length;
4610
   stbi__uint32 type;
4611
} stbi__pngchunk;
4612
4613
static stbi__pngchunk stbi__get_chunk_header(stbi__context *s)
4614
16.7k
{
4615
16.7k
   stbi__pngchunk c;
4616
16.7k
   c.length = stbi__get32be(s);
4617
16.7k
   c.type   = stbi__get32be(s);
4618
16.7k
   return c;
4619
16.7k
}
4620
4621
static int stbi__check_png_header(stbi__context *s)
4622
15.1k
{
4623
15.1k
   static const stbi_uc png_sig[8] = { 137,80,78,71,13,10,26,10 };
4624
15.1k
   int i;
4625
56.7k
   for (i=0; i < 8; ++i)
4626
51.5k
      if (stbi__get8(s) != png_sig[i]) return stbi__err("bad png sig","Not a PNG");
4627
5.19k
   return 1;
4628
15.1k
}
4629
4630
typedef struct
4631
{
4632
   stbi__context *s;
4633
   stbi_uc *idata, *expanded, *out;
4634
   int depth;
4635
} stbi__png;
4636
4637
4638
enum {
4639
   STBI__F_none=0,
4640
   STBI__F_sub=1,
4641
   STBI__F_up=2,
4642
   STBI__F_avg=3,
4643
   STBI__F_paeth=4,
4644
   // synthetic filter used for first scanline to avoid needing a dummy row of 0s
4645
   STBI__F_avg_first
4646
};
4647
4648
static stbi_uc first_row_filter[5] =
4649
{
4650
   STBI__F_none,
4651
   STBI__F_sub,
4652
   STBI__F_none,
4653
   STBI__F_avg_first,
4654
   STBI__F_sub // Paeth with b=c=0 turns out to be equivalent to sub
4655
};
4656
4657
static int stbi__paeth(int a, int b, int c)
4658
139k
{
4659
   // This formulation looks very different from the reference in the PNG spec, but is
4660
   // actually equivalent and has favorable data dependencies and admits straightforward
4661
   // generation of branch-free code, which helps performance significantly.
4662
139k
   int thresh = c*3 - (a + b);
4663
139k
   int lo = a < b ? a : b;
4664
139k
   int hi = a < b ? b : a;
4665
139k
   int t0 = (hi <= thresh) ? lo : c;
4666
139k
   int t1 = (thresh <= lo) ? hi : t0;
4667
139k
   return t1;
4668
139k
}
4669
4670
static const stbi_uc stbi__depth_scale_table[9] = { 0, 0xff, 0x55, 0, 0x11, 0,0,0, 0x01 };
4671
4672
// adds an extra all-255 alpha channel
4673
// dest == src is legal
4674
// img_n must be 1 or 3
4675
static void stbi__create_png_alpha_expand8(stbi_uc *dest, stbi_uc *src, stbi__uint32 x, int img_n)
4676
146k
{
4677
146k
   int i;
4678
   // must process data backwards since we allow dest==src
4679
146k
   if (img_n == 1) {
4680
705k
      for (i=x-1; i >= 0; --i) {
4681
624k
         dest[i*2+1] = 255;
4682
624k
         dest[i*2+0] = src[i];
4683
624k
      }
4684
81.0k
   } else {
4685
65.3k
      STBI_ASSERT(img_n == 3);
4686
4.34M
      for (i=x-1; i >= 0; --i) {
4687
4.27M
         dest[i*4+3] = 255;
4688
4.27M
         dest[i*4+2] = src[i*3+2];
4689
4.27M
         dest[i*4+1] = src[i*3+1];
4690
4.27M
         dest[i*4+0] = src[i*3+0];
4691
4.27M
      }
4692
65.3k
   }
4693
146k
}
4694
4695
// create the png data from post-deflated data
4696
static int stbi__create_png_image_raw(stbi__png *a, stbi_uc *raw, stbi__uint32 raw_len, int out_n, stbi__uint32 x, stbi__uint32 y, int depth, int color)
4697
1.25k
{
4698
1.25k
   int bytes = (depth == 16 ? 2 : 1);
4699
1.25k
   stbi__context *s = a->s;
4700
1.25k
   stbi__uint32 i,j,stride = x*out_n*bytes;
4701
1.25k
   stbi__uint32 img_len, img_width_bytes;
4702
1.25k
   stbi_uc *filter_buf;
4703
1.25k
   int all_ok = 1;
4704
1.25k
   int k;
4705
1.25k
   int img_n = s->img_n; // copy it into a local for later
4706
4707
1.25k
   int output_bytes = out_n*bytes;
4708
1.25k
   int filter_bytes = img_n*bytes;
4709
1.25k
   int width = x;
4710
4711
1.25k
   STBI_ASSERT(out_n == s->img_n || out_n == s->img_n+1);
4712
1.25k
   a->out = (stbi_uc *) stbi__malloc_mad3(x, y, output_bytes, 0); // extra bytes to write off the end into
4713
1.25k
   if (!a->out) return stbi__err("outofmem", "Out of memory");
4714
4715
   // note: error exits here don't need to clean up a->out individually,
4716
   // stbi__do_png always does on error.
4717
1.25k
   if (!stbi__mad3sizes_valid(img_n, x, depth, 7)) return stbi__err("too large", "Corrupt PNG");
4718
1.25k
   img_width_bytes = (((img_n * x * depth) + 7) >> 3);
4719
1.25k
   if (!stbi__mad2sizes_valid(img_width_bytes, y, img_width_bytes)) return stbi__err("too large", "Corrupt PNG");
4720
1.25k
   img_len = (img_width_bytes + 1) * y;
4721
4722
   // we used to check for exact match between raw_len and img_len on non-interlaced PNGs,
4723
   // but issue #276 reported a PNG in the wild that had extra data at the end (all zeros),
4724
   // so just check for raw_len < img_len always.
4725
1.25k
   if (raw_len < img_len) return stbi__err("not enough pixels","Corrupt PNG");
4726
4727
   // Allocate two scan lines worth of filter workspace buffer.
4728
1.04k
   filter_buf = (stbi_uc *) stbi__malloc_mad2(img_width_bytes, 2, 0);
4729
1.04k
   if (!filter_buf) return stbi__err("outofmem", "Out of memory");
4730
4731
   // Filtering for low-bit-depth images
4732
1.04k
   if (depth < 8) {
4733
546
      filter_bytes = 1;
4734
546
      width = img_width_bytes;
4735
546
   }
4736
4737
951k
   for (j=0; j < y; ++j) {
4738
      // cur/prior filter buffers alternate
4739
950k
      stbi_uc *cur = filter_buf + (j & 1)*img_width_bytes;
4740
950k
      stbi_uc *prior = filter_buf + (~j & 1)*img_width_bytes;
4741
950k
      stbi_uc *dest = a->out + stride*j;
4742
950k
      int nk = width * filter_bytes;
4743
950k
      int filter = *raw++;
4744
4745
      // check filter type
4746
950k
      if (filter > 4) {
4747
50
         all_ok = stbi__err("invalid filter","Corrupt PNG");
4748
50
         break;
4749
50
      }
4750
4751
      // if first row, use special filter that doesn't sample previous row
4752
950k
      if (j == 0) filter = first_row_filter[filter];
4753
4754
      // perform actual filtering
4755
950k
      switch (filter) {
4756
342k
      case STBI__F_none:
4757
342k
         memcpy(cur, raw, nk);
4758
342k
         break;
4759
350k
      case STBI__F_sub:
4760
350k
         memcpy(cur, raw, filter_bytes);
4761
476M
         for (k = filter_bytes; k < nk; ++k)
4762
476M
            cur[k] = STBI__BYTECAST(raw[k] + cur[k-filter_bytes]);
4763
350k
         break;
4764
60.3k
      case STBI__F_up:
4765
148k
         for (k = 0; k < nk; ++k)
4766
87.8k
            cur[k] = STBI__BYTECAST(raw[k] + prior[k]);
4767
60.3k
         break;
4768
94.1k
      case STBI__F_avg:
4769
215k
         for (k = 0; k < filter_bytes; ++k)
4770
121k
            cur[k] = STBI__BYTECAST(raw[k] + (prior[k]>>1));
4771
159k
         for (k = filter_bytes; k < nk; ++k)
4772
65.7k
            cur[k] = STBI__BYTECAST(raw[k] + ((prior[k] + cur[k-filter_bytes])>>1));
4773
94.1k
         break;
4774
103k
      case STBI__F_paeth:
4775
256k
         for (k = 0; k < filter_bytes; ++k)
4776
153k
            cur[k] = STBI__BYTECAST(raw[k] + prior[k]); // prior[k] == stbi__paeth(0,prior[k],0)
4777
242k
         for (k = filter_bytes; k < nk; ++k)
4778
139k
            cur[k] = STBI__BYTECAST(raw[k] + stbi__paeth(cur[k-filter_bytes], prior[k], prior[k-filter_bytes]));
4779
103k
         break;
4780
243
      case STBI__F_avg_first:
4781
243
         memcpy(cur, raw, filter_bytes);
4782
282k
         for (k = filter_bytes; k < nk; ++k)
4783
282k
            cur[k] = STBI__BYTECAST(raw[k] + (cur[k-filter_bytes] >> 1));
4784
243
         break;
4785
950k
      }
4786
4787
950k
      raw += nk;
4788
4789
      // expand decoded bits in cur to dest, also adding an extra alpha channel if desired
4790
950k
      if (depth < 8) {
4791
781k
         stbi_uc scale = (color == 0) ? stbi__depth_scale_table[depth] : 1; // scale grayscale values to 0..255 range
4792
781k
         stbi_uc *in = cur;
4793
781k
         stbi_uc *out = dest;
4794
781k
         stbi_uc inb = 0;
4795
781k
         stbi__uint32 nsmp = x*img_n;
4796
4797
         // expand bits to bytes first
4798
781k
         if (depth == 4) {
4799
757k
            for (i=0; i < nsmp; ++i) {
4800
608k
               if ((i & 1) == 0) inb = *in++;
4801
608k
               *out++ = scale * (inb >> 4);
4802
608k
               inb <<= 4;
4803
608k
            }
4804
633k
         } else if (depth == 2) {
4805
16.7M
            for (i=0; i < nsmp; ++i) {
4806
16.7M
               if ((i & 3) == 0) inb = *in++;
4807
16.7M
               *out++ = scale * (inb >> 6);
4808
16.7M
               inb <<= 2;
4809
16.7M
            }
4810
629k
         } else {
4811
629k
            STBI_ASSERT(depth == 1);
4812
346M
            for (i=0; i < nsmp; ++i) {
4813
346M
               if ((i & 7) == 0) inb = *in++;
4814
346M
               *out++ = scale * (inb >> 7);
4815
346M
               inb <<= 1;
4816
346M
            }
4817
629k
         }
4818
4819
         // insert alpha=255 values if desired
4820
781k
         if (img_n != out_n)
4821
144k
            stbi__create_png_alpha_expand8(dest, dest, x, img_n);
4822
781k
      } else if (depth == 8) {
4823
23.1k
         if (img_n == out_n)
4824
21.5k
            memcpy(dest, cur, x*img_n);
4825
1.58k
         else
4826
1.58k
            stbi__create_png_alpha_expand8(dest, cur, x, img_n);
4827
145k
      } else if (depth == 16) {
4828
         // convert the image data from big-endian to platform-native
4829
145k
         stbi__uint16 *dest16 = (stbi__uint16*)dest;
4830
145k
         stbi__uint32 nsmp = x*img_n;
4831
4832
145k
         if (img_n == out_n) {
4833
245M
            for (i = 0; i < nsmp; ++i, ++dest16, cur += 2)
4834
245M
               *dest16 = (cur[0] << 8) | cur[1];
4835
74.7k
         } else {
4836
74.7k
            STBI_ASSERT(img_n+1 == out_n);
4837
74.7k
            if (img_n == 1) {
4838
163k
               for (i = 0; i < x; ++i, dest16 += 2, cur += 2) {
4839
90.1k
                  dest16[0] = (cur[0] << 8) | cur[1];
4840
90.1k
                  dest16[1] = 0xffff;
4841
90.1k
               }
4842
73.3k
            } else {
4843
1.44k
               STBI_ASSERT(img_n == 3);
4844
1.46M
               for (i = 0; i < x; ++i, dest16 += 4, cur += 6) {
4845
1.45M
                  dest16[0] = (cur[0] << 8) | cur[1];
4846
1.45M
                  dest16[1] = (cur[2] << 8) | cur[3];
4847
1.45M
                  dest16[2] = (cur[4] << 8) | cur[5];
4848
1.45M
                  dest16[3] = 0xffff;
4849
1.45M
               }
4850
1.44k
            }
4851
74.7k
         }
4852
145k
      }
4853
950k
   }
4854
4855
1.04k
   STBI_FREE(filter_buf);
4856
1.04k
   if (!all_ok) return 0;
4857
4858
993
   return 1;
4859
1.04k
}
4860
4861
static int stbi__create_png_image(stbi__png *a, stbi_uc *image_data, stbi__uint32 image_data_len, int out_n, int depth, int color, int interlaced)
4862
603
{
4863
603
   int bytes = (depth == 16 ? 2 : 1);
4864
603
   int out_bytes = out_n * bytes;
4865
603
   stbi_uc *final;
4866
603
   int p;
4867
603
   if (!interlaced)
4868
291
      return stbi__create_png_image_raw(a, image_data, image_data_len, out_n, a->s->img_x, a->s->img_y, depth, color);
4869
4870
   // de-interlacing
4871
312
   final = (stbi_uc *) stbi__malloc_mad3(a->s->img_x, a->s->img_y, out_bytes, 0);
4872
312
   if (!final) return stbi__err("outofmem", "Out of memory");
4873
1.35k
   for (p=0; p < 7; ++p) {
4874
1.25k
      int xorig[] = { 0,4,0,2,0,1,0 };
4875
1.25k
      int yorig[] = { 0,0,4,0,2,0,1 };
4876
1.25k
      int xspc[]  = { 8,8,4,4,2,2,1 };
4877
1.25k
      int yspc[]  = { 8,8,8,4,4,2,2 };
4878
1.25k
      int i,j,x,y;
4879
      // pass1_x[4] = 0, pass1_x[5] = 1, pass1_x[12] = 1
4880
1.25k
      x = (a->s->img_x - xorig[p] + xspc[p]-1) / xspc[p];
4881
1.25k
      y = (a->s->img_y - yorig[p] + yspc[p]-1) / yspc[p];
4882
1.25k
      if (x && y) {
4883
962
         stbi__uint32 img_len = ((((a->s->img_n * x * depth) + 7) >> 3) + 1) * y;
4884
962
         if (!stbi__create_png_image_raw(a, image_data, image_data_len, out_n, x, y, depth, color)) {
4885
205
            STBI_FREE(final);
4886
205
            return 0;
4887
205
         }
4888
514k
         for (j=0; j < y; ++j) {
4889
85.6M
            for (i=0; i < x; ++i) {
4890
85.1M
               int out_y = j*yspc[p]+yorig[p];
4891
85.1M
               int out_x = i*xspc[p]+xorig[p];
4892
85.1M
               memcpy(final + out_y*a->s->img_x*out_bytes + out_x*out_bytes,
4893
85.1M
                      a->out + (j*x+i)*out_bytes, out_bytes);
4894
85.1M
            }
4895
514k
         }
4896
757
         STBI_FREE(a->out);
4897
757
         image_data += img_len;
4898
757
         image_data_len -= img_len;
4899
757
      }
4900
1.25k
   }
4901
107
   a->out = final;
4902
4903
107
   return 1;
4904
312
}
4905
4906
static int stbi__compute_transparency(stbi__png *z, stbi_uc tc[3], int out_n)
4907
58
{
4908
58
   stbi__context *s = z->s;
4909
58
   stbi__uint32 i, pixel_count = s->img_x * s->img_y;
4910
58
   stbi_uc *p = z->out;
4911
4912
   // compute color-based transparency, assuming we've
4913
   // already got 255 as the alpha value in the output
4914
58
   STBI_ASSERT(out_n == 2 || out_n == 4);
4915
4916
58
   if (out_n == 2) {
4917
620k
      for (i=0; i < pixel_count; ++i) {
4918
620k
         p[1] = (p[0] == tc[0] ? 0 : 255);
4919
620k
         p += 2;
4920
620k
      }
4921
36
   } else {
4922
4.14M
      for (i=0; i < pixel_count; ++i) {
4923
4.14M
         if (p[0] == tc[0] && p[1] == tc[1] && p[2] == tc[2])
4924
179k
            p[3] = 0;
4925
4.14M
         p += 4;
4926
4.14M
      }
4927
36
   }
4928
58
   return 1;
4929
58
}
4930
4931
static int stbi__compute_transparency16(stbi__png *z, stbi__uint16 tc[3], int out_n)
4932
69
{
4933
69
   stbi__context *s = z->s;
4934
69
   stbi__uint32 i, pixel_count = s->img_x * s->img_y;
4935
69
   stbi__uint16 *p = (stbi__uint16*) z->out;
4936
4937
   // compute color-based transparency, assuming we've
4938
   // already got 65535 as the alpha value in the output
4939
69
   STBI_ASSERT(out_n == 2 || out_n == 4);
4940
4941
69
   if (out_n == 2) {
4942
87.1k
      for (i = 0; i < pixel_count; ++i) {
4943
87.1k
         p[1] = (p[0] == tc[0] ? 0 : 65535);
4944
87.1k
         p += 2;
4945
87.1k
      }
4946
46
   } else {
4947
1.45M
      for (i = 0; i < pixel_count; ++i) {
4948
1.45M
         if (p[0] == tc[0] && p[1] == tc[1] && p[2] == tc[2])
4949
99.3k
            p[3] = 0;
4950
1.45M
         p += 4;
4951
1.45M
      }
4952
46
   }
4953
69
   return 1;
4954
69
}
4955
4956
static int stbi__expand_png_palette(stbi__png *a, stbi_uc *palette, int len, int pal_img_n)
4957
35
{
4958
35
   stbi__uint32 i, pixel_count = a->s->img_x * a->s->img_y;
4959
35
   stbi_uc *p, *temp_out, *orig = a->out;
4960
4961
35
   p = (stbi_uc *) stbi__malloc_mad2(pixel_count, pal_img_n, 0);
4962
35
   if (p == NULL) return stbi__err("outofmem", "Out of memory");
4963
4964
   // between here and free(out) below, exitting would leak
4965
35
   temp_out = p;
4966
4967
35
   if (pal_img_n == 3) {
4968
0
      for (i=0; i < pixel_count; ++i) {
4969
0
         int n = orig[i]*4;
4970
0
         p[0] = palette[n  ];
4971
0
         p[1] = palette[n+1];
4972
0
         p[2] = palette[n+2];
4973
0
         p += 3;
4974
0
      }
4975
35
   } else {
4976
11.5M
      for (i=0; i < pixel_count; ++i) {
4977
11.5M
         int n = orig[i]*4;
4978
11.5M
         p[0] = palette[n  ];
4979
11.5M
         p[1] = palette[n+1];
4980
11.5M
         p[2] = palette[n+2];
4981
11.5M
         p[3] = palette[n+3];
4982
11.5M
         p += 4;
4983
11.5M
      }
4984
35
   }
4985
35
   STBI_FREE(a->out);
4986
35
   a->out = temp_out;
4987
4988
35
   STBI_NOTUSED(len);
4989
4990
35
   return 1;
4991
35
}
4992
4993
static int stbi__unpremultiply_on_load_global = 0;
4994
static int stbi__de_iphone_flag_global = 0;
4995
4996
STBIDEF void stbi_set_unpremultiply_on_load(int flag_true_if_should_unpremultiply)
4997
0
{
4998
0
   stbi__unpremultiply_on_load_global = flag_true_if_should_unpremultiply;
4999
0
}
5000
5001
STBIDEF void stbi_convert_iphone_png_to_rgb(int flag_true_if_should_convert)
5002
0
{
5003
0
   stbi__de_iphone_flag_global = flag_true_if_should_convert;
5004
0
}
5005
5006
#ifndef STBI_THREAD_LOCAL
5007
#define stbi__unpremultiply_on_load  stbi__unpremultiply_on_load_global
5008
#define stbi__de_iphone_flag  stbi__de_iphone_flag_global
5009
#else
5010
static STBI_THREAD_LOCAL int stbi__unpremultiply_on_load_local, stbi__unpremultiply_on_load_set;
5011
static STBI_THREAD_LOCAL int stbi__de_iphone_flag_local, stbi__de_iphone_flag_set;
5012
5013
STBIDEF void stbi_set_unpremultiply_on_load_thread(int flag_true_if_should_unpremultiply)
5014
0
{
5015
0
   stbi__unpremultiply_on_load_local = flag_true_if_should_unpremultiply;
5016
0
   stbi__unpremultiply_on_load_set = 1;
5017
0
}
5018
5019
STBIDEF void stbi_convert_iphone_png_to_rgb_thread(int flag_true_if_should_convert)
5020
0
{
5021
0
   stbi__de_iphone_flag_local = flag_true_if_should_convert;
5022
0
   stbi__de_iphone_flag_set = 1;
5023
0
}
5024
5025
0
#define stbi__unpremultiply_on_load  (stbi__unpremultiply_on_load_set           \
5026
0
                                       ? stbi__unpremultiply_on_load_local      \
5027
0
                                       : stbi__unpremultiply_on_load_global)
5028
344
#define stbi__de_iphone_flag  (stbi__de_iphone_flag_set                         \
5029
344
                                ? stbi__de_iphone_flag_local                    \
5030
344
                                : stbi__de_iphone_flag_global)
5031
#endif // STBI_THREAD_LOCAL
5032
5033
static void stbi__de_iphone(stbi__png *z)
5034
0
{
5035
0
   stbi__context *s = z->s;
5036
0
   stbi__uint32 i, pixel_count = s->img_x * s->img_y;
5037
0
   stbi_uc *p = z->out;
5038
5039
0
   if (s->img_out_n == 3) {  // convert bgr to rgb
5040
0
      for (i=0; i < pixel_count; ++i) {
5041
0
         stbi_uc t = p[0];
5042
0
         p[0] = p[2];
5043
0
         p[2] = t;
5044
0
         p += 3;
5045
0
      }
5046
0
   } else {
5047
0
      STBI_ASSERT(s->img_out_n == 4);
5048
0
      if (stbi__unpremultiply_on_load) {
5049
         // convert bgr to rgb and unpremultiply
5050
0
         for (i=0; i < pixel_count; ++i) {
5051
0
            stbi_uc a = p[3];
5052
0
            stbi_uc t = p[0];
5053
0
            if (a) {
5054
0
               stbi_uc half = a / 2;
5055
0
               p[0] = (p[2] * 255 + half) / a;
5056
0
               p[1] = (p[1] * 255 + half) / a;
5057
0
               p[2] = ( t   * 255 + half) / a;
5058
0
            } else {
5059
0
               p[0] = p[2];
5060
0
               p[2] = t;
5061
0
            }
5062
0
            p += 4;
5063
0
         }
5064
0
      } else {
5065
         // convert bgr to rgb
5066
0
         for (i=0; i < pixel_count; ++i) {
5067
0
            stbi_uc t = p[0];
5068
0
            p[0] = p[2];
5069
0
            p[2] = t;
5070
0
            p += 4;
5071
0
         }
5072
0
      }
5073
0
   }
5074
0
}
5075
5076
14.7k
#define STBI__PNG_TYPE(a,b,c,d)  (((unsigned) (a) << 24) + ((unsigned) (b) << 16) + ((unsigned) (c) << 8) + (unsigned) (d))
5077
5078
static int stbi__parse_png_file(stbi__png *z, int scan, int req_comp)
5079
8.68k
{
5080
8.68k
   stbi_uc palette[1024], pal_img_n=0;
5081
8.68k
   stbi_uc has_trans=0, tc[3]={0};
5082
8.68k
   stbi__uint16 tc16[3];
5083
8.68k
   stbi__uint32 ioff=0, idata_limit=0, i, pal_len=0;
5084
8.68k
   int first=1,k,interlace=0, color=0, is_iphone=0;
5085
8.68k
   stbi__context *s = z->s;
5086
5087
8.68k
   z->expanded = NULL;
5088
8.68k
   z->idata = NULL;
5089
8.68k
   z->out = NULL;
5090
5091
8.68k
   if (!stbi__check_png_header(s)) return 0;
5092
5093
3.74k
   if (scan == STBI__SCAN_type) return 1;
5094
5095
16.7k
   for (;;) {
5096
16.7k
      stbi__pngchunk c = stbi__get_chunk_header(s);
5097
16.7k
      switch (c.type) {
5098
1.03k
         case STBI__PNG_TYPE('C','g','B','I'):
5099
1.03k
            is_iphone = 1;
5100
1.03k
            stbi__skip(s, c.length);
5101
1.03k
            break;
5102
3.31k
         case STBI__PNG_TYPE('I','H','D','R'): {
5103
3.31k
            int comp,filter;
5104
3.31k
            if (!first) return stbi__err("multiple IHDR","Corrupt PNG");
5105
3.31k
            first = 0;
5106
3.31k
            if (c.length != 13) return stbi__err("bad IHDR len","Corrupt PNG");
5107
3.28k
            s->img_x = stbi__get32be(s);
5108
3.28k
            s->img_y = stbi__get32be(s);
5109
3.28k
            if (s->img_y > STBI_MAX_DIMENSIONS) return stbi__err("too large","Very large image (corrupt?)");
5110
3.27k
            if (s->img_x > STBI_MAX_DIMENSIONS) return stbi__err("too large","Very large image (corrupt?)");
5111
3.24k
            z->depth = stbi__get8(s);  if (z->depth != 1 && z->depth != 2 && z->depth != 4 && z->depth != 8 && z->depth != 16)  return stbi__err("1/2/4/8/16-bit only","PNG not supported: 1/2/4/8/16-bit only");
5112
3.19k
            color = stbi__get8(s);  if (color > 6)         return stbi__err("bad ctype","Corrupt PNG");
5113
3.18k
            if (color == 3 && z->depth == 16)                  return stbi__err("bad ctype","Corrupt PNG");
5114
3.18k
            if (color == 3) pal_img_n = 3; else if (color & 1) return stbi__err("bad ctype","Corrupt PNG");
5115
3.18k
            comp  = stbi__get8(s);  if (comp) return stbi__err("bad comp method","Corrupt PNG");
5116
3.18k
            filter= stbi__get8(s);  if (filter) return stbi__err("bad filter method","Corrupt PNG");
5117
3.18k
            interlace = stbi__get8(s); if (interlace>1) return stbi__err("bad interlace method","Corrupt PNG");
5118
3.18k
            if (!s->img_x || !s->img_y) return stbi__err("0-pixel image","Corrupt PNG");
5119
3.17k
            if (!pal_img_n) {
5120
2.71k
               s->img_n = (color & 2 ? 3 : 1) + (color & 4 ? 1 : 0);
5121
2.71k
               if ((1 << 30) / s->img_x / s->img_n < s->img_y) return stbi__err("too large", "Image too large to decode");
5122
2.71k
            } else {
5123
               // if paletted, then pal_n is our final components, and
5124
               // img_n is # components to decompress/filter.
5125
468
               s->img_n = 1;
5126
468
               if ((1 << 30) / s->img_x / 4 < s->img_y) return stbi__err("too large","Corrupt PNG");
5127
468
            }
5128
            // even with SCAN_header, have to scan to see if we have a tRNS
5129
3.15k
            break;
5130
3.17k
         }
5131
5132
3.15k
         case STBI__PNG_TYPE('P','L','T','E'):  {
5133
1.35k
            if (first) return stbi__err("first not IHDR", "Corrupt PNG");
5134
1.35k
            if (c.length > 256*3) return stbi__err("invalid PLTE","Corrupt PNG");
5135
1.32k
            pal_len = c.length / 3;
5136
1.32k
            if (pal_len * 3 != c.length) return stbi__err("invalid PLTE","Corrupt PNG");
5137
18.7k
            for (i=0; i < pal_len; ++i) {
5138
17.3k
               palette[i*4+0] = stbi__get8(s);
5139
17.3k
               palette[i*4+1] = stbi__get8(s);
5140
17.3k
               palette[i*4+2] = stbi__get8(s);
5141
17.3k
               palette[i*4+3] = 255;
5142
17.3k
            }
5143
1.31k
            break;
5144
1.32k
         }
5145
5146
2.39k
         case STBI__PNG_TYPE('t','R','N','S'): {
5147
2.39k
            if (first) return stbi__err("first not IHDR", "Corrupt PNG");
5148
2.39k
            if (z->idata) return stbi__err("tRNS after IDAT","Corrupt PNG");
5149
2.39k
            if (pal_img_n) {
5150
1.01k
               if (scan == STBI__SCAN_header) { s->img_n = 4; return 1; }
5151
901
               if (pal_len == 0) return stbi__err("tRNS before PLTE","Corrupt PNG");
5152
886
               if (c.length > pal_len) return stbi__err("bad tRNS len","Corrupt PNG");
5153
838
               pal_img_n = 4;
5154
3.79k
               for (i=0; i < c.length; ++i)
5155
2.95k
                  palette[i*4+3] = stbi__get8(s);
5156
1.37k
            } else {
5157
1.37k
               if (!(s->img_n & 1)) return stbi__err("tRNS with alpha","Corrupt PNG");
5158
1.37k
               if (c.length != (stbi__uint32) s->img_n*2) return stbi__err("bad tRNS len","Corrupt PNG");
5159
1.33k
               has_trans = 1;
5160
               // non-paletted with tRNS = constant alpha. if header-scanning, we can stop now.
5161
1.33k
               if (scan == STBI__SCAN_header) { ++s->img_n; return 1; }
5162
1.13k
               if (z->depth == 16) {
5163
1.82k
                  for (k = 0; k < s->img_n && k < 3; ++k) // extra loop test to suppress false GCC warning
5164
1.26k
                     tc16[k] = (stbi__uint16)stbi__get16be(s); // copy the values as-is
5165
570
               } else {
5166
1.72k
                  for (k = 0; k < s->img_n && k < 3; ++k)
5167
1.15k
                     tc[k] = (stbi_uc)(stbi__get16be(s) & 255) * stbi__depth_scale_table[z->depth]; // non 8-bit images will be larger
5168
570
               }
5169
1.13k
            }
5170
1.97k
            break;
5171
2.39k
         }
5172
5173
5.48k
         case STBI__PNG_TYPE('I','D','A','T'): {
5174
5.48k
            if (first) return stbi__err("first not IHDR", "Corrupt PNG");
5175
5.48k
            if (pal_img_n && !pal_len) return stbi__err("no PLTE","Corrupt PNG");
5176
5.48k
            if (scan == STBI__SCAN_header) {
5177
               // header scan definitely stops at first IDAT
5178
1.13k
               if (pal_img_n)
5179
72
                  s->img_n = pal_img_n;
5180
1.13k
               return 1;
5181
1.13k
            }
5182
4.35k
            if (c.length > (1u << 30)) return stbi__err("IDAT size limit", "IDAT section larger than 2^30 bytes");
5183
4.34k
            if ((int)(ioff + c.length) < (int)ioff) return 0;
5184
4.34k
            if (ioff + c.length > idata_limit) {
5185
1.32k
               stbi__uint32 idata_limit_old = idata_limit;
5186
1.32k
               stbi_uc *p;
5187
1.32k
               if (idata_limit == 0) idata_limit = c.length > 4096 ? c.length : 4096;
5188
1.51k
               while (ioff + c.length > idata_limit)
5189
192
                  idata_limit *= 2;
5190
1.32k
               STBI_NOTUSED(idata_limit_old);
5191
1.32k
               p = (stbi_uc *) STBI_REALLOC_SIZED(z->idata, idata_limit_old, idata_limit); if (p == NULL) return stbi__err("outofmem", "Out of memory");
5192
1.32k
               z->idata = p;
5193
1.32k
            }
5194
4.34k
            if (!stbi__getn(s, z->idata+ioff,c.length)) return stbi__err("outofdata","Corrupt PNG");
5195
4.25k
            ioff += c.length;
5196
4.25k
            break;
5197
4.34k
         }
5198
5199
1.18k
         case STBI__PNG_TYPE('I','E','N','D'): {
5200
1.18k
            stbi__uint32 raw_len, bpl;
5201
1.18k
            if (first) return stbi__err("first not IHDR", "Corrupt PNG");
5202
1.18k
            if (scan != STBI__SCAN_load) return 1;
5203
1.16k
            if (z->idata == NULL) return stbi__err("no IDAT","Corrupt PNG");
5204
            // initial guess for decoded data size to avoid unnecessary reallocs
5205
1.14k
            bpl = (s->img_x * z->depth + 7) / 8; // bytes per line, per component
5206
1.14k
            raw_len = bpl * s->img_y * s->img_n /* pixels */ + s->img_y /* filter mode per row */;
5207
1.14k
            z->expanded = (stbi_uc *) stbi_zlib_decode_malloc_guesssize_headerflag((char *) z->idata, ioff, raw_len, (int *) &raw_len, !is_iphone);
5208
1.14k
            if (z->expanded == NULL) return 0; // zlib should set error
5209
603
            STBI_FREE(z->idata); z->idata = NULL;
5210
603
            if ((req_comp == s->img_n+1 && req_comp != 3 && !pal_img_n) || has_trans)
5211
230
               s->img_out_n = s->img_n+1;
5212
373
            else
5213
373
               s->img_out_n = s->img_n;
5214
603
            if (!stbi__create_png_image(z, z->expanded, raw_len, s->img_out_n, z->depth, color, interlace)) return 0;
5215
343
            if (has_trans) {
5216
127
               if (z->depth == 16) {
5217
69
                  if (!stbi__compute_transparency16(z, tc16, s->img_out_n)) return 0;
5218
69
               } else {
5219
58
                  if (!stbi__compute_transparency(z, tc, s->img_out_n)) return 0;
5220
58
               }
5221
127
            }
5222
343
            if (is_iphone && stbi__de_iphone_flag && s->img_out_n > 2)
5223
0
               stbi__de_iphone(z);
5224
343
            if (pal_img_n) {
5225
               // pal_img_n == 3 or 4
5226
35
               s->img_n = pal_img_n; // record the actual colors we had
5227
35
               s->img_out_n = pal_img_n;
5228
35
               if (req_comp >= 3) s->img_out_n = req_comp;
5229
35
               if (!stbi__expand_png_palette(z, palette, pal_len, s->img_out_n))
5230
0
                  return 0;
5231
308
            } else if (has_trans) {
5232
               // non-paletted image with tRNS -> source image has (constant) alpha
5233
127
               ++s->img_n;
5234
127
            }
5235
343
            STBI_FREE(z->expanded); z->expanded = NULL;
5236
            // end of PNG chunk, read and skip CRC
5237
343
            stbi__get32be(s);
5238
343
            return 1;
5239
343
         }
5240
5241
1.95k
         default:
5242
            // if critical, fail
5243
1.95k
            if (first) return stbi__err("first not IHDR", "Corrupt PNG");
5244
1.53k
            if ((c.type & (1 << 29)) == 0) {
5245
279
               #ifndef STBI_NO_FAILURE_STRINGS
5246
               // not threadsafe
5247
279
               static char invalid_chunk[] = "XXXX PNG chunk not known";
5248
279
               invalid_chunk[0] = STBI__BYTECAST(c.type >> 24);
5249
279
               invalid_chunk[1] = STBI__BYTECAST(c.type >> 16);
5250
279
               invalid_chunk[2] = STBI__BYTECAST(c.type >>  8);
5251
279
               invalid_chunk[3] = STBI__BYTECAST(c.type >>  0);
5252
279
               #endif
5253
279
               return stbi__err(invalid_chunk, "PNG not supported: unknown PNG chunk type");
5254
279
            }
5255
1.25k
            stbi__skip(s, c.length);
5256
1.25k
            break;
5257
16.7k
      }
5258
      // end of PNG chunk, read and skip CRC
5259
12.9k
      stbi__get32be(s);
5260
12.9k
   }
5261
3.74k
}
5262
5263
static void *stbi__do_png(stbi__png *p, int *x, int *y, int *n, int req_comp, stbi__result_info *ri)
5264
1.45k
{
5265
1.45k
   void *result=NULL;
5266
1.45k
   if (req_comp < 0 || req_comp > 4) return stbi__errpuc("bad req_comp", "Internal error");
5267
1.45k
   if (stbi__parse_png_file(p, STBI__SCAN_load, req_comp)) {
5268
343
      if (p->depth <= 8)
5269
199
         ri->bits_per_channel = 8;
5270
144
      else if (p->depth == 16)
5271
144
         ri->bits_per_channel = 16;
5272
0
      else
5273
0
         return stbi__errpuc("bad bits_per_channel", "PNG not supported: unsupported color depth");
5274
343
      result = p->out;
5275
343
      p->out = NULL;
5276
343
      if (req_comp && req_comp != p->s->img_out_n) {
5277
200
         if (ri->bits_per_channel == 8)
5278
106
            result = stbi__convert_format((unsigned char *) result, p->s->img_out_n, req_comp, p->s->img_x, p->s->img_y);
5279
94
         else
5280
94
            result = stbi__convert_format16((stbi__uint16 *) result, p->s->img_out_n, req_comp, p->s->img_x, p->s->img_y);
5281
200
         p->s->img_out_n = req_comp;
5282
200
         if (result == NULL) return result;
5283
200
      }
5284
343
      *x = p->s->img_x;
5285
343
      *y = p->s->img_y;
5286
343
      if (n) *n = p->s->img_n;
5287
343
   }
5288
1.45k
   STBI_FREE(p->out);      p->out      = NULL;
5289
1.45k
   STBI_FREE(p->expanded); p->expanded = NULL;
5290
1.45k
   STBI_FREE(p->idata);    p->idata    = NULL;
5291
5292
1.45k
   return result;
5293
1.45k
}
5294
5295
static void *stbi__png_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
5296
1.45k
{
5297
1.45k
   stbi__png p;
5298
1.45k
   p.s = s;
5299
1.45k
   return stbi__do_png(&p, x,y,comp,req_comp, ri);
5300
1.45k
}
5301
5302
static int stbi__png_test(stbi__context *s)
5303
6.48k
{
5304
6.48k
   int r;
5305
6.48k
   r = stbi__check_png_header(s);
5306
6.48k
   stbi__rewind(s);
5307
6.48k
   return r;
5308
6.48k
}
5309
5310
static int stbi__png_info_raw(stbi__png *p, int *x, int *y, int *comp)
5311
7.22k
{
5312
7.22k
   if (!stbi__parse_png_file(p, STBI__SCAN_header, 0)) {
5313
5.76k
      stbi__rewind( p->s );
5314
5.76k
      return 0;
5315
5.76k
   }
5316
1.45k
   if (x) *x = p->s->img_x;
5317
1.45k
   if (y) *y = p->s->img_y;
5318
1.45k
   if (comp) *comp = p->s->img_n;
5319
1.45k
   return 1;
5320
7.22k
}
5321
5322
static int stbi__png_info(stbi__context *s, int *x, int *y, int *comp)
5323
7.22k
{
5324
7.22k
   stbi__png p;
5325
7.22k
   p.s = s;
5326
7.22k
   return stbi__png_info_raw(&p, x, y, comp);
5327
7.22k
}
5328
5329
static int stbi__png_is16(stbi__context *s)
5330
0
{
5331
0
   stbi__png p;
5332
0
   p.s = s;
5333
0
   if (!stbi__png_info_raw(&p, NULL, NULL, NULL))
5334
0
     return 0;
5335
0
   if (p.depth != 16) {
5336
0
      stbi__rewind(p.s);
5337
0
      return 0;
5338
0
   }
5339
0
   return 1;
5340
0
}
5341
#endif
5342
5343
// Microsoft/Windows BMP image
5344
5345
#ifndef STBI_NO_BMP
5346
static int stbi__bmp_test_raw(stbi__context *s)
5347
5.02k
{
5348
5.02k
   int r;
5349
5.02k
   int sz;
5350
5.02k
   if (stbi__get8(s) != 'B') return 0;
5351
1.12k
   if (stbi__get8(s) != 'M') return 0;
5352
1.12k
   stbi__get32le(s); // discard filesize
5353
1.12k
   stbi__get16le(s); // discard reserved
5354
1.12k
   stbi__get16le(s); // discard reserved
5355
1.12k
   stbi__get32le(s); // discard data offset
5356
1.12k
   sz = stbi__get32le(s);
5357
1.12k
   r = (sz == 12 || sz == 40 || sz == 56 || sz == 108 || sz == 124);
5358
1.12k
   return r;
5359
1.12k
}
5360
5361
static int stbi__bmp_test(stbi__context *s)
5362
5.02k
{
5363
5.02k
   int r = stbi__bmp_test_raw(s);
5364
5.02k
   stbi__rewind(s);
5365
5.02k
   return r;
5366
5.02k
}
5367
5368
5369
// returns 0..31 for the highest set bit
5370
static int stbi__high_bit(unsigned int z)
5371
1.23k
{
5372
1.23k
   int n=0;
5373
1.23k
   if (z == 0) return -1;
5374
997
   if (z >= 0x10000) { n += 16; z >>= 16; }
5375
997
   if (z >= 0x00100) { n +=  8; z >>=  8; }
5376
997
   if (z >= 0x00010) { n +=  4; z >>=  4; }
5377
997
   if (z >= 0x00004) { n +=  2; z >>=  2; }
5378
997
   if (z >= 0x00002) { n +=  1;/* >>=  1;*/ }
5379
997
   return n;
5380
1.23k
}
5381
5382
static int stbi__bitcount(unsigned int a)
5383
1.23k
{
5384
1.23k
   a = (a & 0x55555555) + ((a >>  1) & 0x55555555); // max 2
5385
1.23k
   a = (a & 0x33333333) + ((a >>  2) & 0x33333333); // max 4
5386
1.23k
   a = (a + (a >> 4)) & 0x0f0f0f0f; // max 8 per 4, now 8 bits
5387
1.23k
   a = (a + (a >> 8)); // max 16 per 8 bits
5388
1.23k
   a = (a + (a >> 16)); // max 32 per 8 bits
5389
1.23k
   return a & 0xff;
5390
1.23k
}
5391
5392
// extract an arbitrarily-aligned N-bit value (N=bits)
5393
// from v, and then make it 8-bits long and fractionally
5394
// extend it to full full range.
5395
static int stbi__shiftsigned(unsigned int v, int shift, int bits)
5396
2.11G
{
5397
2.11G
   static unsigned int mul_table[9] = {
5398
2.11G
      0,
5399
2.11G
      0xff/*0b11111111*/, 0x55/*0b01010101*/, 0x49/*0b01001001*/, 0x11/*0b00010001*/,
5400
2.11G
      0x21/*0b00100001*/, 0x41/*0b01000001*/, 0x81/*0b10000001*/, 0x01/*0b00000001*/,
5401
2.11G
   };
5402
2.11G
   static unsigned int shift_table[9] = {
5403
2.11G
      0, 0,0,1,0,2,4,6,0,
5404
2.11G
   };
5405
2.11G
   if (shift < 0)
5406
527M
      v <<= -shift;
5407
1.59G
   else
5408
1.59G
      v >>= shift;
5409
2.11G
   STBI_ASSERT(v < 256);
5410
2.11G
   v >>= (8-bits);
5411
2.11G
   STBI_ASSERT(bits >= 0 && bits <= 8);
5412
2.11G
   return (int) ((unsigned) v * mul_table[bits]) >> shift_table[bits];
5413
2.11G
}
5414
5415
typedef struct
5416
{
5417
   int bpp, offset, hsz;
5418
   unsigned int mr,mg,mb,ma, all_a;
5419
   int extra_read;
5420
} stbi__bmp_data;
5421
5422
static int stbi__bmp_set_mask_defaults(stbi__bmp_data *info, int compress)
5423
1.20k
{
5424
   // BI_BITFIELDS specifies masks explicitly, don't override
5425
1.20k
   if (compress == 3)
5426
0
      return 1;
5427
5428
1.20k
   if (compress == 0) {
5429
347
      if (info->bpp == 16) {
5430
96
         info->mr = 31u << 10;
5431
96
         info->mg = 31u <<  5;
5432
96
         info->mb = 31u <<  0;
5433
251
      } else if (info->bpp == 32) {
5434
62
         info->mr = 0xffu << 16;
5435
62
         info->mg = 0xffu <<  8;
5436
62
         info->mb = 0xffu <<  0;
5437
62
         info->ma = 0xffu << 24;
5438
62
         info->all_a = 0; // if all_a is 0 at end, then we loaded alpha channel but it was all 0
5439
189
      } else {
5440
         // otherwise, use defaults, which is all-0
5441
189
         info->mr = info->mg = info->mb = info->ma = 0;
5442
189
      }
5443
347
      return 1;
5444
347
   }
5445
854
   return 0; // error
5446
1.20k
}
5447
5448
static void *stbi__bmp_parse_header(stbi__context *s, stbi__bmp_data *info)
5449
6.43k
{
5450
6.43k
   int hsz;
5451
6.43k
   if (stbi__get8(s) != 'B' || stbi__get8(s) != 'M') return stbi__errpuc("not BMP", "Corrupt BMP");
5452
2.54k
   stbi__get32le(s); // discard filesize
5453
2.54k
   stbi__get16le(s); // discard reserved
5454
2.54k
   stbi__get16le(s); // discard reserved
5455
2.54k
   info->offset = stbi__get32le(s);
5456
2.54k
   info->hsz = hsz = stbi__get32le(s);
5457
2.54k
   info->mr = info->mg = info->mb = info->ma = 0;
5458
2.54k
   info->extra_read = 14;
5459
5460
2.54k
   if (info->offset < 0) return stbi__errpuc("bad BMP", "bad BMP");
5461
5462
2.51k
   if (hsz != 12 && hsz != 40 && hsz != 56 && hsz != 108 && hsz != 124) return stbi__errpuc("unknown BMP", "BMP type not supported: unknown");
5463
2.39k
   if (hsz == 12) {
5464
805
      s->img_x = stbi__get16le(s);
5465
805
      s->img_y = stbi__get16le(s);
5466
1.59k
   } else {
5467
1.59k
      s->img_x = stbi__get32le(s);
5468
1.59k
      s->img_y = stbi__get32le(s);
5469
1.59k
   }
5470
2.39k
   if (stbi__get16le(s) != 1) return stbi__errpuc("bad BMP", "bad BMP");
5471
2.38k
   info->bpp = stbi__get16le(s);
5472
2.38k
   if (hsz != 12) {
5473
1.58k
      int compress = stbi__get32le(s);
5474
1.58k
      if (compress == 1 || compress == 2) return stbi__errpuc("BMP RLE", "BMP type not supported: RLE");
5475
1.58k
      if (compress >= 4) return stbi__errpuc("BMP JPEG/PNG", "BMP type not supported: unsupported compression"); // this includes PNG/JPEG modes
5476
1.55k
      if (compress == 3 && info->bpp != 16 && info->bpp != 32) return stbi__errpuc("bad BMP", "bad BMP"); // bitfields requires 16 or 32 bits/pixel
5477
1.54k
      stbi__get32le(s); // discard sizeof
5478
1.54k
      stbi__get32le(s); // discard hres
5479
1.54k
      stbi__get32le(s); // discard vres
5480
1.54k
      stbi__get32le(s); // discard colorsused
5481
1.54k
      stbi__get32le(s); // discard max important
5482
1.54k
      if (hsz == 40 || hsz == 56) {
5483
464
         if (hsz == 56) {
5484
19
            stbi__get32le(s);
5485
19
            stbi__get32le(s);
5486
19
            stbi__get32le(s);
5487
19
            stbi__get32le(s);
5488
19
         }
5489
464
         if (info->bpp == 16 || info->bpp == 32) {
5490
369
            if (compress == 0) {
5491
122
               stbi__bmp_set_mask_defaults(info, compress);
5492
247
            } else if (compress == 3) {
5493
210
               info->mr = stbi__get32le(s);
5494
210
               info->mg = stbi__get32le(s);
5495
210
               info->mb = stbi__get32le(s);
5496
210
               info->extra_read += 12;
5497
               // not documented, but generated by photoshop and handled by mspaint
5498
210
               if (info->mr == info->mg && info->mg == info->mb) {
5499
                  // ?!?!?
5500
1
                  return stbi__errpuc("bad BMP", "bad BMP");
5501
1
               }
5502
210
            } else
5503
37
               return stbi__errpuc("bad BMP", "bad BMP");
5504
369
         }
5505
1.08k
      } else {
5506
         // V4/V5 header
5507
1.08k
         int i;
5508
1.08k
         if (hsz != 108 && hsz != 124)
5509
0
            return stbi__errpuc("bad BMP", "bad BMP");
5510
1.08k
         info->mr = stbi__get32le(s);
5511
1.08k
         info->mg = stbi__get32le(s);
5512
1.08k
         info->mb = stbi__get32le(s);
5513
1.08k
         info->ma = stbi__get32le(s);
5514
1.08k
         if (compress != 3) // override mr/mg/mb unless in BI_BITFIELDS mode, as per docs
5515
1.07k
            stbi__bmp_set_mask_defaults(info, compress);
5516
1.08k
         stbi__get32le(s); // discard color space
5517
14.0k
         for (i=0; i < 12; ++i)
5518
13.0k
            stbi__get32le(s); // discard color space parameters
5519
1.08k
         if (hsz == 124) {
5520
890
            stbi__get32le(s); // discard rendering intent
5521
890
            stbi__get32le(s); // discard offset of profile data
5522
890
            stbi__get32le(s); // discard size of profile data
5523
890
            stbi__get32le(s); // discard reserved
5524
890
         }
5525
1.08k
      }
5526
1.54k
   }
5527
2.30k
   return (void *) 1;
5528
2.38k
}
5529
5530
5531
static void *stbi__bmp_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
5532
1.12k
{
5533
1.12k
   stbi_uc *out;
5534
1.12k
   unsigned int mr=0,mg=0,mb=0,ma=0, all_a;
5535
1.12k
   stbi_uc pal[256][4];
5536
1.12k
   int psize=0,i,j,width;
5537
1.12k
   int flip_vertically, pad, target;
5538
1.12k
   stbi__bmp_data info;
5539
1.12k
   STBI_NOTUSED(ri);
5540
5541
1.12k
   info.all_a = 255;
5542
1.12k
   if (stbi__bmp_parse_header(s, &info) == NULL)
5543
0
      return NULL; // error code already set
5544
5545
1.12k
   flip_vertically = ((int) s->img_y) > 0;
5546
1.12k
   s->img_y = abs((int) s->img_y);
5547
5548
1.12k
   if (s->img_y > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
5549
1.10k
   if (s->img_x > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
5550
5551
1.05k
   mr = info.mr;
5552
1.05k
   mg = info.mg;
5553
1.05k
   mb = info.mb;
5554
1.05k
   ma = info.ma;
5555
1.05k
   all_a = info.all_a;
5556
5557
1.05k
   if (info.hsz == 12) {
5558
397
      if (info.bpp < 24)
5559
267
         psize = (info.offset - info.extra_read - 24) / 3;
5560
658
   } else {
5561
658
      if (info.bpp < 16)
5562
77
         psize = (info.offset - info.extra_read - info.hsz) >> 2;
5563
658
   }
5564
1.05k
   if (psize == 0) {
5565
      // accept some number of extra bytes after the header, but if the offset points either to before
5566
      // the header ends or implies a large amount of extra data, reject the file as malformed
5567
716
      int bytes_read_so_far = s->callback_already_read + (int)(s->img_buffer - s->img_buffer_original);
5568
716
      int header_limit = 1024; // max we actually read is below 256 bytes currently.
5569
716
      int extra_data_limit = 256*4; // what ordinarily goes here is a palette; 256 entries*4 bytes is its max size.
5570
716
      if (bytes_read_so_far <= 0 || bytes_read_so_far > header_limit) {
5571
0
         return stbi__errpuc("bad header", "Corrupt BMP");
5572
0
      }
5573
      // we established that bytes_read_so_far is positive and sensible.
5574
      // the first half of this test rejects offsets that are either too small positives, or
5575
      // negative, and guarantees that info.offset >= bytes_read_so_far > 0. this in turn
5576
      // ensures the number computed in the second half of the test can't overflow.
5577
716
      if (info.offset < bytes_read_so_far || info.offset - bytes_read_so_far > extra_data_limit) {
5578
71
         return stbi__errpuc("bad offset", "Corrupt BMP");
5579
645
      } else {
5580
645
         stbi__skip(s, info.offset - bytes_read_so_far);
5581
645
      }
5582
716
   }
5583
5584
984
   if (info.bpp == 24 && ma == 0xff000000)
5585
1
      s->img_n = 3;
5586
983
   else
5587
983
      s->img_n = ma ? 4 : 3;
5588
984
   if (req_comp && req_comp >= 3) // we can directly decode 3 or 4
5589
984
      target = req_comp;
5590
0
   else
5591
0
      target = s->img_n; // if they want monochrome, we'll post-convert
5592
5593
   // sanity-check size
5594
984
   if (!stbi__mad3sizes_valid(target, s->img_x, s->img_y, 0))
5595
0
      return stbi__errpuc("too large", "Corrupt BMP");
5596
5597
984
   out = (stbi_uc *) stbi__malloc_mad3(target, s->img_x, s->img_y, 0);
5598
984
   if (!out) return stbi__errpuc("outofmem", "Out of memory");
5599
984
   if (info.bpp < 16) {
5600
298
      int z=0;
5601
298
      if (psize == 0 || psize > 256) { STBI_FREE(out); return stbi__errpuc("invalid", "Corrupt BMP"); }
5602
9.52k
      for (i=0; i < psize; ++i) {
5603
9.29k
         pal[i][2] = stbi__get8(s);
5604
9.29k
         pal[i][1] = stbi__get8(s);
5605
9.29k
         pal[i][0] = stbi__get8(s);
5606
9.29k
         if (info.hsz != 12) stbi__get8(s);
5607
9.29k
         pal[i][3] = 255;
5608
9.29k
      }
5609
228
      stbi__skip(s, info.offset - info.extra_read - info.hsz - psize * (info.hsz == 12 ? 3 : 4));
5610
228
      if (info.bpp == 1) width = (s->img_x + 7) >> 3;
5611
142
      else if (info.bpp == 4) width = (s->img_x + 1) >> 1;
5612
105
      else if (info.bpp == 8) width = s->img_x;
5613
61
      else { STBI_FREE(out); return stbi__errpuc("bad bpp", "Corrupt BMP"); }
5614
167
      pad = (-width)&3;
5615
167
      if (info.bpp == 1) {
5616
135M
         for (j=0; j < (int) s->img_y; ++j) {
5617
135M
            int bit_offset = 7, v = stbi__get8(s);
5618
419M
            for (i=0; i < (int) s->img_x; ++i) {
5619
284M
               int color = (v>>bit_offset)&0x1;
5620
284M
               out[z++] = pal[color][0];
5621
284M
               out[z++] = pal[color][1];
5622
284M
               out[z++] = pal[color][2];
5623
284M
               if (target == 4) out[z++] = 255;
5624
284M
               if (i+1 == (int) s->img_x) break;
5625
284M
               if((--bit_offset) < 0) {
5626
35.1M
                  bit_offset = 7;
5627
35.1M
                  v = stbi__get8(s);
5628
35.1M
               }
5629
284M
            }
5630
135M
            stbi__skip(s, pad);
5631
135M
         }
5632
86
      } else {
5633
828k
         for (j=0; j < (int) s->img_y; ++j) {
5634
160M
            for (i=0; i < (int) s->img_x; i += 2) {
5635
160M
               int v=stbi__get8(s),v2=0;
5636
160M
               if (info.bpp == 4) {
5637
113M
                  v2 = v & 15;
5638
113M
                  v >>= 4;
5639
113M
               }
5640
160M
               out[z++] = pal[v][0];
5641
160M
               out[z++] = pal[v][1];
5642
160M
               out[z++] = pal[v][2];
5643
160M
               if (target == 4) out[z++] = 255;
5644
160M
               if (i+1 == (int) s->img_x) break;
5645
160M
               v = (info.bpp == 8) ? stbi__get8(s) : v2;
5646
160M
               out[z++] = pal[v][0];
5647
160M
               out[z++] = pal[v][1];
5648
160M
               out[z++] = pal[v][2];
5649
160M
               if (target == 4) out[z++] = 255;
5650
160M
            }
5651
828k
            stbi__skip(s, pad);
5652
828k
         }
5653
81
      }
5654
686
   } else {
5655
686
      int rshift=0,gshift=0,bshift=0,ashift=0,rcount=0,gcount=0,bcount=0,acount=0;
5656
686
      int z = 0;
5657
686
      int easy=0;
5658
686
      stbi__skip(s, info.offset - info.extra_read - info.hsz);
5659
686
      if (info.bpp == 24) width = 3 * s->img_x;
5660
542
      else if (info.bpp == 16) width = 2*s->img_x;
5661
473
      else /* bpp = 32 and pad = 0 */ width=0;
5662
686
      pad = (-width) & 3;
5663
686
      if (info.bpp == 24) {
5664
144
         easy = 1;
5665
542
      } else if (info.bpp == 32) {
5666
285
         if (mb == 0xff && mg == 0xff00 && mr == 0x00ff0000 && ma == 0xff000000)
5667
28
            easy = 2;
5668
285
      }
5669
686
      if (!easy) {
5670
514
         if (!mr || !mg || !mb) { STBI_FREE(out); return stbi__errpuc("bad masks", "Corrupt BMP"); }
5671
         // right shift amt to put high bit in position #7
5672
308
         rshift = stbi__high_bit(mr)-7; rcount = stbi__bitcount(mr);
5673
308
         gshift = stbi__high_bit(mg)-7; gcount = stbi__bitcount(mg);
5674
308
         bshift = stbi__high_bit(mb)-7; bcount = stbi__bitcount(mb);
5675
308
         ashift = stbi__high_bit(ma)-7; acount = stbi__bitcount(ma);
5676
308
         if (rcount > 8 || gcount > 8 || bcount > 8 || acount > 8) { STBI_FREE(out); return stbi__errpuc("bad masks", "Corrupt BMP"); }
5677
308
      }
5678
578M
      for (j=0; j < (int) s->img_y; ++j) {
5679
578M
         if (easy) {
5680
982M
            for (i=0; i < (int) s->img_x; ++i) {
5681
533M
               unsigned char a;
5682
533M
               out[z+2] = stbi__get8(s);
5683
533M
               out[z+1] = stbi__get8(s);
5684
533M
               out[z+0] = stbi__get8(s);
5685
533M
               z += 3;
5686
533M
               a = (easy == 2 ? stbi__get8(s) : 255);
5687
533M
               all_a |= a;
5688
533M
               if (target == 4) out[z++] = a;
5689
533M
            }
5690
448M
         } else {
5691
129M
            int bpp = info.bpp;
5692
788M
            for (i=0; i < (int) s->img_x; ++i) {
5693
659M
               stbi__uint32 v = (bpp == 16 ? (stbi__uint32) stbi__get16le(s) : stbi__get32le(s));
5694
659M
               unsigned int a;
5695
659M
               out[z++] = STBI__BYTECAST(stbi__shiftsigned(v & mr, rshift, rcount));
5696
659M
               out[z++] = STBI__BYTECAST(stbi__shiftsigned(v & mg, gshift, gcount));
5697
659M
               out[z++] = STBI__BYTECAST(stbi__shiftsigned(v & mb, bshift, bcount));
5698
659M
               a = (ma ? stbi__shiftsigned(v & ma, ashift, acount) : 255);
5699
659M
               all_a |= a;
5700
659M
               if (target == 4) out[z++] = STBI__BYTECAST(a);
5701
659M
            }
5702
129M
         }
5703
578M
         stbi__skip(s, pad);
5704
578M
      }
5705
297
   }
5706
5707
   // if alpha channel is all 0s, replace with all 255s
5708
464
   if (target == 4 && all_a == 0)
5709
145M
      for (i=4*s->img_x*s->img_y-1; i >= 0; i -= 4)
5710
145M
         out[i] = 255;
5711
5712
464
   if (flip_vertically) {
5713
456
      stbi_uc t;
5714
357M
      for (j=0; j < (int) s->img_y>>1; ++j) {
5715
357M
         stbi_uc *p1 = out +      j     *s->img_x*target;
5716
357M
         stbi_uc *p2 = out + (s->img_y-1-j)*s->img_x*target;
5717
2.94G
         for (i=0; i < (int) s->img_x*target; ++i) {
5718
2.58G
            t = p1[i]; p1[i] = p2[i]; p2[i] = t;
5719
2.58G
         }
5720
357M
      }
5721
456
   }
5722
5723
464
   if (req_comp && req_comp != target) {
5724
0
      out = stbi__convert_format(out, target, req_comp, s->img_x, s->img_y);
5725
0
      if (out == NULL) return out; // stbi__convert_format frees input on failure
5726
0
   }
5727
5728
464
   *x = s->img_x;
5729
464
   *y = s->img_y;
5730
464
   if (comp) *comp = s->img_n;
5731
464
   return out;
5732
464
}
5733
#endif
5734
5735
// Targa Truevision - TGA
5736
// by Jonathan Dummer
5737
#ifndef STBI_NO_TGA
5738
// returns STBI_rgb or whatever, 0 on error
5739
static int stbi__tga_get_comp(int bits_per_pixel, int is_grey, int* is_rgb16)
5740
475
{
5741
   // only RGB or RGBA (incl. 16bit) or grey allowed
5742
475
   if (is_rgb16) *is_rgb16 = 0;
5743
475
   switch(bits_per_pixel) {
5744
140
      case 8:  return STBI_grey;
5745
148
      case 16: if(is_grey) return STBI_grey_alpha;
5746
               // fallthrough
5747
148
      case 15: if(is_rgb16) *is_rgb16 = 1;
5748
148
               return STBI_rgb;
5749
64
      case 24: // fallthrough
5750
128
      case 32: return bits_per_pixel/8;
5751
11
      default: return 0;
5752
475
   }
5753
475
}
5754
5755
static int stbi__tga_info(stbi__context *s, int *x, int *y, int *comp)
5756
2.70k
{
5757
2.70k
    int tga_w, tga_h, tga_comp, tga_image_type, tga_bits_per_pixel, tga_colormap_bpp;
5758
2.70k
    int sz, tga_colormap_type;
5759
2.70k
    stbi__get8(s);                   // discard Offset
5760
2.70k
    tga_colormap_type = stbi__get8(s); // colormap type
5761
2.70k
    if( tga_colormap_type > 1 ) {
5762
2.31k
        stbi__rewind(s);
5763
2.31k
        return 0;      // only RGB or indexed allowed
5764
2.31k
    }
5765
389
    tga_image_type = stbi__get8(s); // image type
5766
389
    if ( tga_colormap_type == 1 ) { // colormapped (paletted) image
5767
151
        if (tga_image_type != 1 && tga_image_type != 9) {
5768
13
            stbi__rewind(s);
5769
13
            return 0;
5770
13
        }
5771
138
        stbi__skip(s,4);       // skip index of first colormap entry and number of entries
5772
138
        sz = stbi__get8(s);    //   check bits per palette color entry
5773
138
        if ( (sz != 8) && (sz != 15) && (sz != 16) && (sz != 24) && (sz != 32) ) {
5774
23
            stbi__rewind(s);
5775
23
            return 0;
5776
23
        }
5777
115
        stbi__skip(s,4);       // skip image x and y origin
5778
115
        tga_colormap_bpp = sz;
5779
238
    } else { // "normal" image w/o colormap - only RGB or grey allowed, +/- RLE
5780
238
        if ( (tga_image_type != 2) && (tga_image_type != 3) && (tga_image_type != 10) && (tga_image_type != 11) ) {
5781
67
            stbi__rewind(s);
5782
67
            return 0; // only RGB or grey allowed, +/- RLE
5783
67
        }
5784
171
        stbi__skip(s,9); // skip colormap specification and image x/y origin
5785
171
        tga_colormap_bpp = 0;
5786
171
    }
5787
286
    tga_w = stbi__get16le(s);
5788
286
    if( tga_w < 1 ) {
5789
9
        stbi__rewind(s);
5790
9
        return 0;   // test width
5791
9
    }
5792
277
    tga_h = stbi__get16le(s);
5793
277
    if( tga_h < 1 ) {
5794
16
        stbi__rewind(s);
5795
16
        return 0;   // test height
5796
16
    }
5797
261
    tga_bits_per_pixel = stbi__get8(s); // bits per pixel
5798
261
    stbi__get8(s); // ignore alpha bits
5799
261
    if (tga_colormap_bpp != 0) {
5800
105
        if((tga_bits_per_pixel != 8) && (tga_bits_per_pixel != 16)) {
5801
            // when using a colormap, tga_bits_per_pixel is the size of the indexes
5802
            // I don't think anything but 8 or 16bit indexes makes sense
5803
18
            stbi__rewind(s);
5804
18
            return 0;
5805
18
        }
5806
87
        tga_comp = stbi__tga_get_comp(tga_colormap_bpp, 0, NULL);
5807
156
    } else {
5808
156
        tga_comp = stbi__tga_get_comp(tga_bits_per_pixel, (tga_image_type == 3) || (tga_image_type == 11), NULL);
5809
156
    }
5810
243
    if(!tga_comp) {
5811
11
      stbi__rewind(s);
5812
11
      return 0;
5813
11
    }
5814
232
    if (x) *x = tga_w;
5815
232
    if (y) *y = tga_h;
5816
232
    if (comp) *comp = tga_comp;
5817
232
    return 1;                   // seems to have passed everything
5818
243
}
5819
5820
static int stbi__tga_test(stbi__context *s)
5821
245
{
5822
245
   int res = 0;
5823
245
   int sz, tga_color_type;
5824
245
   stbi__get8(s);      //   discard Offset
5825
245
   tga_color_type = stbi__get8(s);   //   color type
5826
245
   if ( tga_color_type > 1 ) goto errorEnd;   //   only RGB or indexed allowed
5827
232
   sz = stbi__get8(s);   //   image type
5828
232
   if ( tga_color_type == 1 ) { // colormapped (paletted) image
5829
87
      if (sz != 1 && sz != 9) goto errorEnd; // colortype 1 demands image type 1 or 9
5830
87
      stbi__skip(s,4);       // skip index of first colormap entry and number of entries
5831
87
      sz = stbi__get8(s);    //   check bits per palette color entry
5832
87
      if ( (sz != 8) && (sz != 15) && (sz != 16) && (sz != 24) && (sz != 32) ) goto errorEnd;
5833
87
      stbi__skip(s,4);       // skip image x and y origin
5834
145
   } else { // "normal" image w/o colormap
5835
145
      if ( (sz != 2) && (sz != 3) && (sz != 10) && (sz != 11) ) goto errorEnd; // only RGB or grey allowed, +/- RLE
5836
145
      stbi__skip(s,9); // skip colormap specification and image x/y origin
5837
145
   }
5838
232
   if ( stbi__get16le(s) < 1 ) goto errorEnd;      //   test width
5839
232
   if ( stbi__get16le(s) < 1 ) goto errorEnd;      //   test height
5840
232
   sz = stbi__get8(s);   //   bits per pixel
5841
232
   if ( (tga_color_type == 1) && (sz != 8) && (sz != 16) ) goto errorEnd; // for colormapped images, bpp is size of an index
5842
232
   if ( (sz != 8) && (sz != 15) && (sz != 16) && (sz != 24) && (sz != 32) ) goto errorEnd;
5843
5844
232
   res = 1; // if we got this far, everything's good and we can return 1 instead of 0
5845
5846
245
errorEnd:
5847
245
   stbi__rewind(s);
5848
245
   return res;
5849
232
}
5850
5851
// read 16bit value and convert to 24bit RGB
5852
static void stbi__tga_read_rgb16(stbi__context *s, stbi_uc* out)
5853
114M
{
5854
114M
   stbi__uint16 px = (stbi__uint16)stbi__get16le(s);
5855
114M
   stbi__uint16 fiveBitMask = 31;
5856
   // we have 3 channels with 5bits each
5857
114M
   int r = (px >> 10) & fiveBitMask;
5858
114M
   int g = (px >> 5) & fiveBitMask;
5859
114M
   int b = px & fiveBitMask;
5860
   // Note that this saves the data in RGB(A) order, so it doesn't need to be swapped later
5861
114M
   out[0] = (stbi_uc)((r * 255)/31);
5862
114M
   out[1] = (stbi_uc)((g * 255)/31);
5863
114M
   out[2] = (stbi_uc)((b * 255)/31);
5864
5865
   // some people claim that the most significant bit might be used for alpha
5866
   // (possibly if an alpha-bit is set in the "image descriptor byte")
5867
   // but that only made 16bit test images completely translucent..
5868
   // so let's treat all 15 and 16bit TGAs as RGB with no alpha.
5869
114M
}
5870
5871
static void *stbi__tga_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
5872
232
{
5873
   //   read in the TGA header stuff
5874
232
   int tga_offset = stbi__get8(s);
5875
232
   int tga_indexed = stbi__get8(s);
5876
232
   int tga_image_type = stbi__get8(s);
5877
232
   int tga_is_RLE = 0;
5878
232
   int tga_palette_start = stbi__get16le(s);
5879
232
   int tga_palette_len = stbi__get16le(s);
5880
232
   int tga_palette_bits = stbi__get8(s);
5881
232
   int tga_x_origin = stbi__get16le(s);
5882
232
   int tga_y_origin = stbi__get16le(s);
5883
232
   int tga_width = stbi__get16le(s);
5884
232
   int tga_height = stbi__get16le(s);
5885
232
   int tga_bits_per_pixel = stbi__get8(s);
5886
232
   int tga_comp, tga_rgb16=0;
5887
232
   int tga_inverted = stbi__get8(s);
5888
   // int tga_alpha_bits = tga_inverted & 15; // the 4 lowest bits - unused (useless?)
5889
   //   image data
5890
232
   unsigned char *tga_data;
5891
232
   unsigned char *tga_palette = NULL;
5892
232
   int i, j;
5893
232
   unsigned char raw_data[4] = {0};
5894
232
   int RLE_count = 0;
5895
232
   int RLE_repeating = 0;
5896
232
   int read_next_pixel = 1;
5897
232
   STBI_NOTUSED(ri);
5898
232
   STBI_NOTUSED(tga_x_origin); // @TODO
5899
232
   STBI_NOTUSED(tga_y_origin); // @TODO
5900
5901
232
   if (tga_height > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
5902
232
   if (tga_width > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
5903
5904
   //   do a tiny bit of precessing
5905
232
   if ( tga_image_type >= 8 )
5906
102
   {
5907
102
      tga_image_type -= 8;
5908
102
      tga_is_RLE = 1;
5909
102
   }
5910
232
   tga_inverted = 1 - ((tga_inverted >> 5) & 1);
5911
5912
   //   If I'm paletted, then I'll use the number of bits from the palette
5913
232
   if ( tga_indexed ) tga_comp = stbi__tga_get_comp(tga_palette_bits, 0, &tga_rgb16);
5914
145
   else tga_comp = stbi__tga_get_comp(tga_bits_per_pixel, (tga_image_type == 3), &tga_rgb16);
5915
5916
232
   if(!tga_comp) // shouldn't really happen, stbi__tga_test() should have ensured basic consistency
5917
0
      return stbi__errpuc("bad format", "Can't find out TGA pixelformat");
5918
5919
   //   tga info
5920
232
   *x = tga_width;
5921
232
   *y = tga_height;
5922
232
   if (comp) *comp = tga_comp;
5923
5924
232
   if (!stbi__mad3sizes_valid(tga_width, tga_height, tga_comp, 0))
5925
0
      return stbi__errpuc("too large", "Corrupt TGA");
5926
5927
232
   tga_data = (unsigned char*)stbi__malloc_mad3(tga_width, tga_height, tga_comp, 0);
5928
232
   if (!tga_data) return stbi__errpuc("outofmem", "Out of memory");
5929
5930
   // skip to the data's starting position (offset usually = 0)
5931
232
   stbi__skip(s, tga_offset );
5932
5933
232
   if ( !tga_indexed && !tga_is_RLE && !tga_rgb16 ) {
5934
1.09M
      for (i=0; i < tga_height; ++i) {
5935
1.09M
         int row = tga_inverted ? tga_height -i - 1 : i;
5936
1.09M
         stbi_uc *tga_row = tga_data + row*tga_width*tga_comp;
5937
1.09M
         stbi__getn(s, tga_row, tga_width * tga_comp);
5938
1.09M
      }
5939
180
   } else  {
5940
      //   do I need to load a palette?
5941
180
      if ( tga_indexed)
5942
87
      {
5943
87
         if (tga_palette_len == 0) {  /* you have to have at least one entry! */
5944
3
            STBI_FREE(tga_data);
5945
3
            return stbi__errpuc("bad palette", "Corrupt TGA");
5946
3
         }
5947
5948
         //   any data to skip? (offset usually = 0)
5949
84
         stbi__skip(s, tga_palette_start );
5950
         //   load the palette
5951
84
         tga_palette = (unsigned char*)stbi__malloc_mad2(tga_palette_len, tga_comp, 0);
5952
84
         if (!tga_palette) {
5953
0
            STBI_FREE(tga_data);
5954
0
            return stbi__errpuc("outofmem", "Out of memory");
5955
0
         }
5956
84
         if (tga_rgb16) {
5957
52
            stbi_uc *pal_entry = tga_palette;
5958
52
            STBI_ASSERT(tga_comp == STBI_rgb);
5959
1.25M
            for (i=0; i < tga_palette_len; ++i) {
5960
1.25M
               stbi__tga_read_rgb16(s, pal_entry);
5961
1.25M
               pal_entry += tga_comp;
5962
1.25M
            }
5963
52
         } else if (!stbi__getn(s, tga_palette, tga_palette_len * tga_comp)) {
5964
4
               STBI_FREE(tga_data);
5965
4
               STBI_FREE(tga_palette);
5966
4
               return stbi__errpuc("bad palette", "Corrupt TGA");
5967
4
         }
5968
84
      }
5969
      //   load the data
5970
767M
      for (i=0; i < tga_width * tga_height; ++i)
5971
767M
      {
5972
         //   if I'm in RLE mode, do I need to get a RLE stbi__pngchunk?
5973
767M
         if ( tga_is_RLE )
5974
346M
         {
5975
346M
            if ( RLE_count == 0 )
5976
346M
            {
5977
               //   yep, get the next byte as a RLE command
5978
346M
               int RLE_cmd = stbi__get8(s);
5979
346M
               RLE_count = 1 + (RLE_cmd & 127);
5980
346M
               RLE_repeating = RLE_cmd >> 7;
5981
346M
               read_next_pixel = 1;
5982
346M
            } else if ( !RLE_repeating )
5983
880
            {
5984
880
               read_next_pixel = 1;
5985
880
            }
5986
346M
         } else
5987
420M
         {
5988
420M
            read_next_pixel = 1;
5989
420M
         }
5990
         //   OK, if I need to read a pixel, do it now
5991
767M
         if ( read_next_pixel )
5992
767M
         {
5993
            //   load however much data we did have
5994
767M
            if ( tga_indexed )
5995
399M
            {
5996
               // read in index, then perform the lookup
5997
399M
               int pal_idx = (tga_bits_per_pixel == 8) ? stbi__get8(s) : stbi__get16le(s);
5998
399M
               if ( pal_idx >= tga_palette_len ) {
5999
                  // invalid index
6000
225
                  pal_idx = 0;
6001
225
               }
6002
399M
               pal_idx *= tga_comp;
6003
1.37G
               for (j = 0; j < tga_comp; ++j) {
6004
974M
                  raw_data[j] = tga_palette[pal_idx+j];
6005
974M
               }
6006
399M
            } else if(tga_rgb16) {
6007
113M
               STBI_ASSERT(tga_comp == STBI_rgb);
6008
113M
               stbi__tga_read_rgb16(s, raw_data);
6009
254M
            } else {
6010
               //   read in the data raw
6011
923M
               for (j = 0; j < tga_comp; ++j) {
6012
668M
                  raw_data[j] = stbi__get8(s);
6013
668M
               }
6014
254M
            }
6015
            //   clear the reading flag for the next pixel
6016
767M
            read_next_pixel = 0;
6017
767M
         } // end of reading a pixel
6018
6019
         // copy data
6020
2.75G
         for (j = 0; j < tga_comp; ++j)
6021
1.98G
           tga_data[i*tga_comp+j] = raw_data[j];
6022
6023
         //   in case we're in RLE mode, keep counting down
6024
767M
         --RLE_count;
6025
767M
      }
6026
      //   do I need to invert the image?
6027
173
      if ( tga_inverted )
6028
154
      {
6029
650k
         for (j = 0; j*2 < tga_height; ++j)
6030
650k
         {
6031
650k
            int index1 = j * tga_width * tga_comp;
6032
650k
            int index2 = (tga_height - 1 - j) * tga_width * tga_comp;
6033
914M
            for (i = tga_width * tga_comp; i > 0; --i)
6034
914M
            {
6035
914M
               unsigned char temp = tga_data[index1];
6036
914M
               tga_data[index1] = tga_data[index2];
6037
914M
               tga_data[index2] = temp;
6038
914M
               ++index1;
6039
914M
               ++index2;
6040
914M
            }
6041
650k
         }
6042
154
      }
6043
      //   clear my palette, if I had one
6044
173
      if ( tga_palette != NULL )
6045
80
      {
6046
80
         STBI_FREE( tga_palette );
6047
80
      }
6048
173
   }
6049
6050
   // swap RGB - if the source data was RGB16, it already is in the right order
6051
225
   if (tga_comp >= 3 && !tga_rgb16)
6052
62
   {
6053
62
      unsigned char* tga_pixel = tga_data;
6054
331M
      for (i=0; i < tga_width * tga_height; ++i)
6055
331M
      {
6056
331M
         unsigned char temp = tga_pixel[0];
6057
331M
         tga_pixel[0] = tga_pixel[2];
6058
331M
         tga_pixel[2] = temp;
6059
331M
         tga_pixel += tga_comp;
6060
331M
      }
6061
62
   }
6062
6063
   // convert to target component count
6064
225
   if (req_comp && req_comp != tga_comp)
6065
194
      tga_data = stbi__convert_format(tga_data, tga_comp, req_comp, tga_width, tga_height);
6066
6067
   //   the things I do to get rid of an error message, and yet keep
6068
   //   Microsoft's C compilers happy... [8^(
6069
225
   tga_palette_start = tga_palette_len = tga_palette_bits =
6070
225
         tga_x_origin = tga_y_origin = 0;
6071
225
   STBI_NOTUSED(tga_palette_start);
6072
   //   OK, done
6073
225
   return tga_data;
6074
232
}
6075
#endif
6076
6077
// *************************************************************************************************
6078
// Photoshop PSD loader -- PD by Thatcher Ulrich, integration by Nicolas Schulz, tweaked by STB
6079
6080
#ifndef STBI_NO_PSD
6081
static int stbi__psd_test(stbi__context *s)
6082
3.46k
{
6083
3.46k
   int r = (stbi__get32be(s) == 0x38425053);
6084
3.46k
   stbi__rewind(s);
6085
3.46k
   return r;
6086
3.46k
}
6087
6088
static int stbi__psd_decode_rle(stbi__context *s, stbi_uc *p, int pixelCount)
6089
136
{
6090
136
   int count, nleft, len;
6091
6092
136
   count = 0;
6093
185M
   while ((nleft = pixelCount - count) > 0) {
6094
185M
      len = stbi__get8(s);
6095
185M
      if (len == 128) {
6096
         // No-op.
6097
185M
      } else if (len < 128) {
6098
         // Copy next len+1 bytes literally.
6099
185M
         len++;
6100
185M
         if (len > nleft) return 0; // corrupt data
6101
185M
         count += len;
6102
371M
         while (len) {
6103
185M
            *p = stbi__get8(s);
6104
185M
            p += 4;
6105
185M
            len--;
6106
185M
         }
6107
185M
      } else if (len > 128) {
6108
254
         stbi_uc   val;
6109
         // Next -len+1 bytes in the dest are replicated from next source byte.
6110
         // (Interpret len as a negative 8-bit int.)
6111
254
         len = 257 - len;
6112
254
         if (len > nleft) return 0; // corrupt data
6113
253
         val = stbi__get8(s);
6114
253
         count += len;
6115
19.6k
         while (len) {
6116
19.4k
            *p = val;
6117
19.4k
            p += 4;
6118
19.4k
            len--;
6119
19.4k
         }
6120
253
      }
6121
185M
   }
6122
6123
131
   return 1;
6124
136
}
6125
6126
static void *stbi__psd_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri, int bpc)
6127
306
{
6128
306
   int pixelCount;
6129
306
   int channelCount, compression;
6130
306
   int channel, i;
6131
306
   int bitdepth;
6132
306
   int w,h;
6133
306
   stbi_uc *out;
6134
306
   STBI_NOTUSED(ri);
6135
6136
   // Check identifier
6137
306
   if (stbi__get32be(s) != 0x38425053)   // "8BPS"
6138
0
      return stbi__errpuc("not PSD", "Corrupt PSD image");
6139
6140
   // Check file type version.
6141
306
   if (stbi__get16be(s) != 1)
6142
0
      return stbi__errpuc("wrong version", "Unsupported version of PSD image");
6143
6144
   // Skip 6 reserved bytes.
6145
306
   stbi__skip(s, 6 );
6146
6147
   // Read the number of channels (R, G, B, A, etc).
6148
306
   channelCount = stbi__get16be(s);
6149
306
   if (channelCount < 0 || channelCount > 16)
6150
0
      return stbi__errpuc("wrong channel count", "Unsupported number of channels in PSD image");
6151
6152
   // Read the rows and columns of the image.
6153
306
   h = stbi__get32be(s);
6154
306
   w = stbi__get32be(s);
6155
6156
306
   if (h > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
6157
302
   if (w > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
6158
6159
   // Make sure the depth is 8 bits.
6160
301
   bitdepth = stbi__get16be(s);
6161
301
   if (bitdepth != 8 && bitdepth != 16)
6162
0
      return stbi__errpuc("unsupported bit depth", "PSD bit depth is not 8 or 16 bit");
6163
6164
   // Make sure the color mode is RGB.
6165
   // Valid options are:
6166
   //   0: Bitmap
6167
   //   1: Grayscale
6168
   //   2: Indexed color
6169
   //   3: RGB color
6170
   //   4: CMYK color
6171
   //   7: Multichannel
6172
   //   8: Duotone
6173
   //   9: Lab color
6174
301
   if (stbi__get16be(s) != 3)
6175
0
      return stbi__errpuc("wrong color format", "PSD is not in RGB color format");
6176
6177
   // Skip the Mode Data.  (It's the palette for indexed color; other info for other modes.)
6178
301
   stbi__skip(s,stbi__get32be(s) );
6179
6180
   // Skip the image resources.  (resolution, pen tool paths, etc)
6181
301
   stbi__skip(s, stbi__get32be(s) );
6182
6183
   // Skip the reserved data.
6184
301
   stbi__skip(s, stbi__get32be(s) );
6185
6186
   // Find out if the data is compressed.
6187
   // Known values:
6188
   //   0: no compression
6189
   //   1: RLE compressed
6190
301
   compression = stbi__get16be(s);
6191
301
   if (compression > 1)
6192
14
      return stbi__errpuc("bad compression", "PSD has an unknown compression format");
6193
6194
   // Check size
6195
287
   if (!stbi__mad3sizes_valid(4, w, h, 0))
6196
83
      return stbi__errpuc("too large", "Corrupt PSD");
6197
6198
   // Create the destination image.
6199
6200
204
   if (!compression && bitdepth == 16 && bpc == 16) {
6201
0
      out = (stbi_uc *) stbi__malloc_mad3(8, w, h, 0);
6202
0
      ri->bits_per_channel = 16;
6203
0
   } else
6204
204
      out = (stbi_uc *) stbi__malloc(4 * w*h);
6205
6206
204
   if (!out) return stbi__errpuc("outofmem", "Out of memory");
6207
204
   pixelCount = w*h;
6208
6209
   // Initialize the data to zero.
6210
   //memset( out, 0, pixelCount * 4 );
6211
6212
   // Finally, the image data.
6213
204
   if (compression) {
6214
      // RLE as used by .PSD and .TIFF
6215
      // Loop until you get the number of unpacked bytes you are expecting:
6216
      //     Read the next source byte into n.
6217
      //     If n is between 0 and 127 inclusive, copy the next n+1 bytes literally.
6218
      //     Else if n is between -127 and -1 inclusive, copy the next byte -n+1 times.
6219
      //     Else if n is 128, noop.
6220
      // Endloop
6221
6222
      // The RLE-compressed data is preceded by a 2-byte data count for each row in the data,
6223
      // which we're going to just skip.
6224
89
      stbi__skip(s, h * channelCount * 2 );
6225
6226
      // Read the RLE data by channel.
6227
428
      for (channel = 0; channel < 4; channel++) {
6228
344
         stbi_uc *p;
6229
6230
344
         p = out+channel;
6231
344
         if (channel >= channelCount) {
6232
            // Fill this channel with default data.
6233
750M
            for (i = 0; i < pixelCount; i++, p += 4)
6234
750M
               *p = (channel == 3 ? 255 : 0);
6235
208
         } else {
6236
            // Read the RLE data.
6237
136
            if (!stbi__psd_decode_rle(s, p, pixelCount)) {
6238
5
               STBI_FREE(out);
6239
5
               return stbi__errpuc("corrupt", "bad RLE data");
6240
5
            }
6241
136
         }
6242
344
      }
6243
6244
115
   } else {
6245
      // We're at the raw image data.  It's each channel in order (Red, Green, Blue, Alpha, ...)
6246
      // where each channel consists of an 8-bit (or 16-bit) value for each pixel in the image.
6247
6248
      // Read the data by channel.
6249
575
      for (channel = 0; channel < 4; channel++) {
6250
460
         if (channel >= channelCount) {
6251
            // Fill this channel with default data.
6252
174
            if (bitdepth == 16 && bpc == 16) {
6253
0
               stbi__uint16 *q = ((stbi__uint16 *) out) + channel;
6254
0
               stbi__uint16 val = channel == 3 ? 65535 : 0;
6255
0
               for (i = 0; i < pixelCount; i++, q += 4)
6256
0
                  *q = val;
6257
174
            } else {
6258
174
               stbi_uc *p = out+channel;
6259
174
               stbi_uc val = channel == 3 ? 255 : 0;
6260
587M
               for (i = 0; i < pixelCount; i++, p += 4)
6261
587M
                  *p = val;
6262
174
            }
6263
286
         } else {
6264
286
            if (ri->bits_per_channel == 16) {    // output bpc
6265
0
               stbi__uint16 *q = ((stbi__uint16 *) out) + channel;
6266
0
               for (i = 0; i < pixelCount; i++, q += 4)
6267
0
                  *q = (stbi__uint16) stbi__get16be(s);
6268
286
            } else {
6269
286
               stbi_uc *p = out+channel;
6270
286
               if (bitdepth == 16) {  // input bpc
6271
145M
                  for (i = 0; i < pixelCount; i++, p += 4)
6272
145M
                     *p = (stbi_uc) (stbi__get16be(s) >> 8);
6273
198
               } else {
6274
864M
                  for (i = 0; i < pixelCount; i++, p += 4)
6275
864M
                     *p = stbi__get8(s);
6276
198
               }
6277
286
            }
6278
286
         }
6279
460
      }
6280
115
   }
6281
6282
   // remove weird white matte from PSD
6283
199
   if (channelCount >= 4) {
6284
72
      if (ri->bits_per_channel == 16) {
6285
0
         for (i=0; i < w*h; ++i) {
6286
0
            stbi__uint16 *pixel = (stbi__uint16 *) out + 4*i;
6287
0
            if (pixel[3] != 0 && pixel[3] != 65535) {
6288
0
               float a = pixel[3] / 65535.0f;
6289
0
               float ra = 1.0f / a;
6290
0
               float inv_a = 65535.0f * (1 - ra);
6291
0
               pixel[0] = (stbi__uint16) (pixel[0]*ra + inv_a);
6292
0
               pixel[1] = (stbi__uint16) (pixel[1]*ra + inv_a);
6293
0
               pixel[2] = (stbi__uint16) (pixel[2]*ra + inv_a);
6294
0
            }
6295
0
         }
6296
72
      } else {
6297
214M
         for (i=0; i < w*h; ++i) {
6298
214M
            unsigned char *pixel = out + 4*i;
6299
214M
            if (pixel[3] != 0 && pixel[3] != 255) {
6300
298
               float a = pixel[3] / 255.0f;
6301
298
               float ra = 1.0f / a;
6302
298
               float inv_a = 255.0f * (1 - ra);
6303
298
               pixel[0] = (unsigned char) (pixel[0]*ra + inv_a);
6304
298
               pixel[1] = (unsigned char) (pixel[1]*ra + inv_a);
6305
298
               pixel[2] = (unsigned char) (pixel[2]*ra + inv_a);
6306
298
            }
6307
214M
         }
6308
72
      }
6309
72
   }
6310
6311
   // convert to desired output format
6312
199
   if (req_comp && req_comp != 4) {
6313
0
      if (ri->bits_per_channel == 16)
6314
0
         out = (stbi_uc *) stbi__convert_format16((stbi__uint16 *) out, 4, req_comp, w, h);
6315
0
      else
6316
0
         out = stbi__convert_format(out, 4, req_comp, w, h);
6317
0
      if (out == NULL) return out; // stbi__convert_format frees input on failure
6318
0
   }
6319
6320
199
   if (comp) *comp = 4;
6321
199
   *y = h;
6322
199
   *x = w;
6323
6324
199
   return out;
6325
199
}
6326
#endif
6327
6328
// *************************************************************************************************
6329
// Softimage PIC loader
6330
// by Tom Seddon
6331
//
6332
// See http://softimage.wiki.softimage.com/index.php/INFO:_PIC_file_format
6333
// See http://ozviz.wasp.uwa.edu.au/~pbourke/dataformats/softimagepic/
6334
6335
#ifndef STBI_NO_PIC
6336
static int stbi__pic_is4(stbi__context *s,const char *str)
6337
7.22k
{
6338
7.22k
   int i;
6339
10.4k
   for (i=0; i<4; ++i)
6340
9.62k
      if (stbi__get8(s) != (stbi_uc)str[i])
6341
6.42k
         return 0;
6342
6343
794
   return 1;
6344
7.22k
}
6345
6346
static int stbi__pic_test_core(stbi__context *s)
6347
3.15k
{
6348
3.15k
   int i;
6349
6350
3.15k
   if (!stbi__pic_is4(s,"\x53\x80\xF6\x34"))
6351
2.91k
      return 0;
6352
6353
20.8k
   for(i=0;i<84;++i)
6354
20.5k
      stbi__get8(s);
6355
6356
245
   if (!stbi__pic_is4(s,"PICT"))
6357
13
      return 0;
6358
6359
232
   return 1;
6360
245
}
6361
6362
typedef struct
6363
{
6364
   stbi_uc size,type,channel;
6365
} stbi__pic_packet;
6366
6367
static stbi_uc *stbi__readval(stbi__context *s, int channel, stbi_uc *dest)
6368
73.3M
{
6369
73.3M
   int mask=0x80, i;
6370
6371
366M
   for (i=0; i<4; ++i, mask>>=1) {
6372
293M
      if (channel & mask) {
6373
467
         if (stbi__at_eof(s)) return stbi__errpuc("bad file","PIC file too short");
6374
442
         dest[i]=stbi__get8(s);
6375
442
      }
6376
293M
   }
6377
6378
73.3M
   return dest;
6379
73.3M
}
6380
6381
static void stbi__copyval(int channel,stbi_uc *dest,const stbi_uc *src)
6382
486k
{
6383
486k
   int mask=0x80,i;
6384
6385
2.43M
   for (i=0;i<4; ++i, mask>>=1)
6386
1.94M
      if (channel&mask)
6387
126k
         dest[i]=src[i];
6388
486k
}
6389
6390
static stbi_uc *stbi__pic_load_core(stbi__context *s,int width,int height,int *comp, stbi_uc *result)
6391
232
{
6392
232
   int act_comp=0,num_packets=0,y,chained;
6393
232
   stbi__pic_packet packets[10];
6394
6395
   // this will (should...) cater for even some bizarre stuff like having data
6396
    // for the same channel in multiple packets.
6397
257
   do {
6398
257
      stbi__pic_packet *packet;
6399
6400
257
      if (num_packets==sizeof(packets)/sizeof(packets[0]))
6401
0
         return stbi__errpuc("bad format","too many packets");
6402
6403
257
      packet = &packets[num_packets++];
6404
6405
257
      chained = stbi__get8(s);
6406
257
      packet->size    = stbi__get8(s);
6407
257
      packet->type    = stbi__get8(s);
6408
257
      packet->channel = stbi__get8(s);
6409
6410
257
      act_comp |= packet->channel;
6411
6412
257
      if (stbi__at_eof(s))          return stbi__errpuc("bad file","file too short (reading packets)");
6413
257
      if (packet->size != 8)  return stbi__errpuc("bad format","packet isn't 8bpp");
6414
257
   } while (chained);
6415
6416
232
   *comp = (act_comp & 0x10 ? 4 : 3); // has alpha channel?
6417
6418
646k
   for(y=0; y<height; ++y) {
6419
646k
      int packet_idx;
6420
6421
1.42M
      for(packet_idx=0; packet_idx < num_packets; ++packet_idx) {
6422
773k
         stbi__pic_packet *packet = &packets[packet_idx];
6423
773k
         stbi_uc *dest = result+y*width*4;
6424
6425
773k
         switch (packet->type) {
6426
11
            default:
6427
11
               return stbi__errpuc("bad format","packet has bad compression type");
6428
6429
562k
            case 0: {//uncompressed
6430
562k
               int x;
6431
6432
73.9M
               for(x=0;x<width;++x, dest+=4)
6433
73.3M
                  if (!stbi__readval(s,packet->channel,dest))
6434
17
                     return 0;
6435
562k
               break;
6436
562k
            }
6437
6438
562k
            case 1://Pure RLE
6439
195k
               {
6440
195k
                  int left=width, i;
6441
6442
196k
                  while (left>0) {
6443
1.05k
                     stbi_uc count,value[4];
6444
6445
1.05k
                     count=stbi__get8(s);
6446
1.05k
                     if (stbi__at_eof(s))   return stbi__errpuc("bad file","file too short (pure read count)");
6447
6448
986
                     if (count > left)
6449
202
                        count = (stbi_uc) left;
6450
6451
986
                     if (!stbi__readval(s,packet->channel,value))  return 0;
6452
6453
69.1k
                     for(i=0; i<count; ++i,dest+=4)
6454
68.2k
                        stbi__copyval(packet->channel,dest,value);
6455
981
                     left -= count;
6456
981
                  }
6457
195k
               }
6458
195k
               break;
6459
6460
195k
            case 2: {//Mixed RLE
6461
16.2k
               int left=width;
6462
17.9k
               while (left>0) {
6463
1.74k
                  int count = stbi__get8(s), i;
6464
1.74k
                  if (stbi__at_eof(s))  return stbi__errpuc("bad file","file too short (mixed read count)");
6465
6466
1.66k
                  if (count >= 128) { // Repeated
6467
913
                     stbi_uc value[4];
6468
6469
913
                     if (count==128)
6470
415
                        count = stbi__get16be(s);
6471
498
                     else
6472
498
                        count -= 127;
6473
913
                     if (count > left)
6474
11
                        return stbi__errpuc("bad file","scanline overrun");
6475
6476
902
                     if (!stbi__readval(s,packet->channel,value))
6477
1
                        return 0;
6478
6479
419k
                     for(i=0;i<count;++i, dest += 4)
6480
418k
                        stbi__copyval(packet->channel,dest,value);
6481
901
                  } else { // Raw
6482
755
                     ++count;
6483
755
                     if (count>left) return stbi__errpuc("bad file","scanline overrun");
6484
6485
21.4k
                     for(i=0;i<count;++i, dest+=4)
6486
20.6k
                        if (!stbi__readval(s,packet->channel,dest))
6487
2
                           return 0;
6488
753
                  }
6489
1.65k
                  left-=count;
6490
1.65k
               }
6491
16.1k
               break;
6492
16.2k
            }
6493
773k
         }
6494
773k
      }
6495
646k
   }
6496
6497
46
   return result;
6498
232
}
6499
6500
static void *stbi__pic_load(stbi__context *s,int *px,int *py,int *comp,int req_comp, stbi__result_info *ri)
6501
232
{
6502
232
   stbi_uc *result;
6503
232
   int i, x,y, internal_comp;
6504
232
   STBI_NOTUSED(ri);
6505
6506
232
   if (!comp) comp = &internal_comp;
6507
6508
21.5k
   for (i=0; i<92; ++i)
6509
21.3k
      stbi__get8(s);
6510
6511
232
   x = stbi__get16be(s);
6512
232
   y = stbi__get16be(s);
6513
6514
232
   if (y > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
6515
232
   if (x > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
6516
6517
232
   if (stbi__at_eof(s))  return stbi__errpuc("bad file","file too short (pic header)");
6518
232
   if (!stbi__mad3sizes_valid(x, y, 4, 0)) return stbi__errpuc("too large", "PIC image too large to decode");
6519
6520
232
   stbi__get32be(s); //skip `ratio'
6521
232
   stbi__get16be(s); //skip `fields'
6522
232
   stbi__get16be(s); //skip `pad'
6523
6524
   // intermediate buffer is RGBA
6525
232
   result = (stbi_uc *) stbi__malloc_mad3(x, y, 4, 0);
6526
232
   if (!result) return stbi__errpuc("outofmem", "Out of memory");
6527
232
   memset(result, 0xff, x*y*4);
6528
6529
232
   if (!stbi__pic_load_core(s,x,y,comp, result)) {
6530
186
      STBI_FREE(result);
6531
186
      result=0;
6532
186
   }
6533
232
   *px = x;
6534
232
   *py = y;
6535
232
   if (req_comp == 0) req_comp = *comp;
6536
232
   result=stbi__convert_format(result,4,req_comp,x,y);
6537
6538
232
   return result;
6539
232
}
6540
6541
static int stbi__pic_test(stbi__context *s)
6542
3.15k
{
6543
3.15k
   int r = stbi__pic_test_core(s);
6544
3.15k
   stbi__rewind(s);
6545
3.15k
   return r;
6546
3.15k
}
6547
#endif
6548
6549
// *************************************************************************************************
6550
// GIF loader -- public domain by Jean-Marc Lienher -- simplified/shrunk by stb
6551
6552
#ifndef STBI_NO_GIF
6553
typedef struct
6554
{
6555
   stbi__int16 prefix;
6556
   stbi_uc first;
6557
   stbi_uc suffix;
6558
} stbi__gif_lzw;
6559
6560
typedef struct
6561
{
6562
   int w,h;
6563
   stbi_uc *out;                 // output buffer (always 4 components)
6564
   stbi_uc *background;          // The current "background" as far as a gif is concerned
6565
   stbi_uc *history;
6566
   int flags, bgindex, ratio, transparent, eflags;
6567
   stbi_uc  pal[256][4];
6568
   stbi_uc lpal[256][4];
6569
   stbi__gif_lzw codes[8192];
6570
   stbi_uc *color_table;
6571
   int parse, step;
6572
   int lflags;
6573
   int start_x, start_y;
6574
   int max_x, max_y;
6575
   int cur_x, cur_y;
6576
   int line_size;
6577
   int delay;
6578
} stbi__gif;
6579
6580
static int stbi__gif_test_raw(stbi__context *s)
6581
3.90k
{
6582
3.90k
   int sz;
6583
3.90k
   if (stbi__get8(s) != 'G' || stbi__get8(s) != 'I' || stbi__get8(s) != 'F' || stbi__get8(s) != '8') return 0;
6584
440
   sz = stbi__get8(s);
6585
440
   if (sz != '9' && sz != '7') return 0;
6586
440
   if (stbi__get8(s) != 'a') return 0;
6587
440
   return 1;
6588
440
}
6589
6590
static int stbi__gif_test(stbi__context *s)
6591
3.90k
{
6592
3.90k
   int r = stbi__gif_test_raw(s);
6593
3.90k
   stbi__rewind(s);
6594
3.90k
   return r;
6595
3.90k
}
6596
6597
static void stbi__gif_parse_colortable(stbi__context *s, stbi_uc pal[256][4], int num_entries, int transp)
6598
257
{
6599
257
   int i;
6600
5.19k
   for (i=0; i < num_entries; ++i) {
6601
4.93k
      pal[i][2] = stbi__get8(s);
6602
4.93k
      pal[i][1] = stbi__get8(s);
6603
4.93k
      pal[i][0] = stbi__get8(s);
6604
4.93k
      pal[i][3] = transp == i ? 0 : 255;
6605
4.93k
   }
6606
257
}
6607
6608
static int stbi__gif_header(stbi__context *s, stbi__gif *g, int *comp, int is_info)
6609
6.20k
{
6610
6.20k
   stbi_uc version;
6611
6.20k
   if (stbi__get8(s) != 'G' || stbi__get8(s) != 'I' || stbi__get8(s) != 'F' || stbi__get8(s) != '8')
6612
5.29k
      return stbi__err("not GIF", "Corrupt GIF");
6613
6614
915
   version = stbi__get8(s);
6615
915
   if (version != '7' && version != '9')    return stbi__err("not GIF", "Corrupt GIF");
6616
905
   if (stbi__get8(s) != 'a')                return stbi__err("not GIF", "Corrupt GIF");
6617
6618
893
   stbi__g_failure_reason = "";
6619
893
   g->w = stbi__get16le(s);
6620
893
   g->h = stbi__get16le(s);
6621
893
   g->flags = stbi__get8(s);
6622
893
   g->bgindex = stbi__get8(s);
6623
893
   g->ratio = stbi__get8(s);
6624
893
   g->transparent = -1;
6625
6626
893
   if (g->w > STBI_MAX_DIMENSIONS) return stbi__err("too large","Very large image (corrupt?)");
6627
893
   if (g->h > STBI_MAX_DIMENSIONS) return stbi__err("too large","Very large image (corrupt?)");
6628
6629
893
   if (comp != 0) *comp = 4;  // can't actually tell whether it's 3 or 4 until we parse the comments
6630
6631
893
   if (is_info) return 1;
6632
6633
440
   if (g->flags & 0x80)
6634
128
      stbi__gif_parse_colortable(s,g->pal, 2 << (g->flags & 7), -1);
6635
6636
440
   return 1;
6637
893
}
6638
6639
static int stbi__gif_info_raw(stbi__context *s, int *x, int *y, int *comp)
6640
5.76k
{
6641
5.76k
   stbi__gif* g = (stbi__gif*) stbi__malloc(sizeof(stbi__gif));
6642
5.76k
   if (!g) return stbi__err("outofmem", "Out of memory");
6643
5.76k
   if (!stbi__gif_header(s, g, comp, 1)) {
6644
5.31k
      STBI_FREE(g);
6645
5.31k
      stbi__rewind( s );
6646
5.31k
      return 0;
6647
5.31k
   }
6648
453
   if (x) *x = g->w;
6649
453
   if (y) *y = g->h;
6650
453
   STBI_FREE(g);
6651
453
   return 1;
6652
5.76k
}
6653
6654
static void stbi__out_gif_code(stbi__gif *g, stbi__uint16 code)
6655
3.76M
{
6656
3.76M
   stbi_uc *p, *c;
6657
3.76M
   int idx;
6658
6659
   // recurse to decode the prefixes, since the linked-list is backwards,
6660
   // and working backwards through an interleaved image would be nasty
6661
3.76M
   if (g->codes[code].prefix >= 0)
6662
3.61M
      stbi__out_gif_code(g, g->codes[code].prefix);
6663
6664
3.76M
   if (g->cur_y >= g->max_y) return;
6665
6666
3.76M
   idx = g->cur_x + g->cur_y;
6667
3.76M
   p = &g->out[idx];
6668
3.76M
   g->history[idx / 4] = 1;
6669
6670
3.76M
   c = &g->color_table[g->codes[code].suffix * 4];
6671
3.76M
   if (c[3] > 128) { // don't render transparent pixels;
6672
3.67M
      p[0] = c[2];
6673
3.67M
      p[1] = c[1];
6674
3.67M
      p[2] = c[0];
6675
3.67M
      p[3] = c[3];
6676
3.67M
   }
6677
3.76M
   g->cur_x += 4;
6678
6679
3.76M
   if (g->cur_x >= g->max_x) {
6680
8.78k
      g->cur_x = g->start_x;
6681
8.78k
      g->cur_y += g->step;
6682
6683
8.90k
      while (g->cur_y >= g->max_y && g->parse > 0) {
6684
121
         g->step = (1 << g->parse) * g->line_size;
6685
121
         g->cur_y = g->start_y + (g->step >> 1);
6686
121
         --g->parse;
6687
121
      }
6688
8.78k
   }
6689
3.76M
}
6690
6691
static stbi_uc *stbi__process_gif_raster(stbi__context *s, stbi__gif *g)
6692
206
{
6693
206
   stbi_uc lzw_cs;
6694
206
   stbi__int32 len, init_code;
6695
206
   stbi__uint32 first;
6696
206
   stbi__int32 codesize, codemask, avail, oldcode, bits, valid_bits, clear;
6697
206
   stbi__gif_lzw *p;
6698
6699
206
   lzw_cs = stbi__get8(s);
6700
206
   if (lzw_cs > 12) return NULL;
6701
202
   clear = 1 << lzw_cs;
6702
202
   first = 1;
6703
202
   codesize = lzw_cs + 1;
6704
202
   codemask = (1 << codesize) - 1;
6705
202
   bits = 0;
6706
202
   valid_bits = 0;
6707
97.7k
   for (init_code = 0; init_code < clear; init_code++) {
6708
97.5k
      g->codes[init_code].prefix = -1;
6709
97.5k
      g->codes[init_code].first = (stbi_uc) init_code;
6710
97.5k
      g->codes[init_code].suffix = (stbi_uc) init_code;
6711
97.5k
   }
6712
6713
   // support no starting clear code
6714
202
   avail = clear+2;
6715
202
   oldcode = -1;
6716
6717
202
   len = 0;
6718
359k
   for(;;) {
6719
359k
      if (valid_bits < codesize) {
6720
202k
         if (len == 0) {
6721
1.91k
            len = stbi__get8(s); // start new block
6722
1.91k
            if (len == 0)
6723
141
               return g->out;
6724
1.91k
         }
6725
202k
         --len;
6726
202k
         bits |= (stbi__int32) stbi__get8(s) << valid_bits;
6727
202k
         valid_bits += 8;
6728
202k
      } else {
6729
157k
         stbi__int32 code = bits & codemask;
6730
157k
         bits >>= codesize;
6731
157k
         valid_bits -= codesize;
6732
         // @OPTIMIZE: is there some way we can accelerate the non-clear path?
6733
157k
         if (code == clear) {  // clear code
6734
2.67k
            codesize = lzw_cs + 1;
6735
2.67k
            codemask = (1 << codesize) - 1;
6736
2.67k
            avail = clear + 2;
6737
2.67k
            oldcode = -1;
6738
2.67k
            first = 0;
6739
154k
         } else if (code == clear + 1) { // end of stream code
6740
16
            stbi__skip(s, len);
6741
210
            while ((len = stbi__get8(s)) > 0)
6742
194
               stbi__skip(s,len);
6743
16
            return g->out;
6744
154k
         } else if (code <= avail) {
6745
154k
            if (first) {
6746
15
               return stbi__errpuc("no clear code", "Corrupt GIF");
6747
15
            }
6748
6749
154k
            if (oldcode >= 0) {
6750
153k
               p = &g->codes[avail++];
6751
153k
               if (avail > 8192) {
6752
4
                  return stbi__errpuc("too many codes", "Corrupt GIF");
6753
4
               }
6754
6755
153k
               p->prefix = (stbi__int16) oldcode;
6756
153k
               p->first = g->codes[oldcode].first;
6757
153k
               p->suffix = (code == avail) ? p->first : g->codes[code].first;
6758
153k
            } else if (code == avail)
6759
1
               return stbi__errpuc("illegal code in raster", "Corrupt GIF");
6760
6761
154k
            stbi__out_gif_code(g, (stbi__uint16) code);
6762
6763
154k
            if ((avail & codemask) == 0 && avail <= 0x0FFF) {
6764
1.06k
               codesize++;
6765
1.06k
               codemask = (1 << codesize) - 1;
6766
1.06k
            }
6767
6768
154k
            oldcode = code;
6769
154k
         } else {
6770
25
            return stbi__errpuc("illegal code in raster", "Corrupt GIF");
6771
25
         }
6772
157k
      }
6773
359k
   }
6774
202
}
6775
6776
// this function is designed to support animated gifs, although stb_image doesn't support it
6777
// two back is the image from two frames ago, used for a very specific disposal format
6778
static stbi_uc *stbi__gif_load_next(stbi__context *s, stbi__gif *g, int *comp, int req_comp, stbi_uc *two_back)
6779
440
{
6780
440
   int dispose;
6781
440
   int first_frame;
6782
440
   int pi;
6783
440
   int pcount;
6784
440
   STBI_NOTUSED(req_comp);
6785
6786
   // on first frame, any non-written pixels get the background colour (non-transparent)
6787
440
   first_frame = 0;
6788
440
   if (g->out == 0) {
6789
440
      if (!stbi__gif_header(s, g, comp,0)) return 0; // stbi__g_failure_reason set by stbi__gif_header
6790
440
      if (!stbi__mad3sizes_valid(4, g->w, g->h, 0))
6791
0
         return stbi__errpuc("too large", "GIF image is too large");
6792
440
      pcount = g->w * g->h;
6793
440
      g->out = (stbi_uc *) stbi__malloc(4 * pcount);
6794
440
      g->background = (stbi_uc *) stbi__malloc(4 * pcount);
6795
440
      g->history = (stbi_uc *) stbi__malloc(pcount);
6796
440
      if (!g->out || !g->background || !g->history)
6797
0
         return stbi__errpuc("outofmem", "Out of memory");
6798
6799
      // image is treated as "transparent" at the start - ie, nothing overwrites the current background;
6800
      // background colour is only used for pixels that are not rendered first frame, after that "background"
6801
      // color refers to the color that was there the previous frame.
6802
440
      memset(g->out, 0x00, 4 * pcount);
6803
440
      memset(g->background, 0x00, 4 * pcount); // state of the background (starts transparent)
6804
440
      memset(g->history, 0x00, pcount);        // pixels that were affected previous frame
6805
440
      first_frame = 1;
6806
440
   } else {
6807
      // second frame - how do we dispose of the previous one?
6808
0
      dispose = (g->eflags & 0x1C) >> 2;
6809
0
      pcount = g->w * g->h;
6810
6811
0
      if ((dispose == 3) && (two_back == 0)) {
6812
0
         dispose = 2; // if I don't have an image to revert back to, default to the old background
6813
0
      }
6814
6815
0
      if (dispose == 3) { // use previous graphic
6816
0
         for (pi = 0; pi < pcount; ++pi) {
6817
0
            if (g->history[pi]) {
6818
0
               memcpy( &g->out[pi * 4], &two_back[pi * 4], 4 );
6819
0
            }
6820
0
         }
6821
0
      } else if (dispose == 2) {
6822
         // restore what was changed last frame to background before that frame;
6823
0
         for (pi = 0; pi < pcount; ++pi) {
6824
0
            if (g->history[pi]) {
6825
0
               memcpy( &g->out[pi * 4], &g->background[pi * 4], 4 );
6826
0
            }
6827
0
         }
6828
0
      } else {
6829
         // This is a non-disposal case eithe way, so just
6830
         // leave the pixels as is, and they will become the new background
6831
         // 1: do not dispose
6832
         // 0:  not specified.
6833
0
      }
6834
6835
      // background is what out is after the undoing of the previou frame;
6836
0
      memcpy( g->background, g->out, 4 * g->w * g->h );
6837
0
   }
6838
6839
   // clear my history;
6840
440
   memset( g->history, 0x00, g->w * g->h );        // pixels that were affected previous frame
6841
6842
2.59k
   for (;;) {
6843
2.59k
      int tag = stbi__get8(s);
6844
2.59k
      switch (tag) {
6845
245
         case 0x2C: /* Image Descriptor */
6846
245
         {
6847
245
            stbi__int32 x, y, w, h;
6848
245
            stbi_uc *o;
6849
6850
245
            x = stbi__get16le(s);
6851
245
            y = stbi__get16le(s);
6852
245
            w = stbi__get16le(s);
6853
245
            h = stbi__get16le(s);
6854
245
            if (((x + w) > (g->w)) || ((y + h) > (g->h)))
6855
20
               return stbi__errpuc("bad Image Descriptor", "Corrupt GIF");
6856
6857
225
            g->line_size = g->w * 4;
6858
225
            g->start_x = x * 4;
6859
225
            g->start_y = y * g->line_size;
6860
225
            g->max_x   = g->start_x + w * 4;
6861
225
            g->max_y   = g->start_y + h * g->line_size;
6862
225
            g->cur_x   = g->start_x;
6863
225
            g->cur_y   = g->start_y;
6864
6865
            // if the width of the specified rectangle is 0, that means
6866
            // we may not see *any* pixels or the image is malformed;
6867
            // to make sure this is caught, move the current y down to
6868
            // max_y (which is what out_gif_code checks).
6869
225
            if (w == 0)
6870
59
               g->cur_y = g->max_y;
6871
6872
225
            g->lflags = stbi__get8(s);
6873
6874
225
            if (g->lflags & 0x40) {
6875
130
               g->step = 8 * g->line_size; // first interlaced spacing
6876
130
               g->parse = 3;
6877
130
            } else {
6878
95
               g->step = g->line_size;
6879
95
               g->parse = 0;
6880
95
            }
6881
6882
225
            if (g->lflags & 0x80) {
6883
129
               stbi__gif_parse_colortable(s,g->lpal, 2 << (g->lflags & 7), g->eflags & 0x01 ? g->transparent : -1);
6884
129
               g->color_table = (stbi_uc *) g->lpal;
6885
129
            } else if (g->flags & 0x80) {
6886
77
               g->color_table = (stbi_uc *) g->pal;
6887
77
            } else
6888
19
               return stbi__errpuc("missing color table", "Corrupt GIF");
6889
6890
206
            o = stbi__process_gif_raster(s, g);
6891
206
            if (!o) return NULL;
6892
6893
            // if this was the first frame,
6894
157
            pcount = g->w * g->h;
6895
157
            if (first_frame && (g->bgindex > 0)) {
6896
               // if first frame, any pixel not drawn to gets the background color
6897
821M
               for (pi = 0; pi < pcount; ++pi) {
6898
821M
                  if (g->history[pi] == 0) {
6899
819M
                     g->pal[g->bgindex][3] = 255; // just in case it was made transparent, undo that; It will be reset next frame if need be;
6900
819M
                     memcpy( &g->out[pi * 4], &g->pal[g->bgindex], 4 );
6901
819M
                  }
6902
821M
               }
6903
152
            }
6904
6905
157
            return o;
6906
206
         }
6907
6908
2.15k
         case 0x21: // Comment Extension.
6909
2.15k
         {
6910
2.15k
            int len;
6911
2.15k
            int ext = stbi__get8(s);
6912
2.15k
            if (ext == 0xF9) { // Graphic Control Extension.
6913
1.77k
               len = stbi__get8(s);
6914
1.77k
               if (len == 4) {
6915
1.28k
                  g->eflags = stbi__get8(s);
6916
1.28k
                  g->delay = 10 * stbi__get16le(s); // delay - 1/100th of a second, saving as 1/1000ths.
6917
6918
                  // unset old transparent
6919
1.28k
                  if (g->transparent >= 0) {
6920
944
                     g->pal[g->transparent][3] = 255;
6921
944
                  }
6922
1.28k
                  if (g->eflags & 0x01) {
6923
970
                     g->transparent = stbi__get8(s);
6924
970
                     if (g->transparent >= 0) {
6925
970
                        g->pal[g->transparent][3] = 0;
6926
970
                     }
6927
970
                  } else {
6928
                     // don't need transparent
6929
319
                     stbi__skip(s, 1);
6930
319
                     g->transparent = -1;
6931
319
                  }
6932
1.28k
               } else {
6933
481
                  stbi__skip(s, len);
6934
481
                  break;
6935
481
               }
6936
1.77k
            }
6937
2.44k
            while ((len = stbi__get8(s)) != 0) {
6938
770
               stbi__skip(s, len);
6939
770
            }
6940
1.67k
            break;
6941
2.15k
         }
6942
6943
1
         case 0x3B: // gif stream termination code
6944
1
            return (stbi_uc *) s; // using '1' causes warning on some compilers
6945
6946
194
         default:
6947
194
            return stbi__errpuc("unknown code", "Corrupt GIF");
6948
2.59k
      }
6949
2.59k
   }
6950
440
}
6951
6952
static void *stbi__load_gif_main_outofmem(stbi__gif *g, stbi_uc *out, int **delays)
6953
0
{
6954
0
   STBI_FREE(g->out);
6955
0
   STBI_FREE(g->history);
6956
0
   STBI_FREE(g->background);
6957
6958
0
   if (out) STBI_FREE(out);
6959
0
   if (delays && *delays) STBI_FREE(*delays);
6960
0
   return stbi__errpuc("outofmem", "Out of memory");
6961
0
}
6962
6963
static void *stbi__load_gif_main(stbi__context *s, int **delays, int *x, int *y, int *z, int *comp, int req_comp)
6964
0
{
6965
0
   if (stbi__gif_test(s)) {
6966
0
      int layers = 0;
6967
0
      stbi_uc *u = 0;
6968
0
      stbi_uc *out = 0;
6969
0
      stbi_uc *two_back = 0;
6970
0
      stbi__gif g;
6971
0
      int stride;
6972
0
      int out_size = 0;
6973
0
      int delays_size = 0;
6974
6975
0
      STBI_NOTUSED(out_size);
6976
0
      STBI_NOTUSED(delays_size);
6977
6978
0
      memset(&g, 0, sizeof(g));
6979
0
      if (delays) {
6980
0
         *delays = 0;
6981
0
      }
6982
6983
0
      do {
6984
0
         u = stbi__gif_load_next(s, &g, comp, req_comp, two_back);
6985
0
         if (u == (stbi_uc *) s) u = 0;  // end of animated gif marker
6986
6987
0
         if (u) {
6988
0
            *x = g.w;
6989
0
            *y = g.h;
6990
0
            ++layers;
6991
0
            stride = g.w * g.h * 4;
6992
6993
0
            if (out) {
6994
0
               void *tmp = (stbi_uc*) STBI_REALLOC_SIZED( out, out_size, layers * stride );
6995
0
               if (!tmp)
6996
0
                  return stbi__load_gif_main_outofmem(&g, out, delays);
6997
0
               else {
6998
0
                   out = (stbi_uc*) tmp;
6999
0
                   out_size = layers * stride;
7000
0
               }
7001
7002
0
               if (delays) {
7003
0
                  int *new_delays = (int*) STBI_REALLOC_SIZED( *delays, delays_size, sizeof(int) * layers );
7004
0
                  if (!new_delays)
7005
0
                     return stbi__load_gif_main_outofmem(&g, out, delays);
7006
0
                  *delays = new_delays;
7007
0
                  delays_size = layers * sizeof(int);
7008
0
               }
7009
0
            } else {
7010
0
               out = (stbi_uc*)stbi__malloc( layers * stride );
7011
0
               if (!out)
7012
0
                  return stbi__load_gif_main_outofmem(&g, out, delays);
7013
0
               out_size = layers * stride;
7014
0
               if (delays) {
7015
0
                  *delays = (int*) stbi__malloc( layers * sizeof(int) );
7016
0
                  if (!*delays)
7017
0
                     return stbi__load_gif_main_outofmem(&g, out, delays);
7018
0
                  delays_size = layers * sizeof(int);
7019
0
               }
7020
0
            }
7021
0
            memcpy( out + ((layers - 1) * stride), u, stride );
7022
0
            if (layers >= 2) {
7023
0
               two_back = out - 2 * stride;
7024
0
            }
7025
7026
0
            if (delays) {
7027
0
               (*delays)[layers - 1U] = g.delay;
7028
0
            }
7029
0
         }
7030
0
      } while (u != 0);
7031
7032
      // free temp buffer;
7033
0
      STBI_FREE(g.out);
7034
0
      STBI_FREE(g.history);
7035
0
      STBI_FREE(g.background);
7036
7037
      // do the final conversion after loading everything;
7038
0
      if (req_comp && req_comp != 4)
7039
0
         out = stbi__convert_format(out, 4, req_comp, layers * g.w, g.h);
7040
7041
0
      *z = layers;
7042
0
      return out;
7043
0
   } else {
7044
0
      return stbi__errpuc("not GIF", "Image was not as a gif type.");
7045
0
   }
7046
0
}
7047
7048
static void *stbi__gif_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
7049
440
{
7050
440
   stbi_uc *u = 0;
7051
440
   stbi__gif g;
7052
440
   memset(&g, 0, sizeof(g));
7053
440
   STBI_NOTUSED(ri);
7054
7055
440
   u = stbi__gif_load_next(s, &g, comp, req_comp, 0);
7056
440
   if (u == (stbi_uc *) s) u = 0;  // end of animated gif marker
7057
440
   if (u) {
7058
157
      *x = g.w;
7059
157
      *y = g.h;
7060
7061
      // moved conversion to after successful load so that the same
7062
      // can be done for multiple frames.
7063
157
      if (req_comp && req_comp != 4)
7064
0
         u = stbi__convert_format(u, 4, req_comp, g.w, g.h);
7065
283
   } else if (g.out) {
7066
      // if there was an error and we allocated an image buffer, free it!
7067
283
      STBI_FREE(g.out);
7068
283
   }
7069
7070
   // free buffers needed for multiple frame loading;
7071
440
   STBI_FREE(g.history);
7072
440
   STBI_FREE(g.background);
7073
7074
440
   return u;
7075
440
}
7076
7077
static int stbi__gif_info(stbi__context *s, int *x, int *y, int *comp)
7078
5.76k
{
7079
5.76k
   return stbi__gif_info_raw(s,x,y,comp);
7080
5.76k
}
7081
#endif
7082
7083
// *************************************************************************************************
7084
// Radiance RGBE HDR loader
7085
// originally by Nicolas Schulz
7086
#ifndef STBI_NO_HDR
7087
static int stbi__hdr_test_core(stbi__context *s, const char *signature)
7088
8.08k
{
7089
8.08k
   int i;
7090
21.4k
   for (i=0; signature[i]; ++i)
7091
20.0k
      if (stbi__get8(s) != signature[i])
7092
6.75k
          return 0;
7093
1.32k
   stbi__rewind(s);
7094
1.32k
   return 1;
7095
8.08k
}
7096
7097
static int stbi__hdr_test(stbi__context* s)
7098
4.04k
{
7099
4.04k
   int r = stbi__hdr_test_core(s, "#?RADIANCE\n");
7100
4.04k
   stbi__rewind(s);
7101
4.04k
   if(!r) {
7102
4.04k
       r = stbi__hdr_test_core(s, "#?RGBE\n");
7103
4.04k
       stbi__rewind(s);
7104
4.04k
   }
7105
4.04k
   return r;
7106
4.04k
}
7107
7108
299k
#define STBI__HDR_BUFLEN  1024
7109
static char *stbi__hdr_gettoken(stbi__context *z, char *buffer)
7110
7.29k
{
7111
7.29k
   int len=0;
7112
7.29k
   char c = '\0';
7113
7114
7.29k
   c = (char) stbi__get8(z);
7115
7116
306k
   while (!stbi__at_eof(z) && c != '\n') {
7117
299k
      buffer[len++] = c;
7118
299k
      if (len == STBI__HDR_BUFLEN-1) {
7119
         // flush to end of line
7120
16.5k
         while (!stbi__at_eof(z) && stbi__get8(z) != '\n')
7121
16.3k
            ;
7122
213
         break;
7123
213
      }
7124
299k
      c = (char) stbi__get8(z);
7125
299k
   }
7126
7127
7.29k
   buffer[len] = 0;
7128
7.29k
   return buffer;
7129
7.29k
}
7130
7131
static void stbi__hdr_convert(float *output, stbi_uc *input, int req_comp)
7132
234M
{
7133
234M
   if ( input[3] != 0 ) {
7134
65.0k
      float f1;
7135
      // Exponent
7136
65.0k
      f1 = (float) ldexp(1.0f, input[3] - (int)(128 + 8));
7137
65.0k
      if (req_comp <= 2)
7138
0
         output[0] = (input[0] + input[1] + input[2]) * f1 / 3;
7139
65.0k
      else {
7140
65.0k
         output[0] = input[0] * f1;
7141
65.0k
         output[1] = input[1] * f1;
7142
65.0k
         output[2] = input[2] * f1;
7143
65.0k
      }
7144
65.0k
      if (req_comp == 2) output[1] = 1;
7145
65.0k
      if (req_comp == 4) output[3] = 1;
7146
234M
   } else {
7147
234M
      switch (req_comp) {
7148
234M
         case 4: output[3] = 1; /* fallthrough */
7149
234M
         case 3: output[0] = output[1] = output[2] = 0;
7150
234M
                 break;
7151
0
         case 2: output[1] = 1; /* fallthrough */
7152
0
         case 1: output[0] = 0;
7153
0
                 break;
7154
234M
      }
7155
234M
   }
7156
234M
}
7157
7158
static float *stbi__hdr_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
7159
547
{
7160
547
   char buffer[STBI__HDR_BUFLEN];
7161
547
   char *token;
7162
547
   int valid = 0;
7163
547
   int width, height;
7164
547
   stbi_uc *scanline;
7165
547
   float *hdr_data;
7166
547
   int len;
7167
547
   unsigned char count, value;
7168
547
   int i, j, k, c1,c2, z;
7169
547
   const char *headerToken;
7170
547
   STBI_NOTUSED(ri);
7171
7172
   // Check identifier
7173
547
   headerToken = stbi__hdr_gettoken(s,buffer);
7174
547
   if (strcmp(headerToken, "#?RADIANCE") != 0 && strcmp(headerToken, "#?RGBE") != 0)
7175
0
      return stbi__errpf("not HDR", "Corrupt HDR image");
7176
7177
   // Parse header
7178
1.82k
   for(;;) {
7179
1.82k
      token = stbi__hdr_gettoken(s,buffer);
7180
1.82k
      if (token[0] == 0) break;
7181
1.27k
      if (strcmp(token, "FORMAT=32-bit_rle_rgbe") == 0) valid = 1;
7182
1.27k
   }
7183
7184
547
   if (!valid)    return stbi__errpf("unsupported format", "Unsupported HDR format");
7185
7186
   // Parse width and height
7187
   // can't use sscanf() if we're not using stdio!
7188
547
   token = stbi__hdr_gettoken(s,buffer);
7189
547
   if (strncmp(token, "-Y ", 3))  return stbi__errpf("unsupported data layout", "Unsupported HDR format");
7190
547
   token += 3;
7191
547
   height = (int) strtol(token, &token, 10);
7192
834
   while (*token == ' ') ++token;
7193
547
   if (strncmp(token, "+X ", 3))  return stbi__errpf("unsupported data layout", "Unsupported HDR format");
7194
547
   token += 3;
7195
547
   width = (int) strtol(token, NULL, 10);
7196
7197
547
   if (height > STBI_MAX_DIMENSIONS) return stbi__errpf("too large","Very large image (corrupt?)");
7198
544
   if (width > STBI_MAX_DIMENSIONS) return stbi__errpf("too large","Very large image (corrupt?)");
7199
7200
543
   *x = width;
7201
543
   *y = height;
7202
7203
543
   if (comp) *comp = 3;
7204
543
   if (req_comp == 0) req_comp = 3;
7205
7206
543
   if (!stbi__mad4sizes_valid(width, height, req_comp, sizeof(float), 0))
7207
46
      return stbi__errpf("too large", "HDR image is too large");
7208
7209
   // Read data
7210
497
   hdr_data = (float *) stbi__malloc_mad4(width, height, req_comp, sizeof(float), 0);
7211
497
   if (!hdr_data)
7212
0
      return stbi__errpf("outofmem", "Out of memory");
7213
7214
   // Load image data
7215
   // image data is stored as some number of sca
7216
497
   if ( width < 8 || width >= 32768) {
7217
      // Read flat data
7218
363M
      for (j=0; j < height; ++j) {
7219
598M
         for (i=0; i < width; ++i) {
7220
234M
            stbi_uc rgbe[4];
7221
234M
           main_decode_loop:
7222
234M
            stbi__getn(s, rgbe, 4);
7223
234M
            stbi__hdr_convert(hdr_data + j * width * req_comp + i * req_comp, rgbe, req_comp);
7224
234M
         }
7225
363M
      }
7226
280
   } else {
7227
      // Read RLE-encoded data
7228
217
      scanline = NULL;
7229
7230
1.03k
      for (j = 0; j < height; ++j) {
7231
999
         c1 = stbi__get8(s);
7232
999
         c2 = stbi__get8(s);
7233
999
         len = stbi__get8(s);
7234
999
         if (c1 != 2 || c2 != 2 || (len & 0x80)) {
7235
            // not run-length encoded, so we have to actually use THIS data as a decoded
7236
            // pixel (note this can't be a valid pixel--one of RGB must be >= 128)
7237
66
            stbi_uc rgbe[4];
7238
66
            rgbe[0] = (stbi_uc) c1;
7239
66
            rgbe[1] = (stbi_uc) c2;
7240
66
            rgbe[2] = (stbi_uc) len;
7241
66
            rgbe[3] = (stbi_uc) stbi__get8(s);
7242
66
            stbi__hdr_convert(hdr_data, rgbe, req_comp);
7243
66
            i = 1;
7244
66
            j = 0;
7245
66
            STBI_FREE(scanline);
7246
66
            goto main_decode_loop; // yes, this makes no sense
7247
66
         }
7248
933
         len <<= 8;
7249
933
         len |= stbi__get8(s);
7250
933
         if (len != width) { STBI_FREE(hdr_data); STBI_FREE(scanline); return stbi__errpf("invalid decoded scanline length", "corrupt HDR"); }
7251
906
         if (scanline == NULL) {
7252
116
            scanline = (stbi_uc *) stbi__malloc_mad2(width, 4, 0);
7253
116
            if (!scanline) {
7254
0
               STBI_FREE(hdr_data);
7255
0
               return stbi__errpf("outofmem", "Out of memory");
7256
0
            }
7257
116
         }
7258
7259
4.20k
         for (k = 0; k < 4; ++k) {
7260
3.39k
            int nleft;
7261
3.39k
            i = 0;
7262
7.48k
            while ((nleft = width - i) > 0) {
7263
4.18k
               count = stbi__get8(s);
7264
4.18k
               if (count > 128) {
7265
                  // Run
7266
3.74k
                  value = stbi__get8(s);
7267
3.74k
                  count -= 128;
7268
3.74k
                  if ((count == 0) || (count > nleft)) { STBI_FREE(hdr_data); STBI_FREE(scanline); return stbi__errpf("corrupt", "bad RLE data in HDR"); }
7269
291k
                  for (z = 0; z < count; ++z)
7270
288k
                     scanline[i++ * 4 + k] = value;
7271
3.73k
               } else {
7272
                  // Dump
7273
446
                  if ((count == 0) || (count > nleft)) { STBI_FREE(hdr_data); STBI_FREE(scanline); return stbi__errpf("corrupt", "bad RLE data in HDR"); }
7274
3.26k
                  for (z = 0; z < count; ++z)
7275
2.90k
                     scanline[i++ * 4 + k] = stbi__get8(s);
7276
359
               }
7277
4.18k
            }
7278
3.39k
         }
7279
64.6k
         for (i=0; i < width; ++i)
7280
63.8k
            stbi__hdr_convert(hdr_data+(j*width + i)*req_comp, scanline + i*4, req_comp);
7281
818
      }
7282
36
      if (scanline)
7283
8
         STBI_FREE(scanline);
7284
36
   }
7285
7286
382
   return hdr_data;
7287
497
}
7288
7289
static int stbi__hdr_info(stbi__context *s, int *x, int *y, int *comp)
7290
3.25k
{
7291
3.25k
   char buffer[STBI__HDR_BUFLEN];
7292
3.25k
   char *token;
7293
3.25k
   int valid = 0;
7294
3.25k
   int dummy;
7295
7296
3.25k
   if (!x) x = &dummy;
7297
3.25k
   if (!y) y = &dummy;
7298
3.25k
   if (!comp) comp = &dummy;
7299
7300
3.25k
   if (stbi__hdr_test(s) == 0) {
7301
2.47k
       stbi__rewind( s );
7302
2.47k
       return 0;
7303
2.47k
   }
7304
7305
3.78k
   for(;;) {
7306
3.78k
      token = stbi__hdr_gettoken(s,buffer);
7307
3.78k
      if (token[0] == 0) break;
7308
3.00k
      if (strcmp(token, "FORMAT=32-bit_rle_rgbe") == 0) valid = 1;
7309
3.00k
   }
7310
7311
782
   if (!valid) {
7312
187
       stbi__rewind( s );
7313
187
       return 0;
7314
187
   }
7315
595
   token = stbi__hdr_gettoken(s,buffer);
7316
595
   if (strncmp(token, "-Y ", 3)) {
7317
15
       stbi__rewind( s );
7318
15
       return 0;
7319
15
   }
7320
580
   token += 3;
7321
580
   *y = (int) strtol(token, &token, 10);
7322
933
   while (*token == ' ') ++token;
7323
580
   if (strncmp(token, "+X ", 3)) {
7324
30
       stbi__rewind( s );
7325
30
       return 0;
7326
30
   }
7327
550
   token += 3;
7328
550
   *x = (int) strtol(token, NULL, 10);
7329
550
   *comp = 3;
7330
550
   return 1;
7331
580
}
7332
#endif // STBI_NO_HDR
7333
7334
#ifndef STBI_NO_BMP
7335
static int stbi__bmp_info(stbi__context *s, int *x, int *y, int *comp)
7336
5.31k
{
7337
5.31k
   void *p;
7338
5.31k
   stbi__bmp_data info;
7339
7340
5.31k
   info.all_a = 255;
7341
5.31k
   p = stbi__bmp_parse_header(s, &info);
7342
5.31k
   if (p == NULL) {
7343
4.13k
      stbi__rewind( s );
7344
4.13k
      return 0;
7345
4.13k
   }
7346
1.18k
   if (x) *x = s->img_x;
7347
1.18k
   if (y) *y = s->img_y;
7348
1.18k
   if (comp) {
7349
1.18k
      if (info.bpp == 24 && info.ma == 0xff000000)
7350
1
         *comp = 3;
7351
1.18k
      else
7352
1.18k
         *comp = info.ma ? 4 : 3;
7353
1.18k
   }
7354
1.18k
   return 1;
7355
5.31k
}
7356
#endif
7357
7358
#ifndef STBI_NO_PSD
7359
static int stbi__psd_info(stbi__context *s, int *x, int *y, int *comp)
7360
4.13k
{
7361
4.13k
   int channelCount, dummy, depth;
7362
4.13k
   if (!x) x = &dummy;
7363
4.13k
   if (!y) y = &dummy;
7364
4.13k
   if (!comp) comp = &dummy;
7365
4.13k
   if (stbi__get32be(s) != 0x38425053) {
7366
3.76k
       stbi__rewind( s );
7367
3.76k
       return 0;
7368
3.76k
   }
7369
372
   if (stbi__get16be(s) != 1) {
7370
13
       stbi__rewind( s );
7371
13
       return 0;
7372
13
   }
7373
359
   stbi__skip(s, 6);
7374
359
   channelCount = stbi__get16be(s);
7375
359
   if (channelCount < 0 || channelCount > 16) {
7376
8
       stbi__rewind( s );
7377
8
       return 0;
7378
8
   }
7379
351
   *y = stbi__get32be(s);
7380
351
   *x = stbi__get32be(s);
7381
351
   depth = stbi__get16be(s);
7382
351
   if (depth != 8 && depth != 16) {
7383
23
       stbi__rewind( s );
7384
23
       return 0;
7385
23
   }
7386
328
   if (stbi__get16be(s) != 3) {
7387
14
       stbi__rewind( s );
7388
14
       return 0;
7389
14
   }
7390
314
   *comp = 4;
7391
314
   return 1;
7392
328
}
7393
7394
static int stbi__psd_is16(stbi__context *s)
7395
0
{
7396
0
   int channelCount, depth;
7397
0
   if (stbi__get32be(s) != 0x38425053) {
7398
0
       stbi__rewind( s );
7399
0
       return 0;
7400
0
   }
7401
0
   if (stbi__get16be(s) != 1) {
7402
0
       stbi__rewind( s );
7403
0
       return 0;
7404
0
   }
7405
0
   stbi__skip(s, 6);
7406
0
   channelCount = stbi__get16be(s);
7407
0
   if (channelCount < 0 || channelCount > 16) {
7408
0
       stbi__rewind( s );
7409
0
       return 0;
7410
0
   }
7411
0
   STBI_NOTUSED(stbi__get32be(s));
7412
0
   STBI_NOTUSED(stbi__get32be(s));
7413
0
   depth = stbi__get16be(s);
7414
0
   if (depth != 16) {
7415
0
       stbi__rewind( s );
7416
0
       return 0;
7417
0
   }
7418
0
   return 1;
7419
0
}
7420
#endif
7421
7422
#ifndef STBI_NO_PIC
7423
static int stbi__pic_info(stbi__context *s, int *x, int *y, int *comp)
7424
3.82k
{
7425
3.82k
   int act_comp=0,num_packets=0,chained,dummy;
7426
3.82k
   stbi__pic_packet packets[10];
7427
7428
3.82k
   if (!x) x = &dummy;
7429
3.82k
   if (!y) y = &dummy;
7430
3.82k
   if (!comp) comp = &dummy;
7431
7432
3.82k
   if (!stbi__pic_is4(s,"\x53\x80\xF6\x34")) {
7433
3.50k
      stbi__rewind(s);
7434
3.50k
      return 0;
7435
3.50k
   }
7436
7437
317
   stbi__skip(s, 88);
7438
7439
317
   *x = stbi__get16be(s);
7440
317
   *y = stbi__get16be(s);
7441
317
   if (stbi__at_eof(s)) {
7442
1
      stbi__rewind( s);
7443
1
      return 0;
7444
1
   }
7445
316
   if ( (*x) != 0 && (1 << 28) / (*x) < (*y)) {
7446
1
      stbi__rewind( s );
7447
1
      return 0;
7448
1
   }
7449
7450
315
   stbi__skip(s, 8);
7451
7452
414
   do {
7453
414
      stbi__pic_packet *packet;
7454
7455
414
      if (num_packets==sizeof(packets)/sizeof(packets[0]))
7456
1
         return 0;
7457
7458
413
      packet = &packets[num_packets++];
7459
413
      chained = stbi__get8(s);
7460
413
      packet->size    = stbi__get8(s);
7461
413
      packet->type    = stbi__get8(s);
7462
413
      packet->channel = stbi__get8(s);
7463
413
      act_comp |= packet->channel;
7464
7465
413
      if (stbi__at_eof(s)) {
7466
66
          stbi__rewind( s );
7467
66
          return 0;
7468
66
      }
7469
347
      if (packet->size != 8) {
7470
3
          stbi__rewind( s );
7471
3
          return 0;
7472
3
      }
7473
347
   } while (chained);
7474
7475
245
   *comp = (act_comp & 0x10 ? 4 : 3);
7476
7477
245
   return 1;
7478
315
}
7479
#endif
7480
7481
// *************************************************************************************************
7482
// Portable Gray Map and Portable Pixel Map loader
7483
// by Ken Miller
7484
//
7485
// PGM: http://netpbm.sourceforge.net/doc/pgm.html
7486
// PPM: http://netpbm.sourceforge.net/doc/ppm.html
7487
//
7488
// Known limitations:
7489
//    Does not support comments in the header section
7490
//    Does not support ASCII image data (formats P2 and P3)
7491
7492
#ifndef STBI_NO_PNM
7493
7494
static int      stbi__pnm_test(stbi__context *s)
7495
1.10k
{
7496
1.10k
   char p, t;
7497
1.10k
   p = (char) stbi__get8(s);
7498
1.10k
   t = (char) stbi__get8(s);
7499
1.10k
   if (p != 'P' || (t != '5' && t != '6')) {
7500
792
       stbi__rewind( s );
7501
792
       return 0;
7502
792
   }
7503
308
   return 1;
7504
1.10k
}
7505
7506
static void *stbi__pnm_load(stbi__context *s, int *x, int *y, int *comp, int req_comp, stbi__result_info *ri)
7507
308
{
7508
308
   stbi_uc *out;
7509
308
   STBI_NOTUSED(ri);
7510
7511
308
   ri->bits_per_channel = stbi__pnm_info(s, (int *)&s->img_x, (int *)&s->img_y, (int *)&s->img_n);
7512
308
   if (ri->bits_per_channel == 0)
7513
3
      return 0;
7514
7515
305
   if (s->img_y > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
7516
303
   if (s->img_x > STBI_MAX_DIMENSIONS) return stbi__errpuc("too large","Very large image (corrupt?)");
7517
7518
302
   *x = s->img_x;
7519
302
   *y = s->img_y;
7520
302
   if (comp) *comp = s->img_n;
7521
7522
302
   if (!stbi__mad4sizes_valid(s->img_n, s->img_x, s->img_y, ri->bits_per_channel / 8, 0))
7523
0
      return stbi__errpuc("too large", "PNM too large");
7524
7525
302
   out = (stbi_uc *) stbi__malloc_mad4(s->img_n, s->img_x, s->img_y, ri->bits_per_channel / 8, 0);
7526
302
   if (!out) return stbi__errpuc("outofmem", "Out of memory");
7527
302
   if (!stbi__getn(s, out, s->img_n * s->img_x * s->img_y * (ri->bits_per_channel / 8))) {
7528
246
      STBI_FREE(out);
7529
246
      return stbi__errpuc("bad PNM", "PNM file truncated");
7530
246
   }
7531
7532
56
   if (req_comp && req_comp != s->img_n) {
7533
56
      if (ri->bits_per_channel == 16) {
7534
33
         out = (stbi_uc *) stbi__convert_format16((stbi__uint16 *) out, s->img_n, req_comp, s->img_x, s->img_y);
7535
33
      } else {
7536
23
         out = stbi__convert_format(out, s->img_n, req_comp, s->img_x, s->img_y);
7537
23
      }
7538
56
      if (out == NULL) return out; // stbi__convert_format frees input on failure
7539
56
   }
7540
56
   return out;
7541
56
}
7542
7543
static int      stbi__pnm_isspace(char c)
7544
4.78k
{
7545
4.78k
   return c == ' ' || c == '\t' || c == '\n' || c == '\v' || c == '\f' || c == '\r';
7546
4.78k
}
7547
7548
static void     stbi__pnm_skip_whitespace(stbi__context *s, char *c)
7549
2.17k
{
7550
2.80k
   for (;;) {
7551
5.33k
      while (!stbi__at_eof(s) && stbi__pnm_isspace(*c))
7552
2.53k
         *c = (char) stbi__get8(s);
7553
7554
2.80k
      if (stbi__at_eof(s) || *c != '#')
7555
2.17k
         break;
7556
7557
1.95k
      while (!stbi__at_eof(s) && *c != '\n' && *c != '\r' )
7558
1.32k
         *c = (char) stbi__get8(s);
7559
632
   }
7560
2.17k
}
7561
7562
static int      stbi__pnm_isdigit(char c)
7563
5.21k
{
7564
5.21k
   return c >= '0' && c <= '9';
7565
5.21k
}
7566
7567
static int      stbi__pnm_getinteger(stbi__context *s, char *c)
7568
2.17k
{
7569
2.17k
   int value = 0;
7570
7571
6.29k
   while (!stbi__at_eof(s) && stbi__pnm_isdigit(*c)) {
7572
4.13k
      value = value*10 + (*c - '0');
7573
4.13k
      *c = (char) stbi__get8(s);
7574
4.13k
      if((value > 214748364) || (value == 214748364 && *c > '7'))
7575
11
          return stbi__err("integer parse overflow", "Parsing an integer in the PPM header overflowed a 32-bit int");
7576
4.13k
   }
7577
7578
2.16k
   return value;
7579
2.17k
}
7580
7581
static int      stbi__pnm_info(stbi__context *s, int *x, int *y, int *comp)
7582
3.88k
{
7583
3.88k
   int maxv, dummy;
7584
3.88k
   char c, p, t;
7585
7586
3.88k
   if (!x) x = &dummy;
7587
3.88k
   if (!y) y = &dummy;
7588
3.88k
   if (!comp) comp = &dummy;
7589
7590
3.88k
   stbi__rewind(s);
7591
7592
   // Get identifier
7593
3.88k
   p = (char) stbi__get8(s);
7594
3.88k
   t = (char) stbi__get8(s);
7595
3.88k
   if (p != 'P' || (t != '5' && t != '6')) {
7596
3.07k
       stbi__rewind(s);
7597
3.07k
       return 0;
7598
3.07k
   }
7599
7600
810
   *comp = (t == '6') ? 3 : 1;  // '5' is 1-component .pgm; '6' is 3-component .ppm
7601
7602
810
   c = (char) stbi__get8(s);
7603
810
   stbi__pnm_skip_whitespace(s, &c);
7604
7605
810
   *x = stbi__pnm_getinteger(s, &c); // read width
7606
810
   if(*x == 0)
7607
84
       return stbi__err("invalid width", "PPM image header had zero or overflowing width");
7608
726
   stbi__pnm_skip_whitespace(s, &c);
7609
7610
726
   *y = stbi__pnm_getinteger(s, &c); // read height
7611
726
   if (*y == 0)
7612
91
       return stbi__err("invalid width", "PPM image header had zero or overflowing width");
7613
635
   stbi__pnm_skip_whitespace(s, &c);
7614
7615
635
   maxv = stbi__pnm_getinteger(s, &c);  // read max value
7616
635
   if (maxv > 65535)
7617
10
      return stbi__err("max value > 65535", "PPM image supports only 8-bit and 16-bit images");
7618
625
   else if (maxv > 255)
7619
115
      return 16;
7620
510
   else
7621
510
      return 8;
7622
635
}
7623
7624
static int stbi__pnm_is16(stbi__context *s)
7625
0
{
7626
0
   if (stbi__pnm_info(s, NULL, NULL, NULL) == 16)
7627
0
     return 1;
7628
0
   return 0;
7629
0
}
7630
#endif
7631
7632
static int stbi__info_main(stbi__context *s, int *x, int *y, int *comp)
7633
8.78k
{
7634
8.78k
   #ifndef STBI_NO_JPEG
7635
8.78k
   if (stbi__jpeg_info(s, x, y, comp)) return 1;
7636
6.95k
   #endif
7637
7638
6.95k
   #ifndef STBI_NO_PNG
7639
6.95k
   if (stbi__png_info(s, x, y, comp))  return 1;
7640
5.76k
   #endif
7641
7642
5.76k
   #ifndef STBI_NO_GIF
7643
5.76k
   if (stbi__gif_info(s, x, y, comp))  return 1;
7644
5.31k
   #endif
7645
7646
5.31k
   #ifndef STBI_NO_BMP
7647
5.31k
   if (stbi__bmp_info(s, x, y, comp))  return 1;
7648
4.13k
   #endif
7649
7650
4.13k
   #ifndef STBI_NO_PSD
7651
4.13k
   if (stbi__psd_info(s, x, y, comp))  return 1;
7652
3.82k
   #endif
7653
7654
3.82k
   #ifndef STBI_NO_PIC
7655
3.82k
   if (stbi__pic_info(s, x, y, comp))  return 1;
7656
3.57k
   #endif
7657
7658
3.57k
   #ifndef STBI_NO_PNM
7659
3.57k
   if (stbi__pnm_info(s, x, y, comp))  return 1;
7660
3.25k
   #endif
7661
7662
3.25k
   #ifndef STBI_NO_HDR
7663
3.25k
   if (stbi__hdr_info(s, x, y, comp))  return 1;
7664
2.70k
   #endif
7665
7666
   // test tga last because it's a crappy test!
7667
2.70k
   #ifndef STBI_NO_TGA
7668
2.70k
   if (stbi__tga_info(s, x, y, comp))
7669
232
       return 1;
7670
2.47k
   #endif
7671
2.47k
   return stbi__err("unknown image type", "Image not of any known type, or corrupt");
7672
2.70k
}
7673
7674
static int stbi__is_16_main(stbi__context *s)
7675
0
{
7676
0
   #ifndef STBI_NO_PNG
7677
0
   if (stbi__png_is16(s))  return 1;
7678
0
   #endif
7679
7680
0
   #ifndef STBI_NO_PSD
7681
0
   if (stbi__psd_is16(s))  return 1;
7682
0
   #endif
7683
7684
0
   #ifndef STBI_NO_PNM
7685
0
   if (stbi__pnm_is16(s))  return 1;
7686
0
   #endif
7687
0
   return 0;
7688
0
}
7689
7690
#ifndef STBI_NO_STDIO
7691
STBIDEF int stbi_info(char const *filename, int *x, int *y, int *comp)
7692
0
{
7693
0
    FILE *f = stbi__fopen(filename, "rb");
7694
0
    int result;
7695
0
    if (!f) return stbi__err("can't fopen", "Unable to open file");
7696
0
    result = stbi_info_from_file(f, x, y, comp);
7697
0
    fclose(f);
7698
0
    return result;
7699
0
}
7700
7701
STBIDEF int stbi_info_from_file(FILE *f, int *x, int *y, int *comp)
7702
0
{
7703
0
   int r;
7704
0
   stbi__context s;
7705
0
   long pos = ftell(f);
7706
0
   stbi__start_file(&s, f);
7707
0
   r = stbi__info_main(&s,x,y,comp);
7708
0
   fseek(f,pos,SEEK_SET);
7709
0
   return r;
7710
0
}
7711
7712
STBIDEF int stbi_is_16_bit(char const *filename)
7713
0
{
7714
0
    FILE *f = stbi__fopen(filename, "rb");
7715
0
    int result;
7716
0
    if (!f) return stbi__err("can't fopen", "Unable to open file");
7717
0
    result = stbi_is_16_bit_from_file(f);
7718
0
    fclose(f);
7719
0
    return result;
7720
0
}
7721
7722
STBIDEF int stbi_is_16_bit_from_file(FILE *f)
7723
0
{
7724
0
   int r;
7725
0
   stbi__context s;
7726
0
   long pos = ftell(f);
7727
0
   stbi__start_file(&s, f);
7728
0
   r = stbi__is_16_main(&s);
7729
0
   fseek(f,pos,SEEK_SET);
7730
0
   return r;
7731
0
}
7732
#endif // !STBI_NO_STDIO
7733
7734
STBIDEF int stbi_info_from_memory(stbi_uc const *buffer, int len, int *x, int *y, int *comp)
7735
9.05k
{
7736
9.05k
   stbi__context s;
7737
9.05k
   stbi__start_mem(&s,buffer,len);
7738
9.05k
   return stbi__info_main(&s,x,y,comp);
7739
9.05k
}
7740
7741
STBIDEF int stbi_info_from_callbacks(stbi_io_callbacks const *c, void *user, int *x, int *y, int *comp)
7742
0
{
7743
0
   stbi__context s;
7744
0
   stbi__start_callbacks(&s, (stbi_io_callbacks *) c, user);
7745
0
   return stbi__info_main(&s,x,y,comp);
7746
0
}
7747
7748
STBIDEF int stbi_is_16_bit_from_memory(stbi_uc const *buffer, int len)
7749
0
{
7750
0
   stbi__context s;
7751
0
   stbi__start_mem(&s,buffer,len);
7752
0
   return stbi__is_16_main(&s);
7753
0
}
7754
7755
STBIDEF int stbi_is_16_bit_from_callbacks(stbi_io_callbacks const *c, void *user)
7756
0
{
7757
0
   stbi__context s;
7758
0
   stbi__start_callbacks(&s, (stbi_io_callbacks *) c, user);
7759
0
   return stbi__is_16_main(&s);
7760
0
}
7761
7762
#endif // STB_IMAGE_IMPLEMENTATION
7763
7764
/*
7765
   revision history:
7766
      2.20  (2019-02-07) support utf8 filenames in Windows; fix warnings and platform ifdefs
7767
      2.19  (2018-02-11) fix warning
7768
      2.18  (2018-01-30) fix warnings
7769
      2.17  (2018-01-29) change sbti__shiftsigned to avoid clang -O2 bug
7770
                         1-bit BMP
7771
                         *_is_16_bit api
7772
                         avoid warnings
7773
      2.16  (2017-07-23) all functions have 16-bit variants;
7774
                         STBI_NO_STDIO works again;
7775
                         compilation fixes;
7776
                         fix rounding in unpremultiply;
7777
                         optimize vertical flip;
7778
                         disable raw_len validation;
7779
                         documentation fixes
7780
      2.15  (2017-03-18) fix png-1,2,4 bug; now all Imagenet JPGs decode;
7781
                         warning fixes; disable run-time SSE detection on gcc;
7782
                         uniform handling of optional "return" values;
7783
                         thread-safe initialization of zlib tables
7784
      2.14  (2017-03-03) remove deprecated STBI_JPEG_OLD; fixes for Imagenet JPGs
7785
      2.13  (2016-11-29) add 16-bit API, only supported for PNG right now
7786
      2.12  (2016-04-02) fix typo in 2.11 PSD fix that caused crashes
7787
      2.11  (2016-04-02) allocate large structures on the stack
7788
                         remove white matting for transparent PSD
7789
                         fix reported channel count for PNG & BMP
7790
                         re-enable SSE2 in non-gcc 64-bit
7791
                         support RGB-formatted JPEG
7792
                         read 16-bit PNGs (only as 8-bit)
7793
      2.10  (2016-01-22) avoid warning introduced in 2.09 by STBI_REALLOC_SIZED
7794
      2.09  (2016-01-16) allow comments in PNM files
7795
                         16-bit-per-pixel TGA (not bit-per-component)
7796
                         info() for TGA could break due to .hdr handling
7797
                         info() for BMP to shares code instead of sloppy parse
7798
                         can use STBI_REALLOC_SIZED if allocator doesn't support realloc
7799
                         code cleanup
7800
      2.08  (2015-09-13) fix to 2.07 cleanup, reading RGB PSD as RGBA
7801
      2.07  (2015-09-13) fix compiler warnings
7802
                         partial animated GIF support
7803
                         limited 16-bpc PSD support
7804
                         #ifdef unused functions
7805
                         bug with < 92 byte PIC,PNM,HDR,TGA
7806
      2.06  (2015-04-19) fix bug where PSD returns wrong '*comp' value
7807
      2.05  (2015-04-19) fix bug in progressive JPEG handling, fix warning
7808
      2.04  (2015-04-15) try to re-enable SIMD on MinGW 64-bit
7809
      2.03  (2015-04-12) extra corruption checking (mmozeiko)
7810
                         stbi_set_flip_vertically_on_load (nguillemot)
7811
                         fix NEON support; fix mingw support
7812
      2.02  (2015-01-19) fix incorrect assert, fix warning
7813
      2.01  (2015-01-17) fix various warnings; suppress SIMD on gcc 32-bit without -msse2
7814
      2.00b (2014-12-25) fix STBI_MALLOC in progressive JPEG
7815
      2.00  (2014-12-25) optimize JPG, including x86 SSE2 & NEON SIMD (ryg)
7816
                         progressive JPEG (stb)
7817
                         PGM/PPM support (Ken Miller)
7818
                         STBI_MALLOC,STBI_REALLOC,STBI_FREE
7819
                         GIF bugfix -- seemingly never worked
7820
                         STBI_NO_*, STBI_ONLY_*
7821
      1.48  (2014-12-14) fix incorrectly-named assert()
7822
      1.47  (2014-12-14) 1/2/4-bit PNG support, both direct and paletted (Omar Cornut & stb)
7823
                         optimize PNG (ryg)
7824
                         fix bug in interlaced PNG with user-specified channel count (stb)
7825
      1.46  (2014-08-26)
7826
              fix broken tRNS chunk (colorkey-style transparency) in non-paletted PNG
7827
      1.45  (2014-08-16)
7828
              fix MSVC-ARM internal compiler error by wrapping malloc
7829
      1.44  (2014-08-07)
7830
              various warning fixes from Ronny Chevalier
7831
      1.43  (2014-07-15)
7832
              fix MSVC-only compiler problem in code changed in 1.42
7833
      1.42  (2014-07-09)
7834
              don't define _CRT_SECURE_NO_WARNINGS (affects user code)
7835
              fixes to stbi__cleanup_jpeg path
7836
              added STBI_ASSERT to avoid requiring assert.h
7837
      1.41  (2014-06-25)
7838
              fix search&replace from 1.36 that messed up comments/error messages
7839
      1.40  (2014-06-22)
7840
              fix gcc struct-initialization warning
7841
      1.39  (2014-06-15)
7842
              fix to TGA optimization when req_comp != number of components in TGA;
7843
              fix to GIF loading because BMP wasn't rewinding (whoops, no GIFs in my test suite)
7844
              add support for BMP version 5 (more ignored fields)
7845
      1.38  (2014-06-06)
7846
              suppress MSVC warnings on integer casts truncating values
7847
              fix accidental rename of 'skip' field of I/O
7848
      1.37  (2014-06-04)
7849
              remove duplicate typedef
7850
      1.36  (2014-06-03)
7851
              convert to header file single-file library
7852
              if de-iphone isn't set, load iphone images color-swapped instead of returning NULL
7853
      1.35  (2014-05-27)
7854
              various warnings
7855
              fix broken STBI_SIMD path
7856
              fix bug where stbi_load_from_file no longer left file pointer in correct place
7857
              fix broken non-easy path for 32-bit BMP (possibly never used)
7858
              TGA optimization by Arseny Kapoulkine
7859
      1.34  (unknown)
7860
              use STBI_NOTUSED in stbi__resample_row_generic(), fix one more leak in tga failure case
7861
      1.33  (2011-07-14)
7862
              make stbi_is_hdr work in STBI_NO_HDR (as specified), minor compiler-friendly improvements
7863
      1.32  (2011-07-13)
7864
              support for "info" function for all supported filetypes (SpartanJ)
7865
      1.31  (2011-06-20)
7866
              a few more leak fixes, bug in PNG handling (SpartanJ)
7867
      1.30  (2011-06-11)
7868
              added ability to load files via callbacks to accomidate custom input streams (Ben Wenger)
7869
              removed deprecated format-specific test/load functions
7870
              removed support for installable file formats (stbi_loader) -- would have been broken for IO callbacks anyway
7871
              error cases in bmp and tga give messages and don't leak (Raymond Barbiero, grisha)
7872
              fix inefficiency in decoding 32-bit BMP (David Woo)
7873
      1.29  (2010-08-16)
7874
              various warning fixes from Aurelien Pocheville
7875
      1.28  (2010-08-01)
7876
              fix bug in GIF palette transparency (SpartanJ)
7877
      1.27  (2010-08-01)
7878
              cast-to-stbi_uc to fix warnings
7879
      1.26  (2010-07-24)
7880
              fix bug in file buffering for PNG reported by SpartanJ
7881
      1.25  (2010-07-17)
7882
              refix trans_data warning (Won Chun)
7883
      1.24  (2010-07-12)
7884
              perf improvements reading from files on platforms with lock-heavy fgetc()
7885
              minor perf improvements for jpeg
7886
              deprecated type-specific functions so we'll get feedback if they're needed
7887
              attempt to fix trans_data warning (Won Chun)
7888
      1.23    fixed bug in iPhone support
7889
      1.22  (2010-07-10)
7890
              removed image *writing* support
7891
              stbi_info support from Jetro Lauha
7892
              GIF support from Jean-Marc Lienher
7893
              iPhone PNG-extensions from James Brown
7894
              warning-fixes from Nicolas Schulz and Janez Zemva (i.stbi__err. Janez (U+017D)emva)
7895
      1.21    fix use of 'stbi_uc' in header (reported by jon blow)
7896
      1.20    added support for Softimage PIC, by Tom Seddon
7897
      1.19    bug in interlaced PNG corruption check (found by ryg)
7898
      1.18  (2008-08-02)
7899
              fix a threading bug (local mutable static)
7900
      1.17    support interlaced PNG
7901
      1.16    major bugfix - stbi__convert_format converted one too many pixels
7902
      1.15    initialize some fields for thread safety
7903
      1.14    fix threadsafe conversion bug
7904
              header-file-only version (#define STBI_HEADER_FILE_ONLY before including)
7905
      1.13    threadsafe
7906
      1.12    const qualifiers in the API
7907
      1.11    Support installable IDCT, colorspace conversion routines
7908
      1.10    Fixes for 64-bit (don't use "unsigned long")
7909
              optimized upsampling by Fabian "ryg" Giesen
7910
      1.09    Fix format-conversion for PSD code (bad global variables!)
7911
      1.08    Thatcher Ulrich's PSD code integrated by Nicolas Schulz
7912
      1.07    attempt to fix C++ warning/errors again
7913
      1.06    attempt to fix C++ warning/errors again
7914
      1.05    fix TGA loading to return correct *comp and use good luminance calc
7915
      1.04    default float alpha is 1, not 255; use 'void *' for stbi_image_free
7916
      1.03    bugfixes to STBI_NO_STDIO, STBI_NO_HDR
7917
      1.02    support for (subset of) HDR files, float interface for preferred access to them
7918
      1.01    fix bug: possible bug in handling right-side up bmps... not sure
7919
              fix bug: the stbi__bmp_load() and stbi__tga_load() functions didn't work at all
7920
      1.00    interface to zlib that skips zlib header
7921
      0.99    correct handling of alpha in palette
7922
      0.98    TGA loader by lonesock; dynamically add loaders (untested)
7923
      0.97    jpeg errors on too large a file; also catch another malloc failure
7924
      0.96    fix detection of invalid v value - particleman@mollyrocket forum
7925
      0.95    during header scan, seek to markers in case of padding
7926
      0.94    STBI_NO_STDIO to disable stdio usage; rename all #defines the same
7927
      0.93    handle jpegtran output; verbose errors
7928
      0.92    read 4,8,16,24,32-bit BMP files of several formats
7929
      0.91    output 24-bit Windows 3.0 BMP files
7930
      0.90    fix a few more warnings; bump version number to approach 1.0
7931
      0.61    bugfixes due to Marc LeBlanc, Christopher Lloyd
7932
      0.60    fix compiling as c++
7933
      0.59    fix warnings: merge Dave Moore's -Wall fixes
7934
      0.58    fix bug: zlib uncompressed mode len/nlen was wrong endian
7935
      0.57    fix bug: jpg last huffman symbol before marker was >9 bits but less than 16 available
7936
      0.56    fix bug: zlib uncompressed mode len vs. nlen
7937
      0.55    fix bug: restart_interval not initialized to 0
7938
      0.54    allow NULL for 'int *comp'
7939
      0.53    fix bug in png 3->4; speedup png decoding
7940
      0.52    png handles req_comp=3,4 directly; minor cleanup; jpeg comments
7941
      0.51    obey req_comp requests, 1-component jpegs return as 1-component,
7942
              on 'test' only check type, not whether we support this variant
7943
      0.50  (2006-11-19)
7944
              first released version
7945
*/
7946
7947
7948
/*
7949
------------------------------------------------------------------------------
7950
This software is available under 2 licenses -- choose whichever you prefer.
7951
------------------------------------------------------------------------------
7952
ALTERNATIVE A - MIT License
7953
Copyright (c) 2017 Sean Barrett
7954
Permission is hereby granted, free of charge, to any person obtaining a copy of
7955
this software and associated documentation files (the "Software"), to deal in
7956
the Software without restriction, including without limitation the rights to
7957
use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies
7958
of the Software, and to permit persons to whom the Software is furnished to do
7959
so, subject to the following conditions:
7960
The above copyright notice and this permission notice shall be included in all
7961
copies or substantial portions of the Software.
7962
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
7963
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
7964
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
7965
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
7966
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
7967
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
7968
SOFTWARE.
7969
------------------------------------------------------------------------------
7970
ALTERNATIVE B - Public Domain (www.unlicense.org)
7971
This is free and unencumbered software released into the public domain.
7972
Anyone is free to copy, modify, publish, use, compile, sell, or distribute this
7973
software, either in source code form or as a compiled binary, for any purpose,
7974
commercial or non-commercial, and by any means.
7975
In jurisdictions that recognize copyright laws, the author or authors of this
7976
software dedicate any and all copyright interest in the software to the public
7977
domain. We make this dedication for the benefit of the public at large and to
7978
the detriment of our heirs and successors. We intend this dedication to be an
7979
overt act of relinquishment in perpetuity of all present and future rights to
7980
this software under copyright law.
7981
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
7982
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
7983
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
7984
AUTHORS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN
7985
ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION
7986
WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
7987
------------------------------------------------------------------------------
7988
*/