Free viewpoint television (FTV) is an innovative technology that allows one to view a three-dimensional (3D) world by freely changing the viewpoint. The transmission aspect of FTV is published as Recommendation ITU-T J.901. [ITU-T J.901] defines the reference system configuration and shows the allocation of depth estimation and the interpolation module in the configuration. Then@ [ITU-T J.901] specifies the requirements for the protocols and data format that are needed@ in accordance with the configuration. The most favourable feature of FTV is its ability to offer an audience a selection of viewpoints. However@ as a reproducible range of viewpoints can give rise to a trade-off with the amount of data to be transmitted@ [ITU-T J.901] demands data scalability as an optional requirement. This Recommendation specifies the data structure within the scope of [ITU-T J.901]@ where the data structure enables scalability in the sense of a reproducible view-range and the amount of data. This Recommendation utilizes existing data representation and view generation schemes specified by ITU and other related standardization organizations@ as well as future view generation schemes.