The rapid development of mobile social networks and wireless communication technology has made image/video sharing easier and more efficient. However, the convenience of social multimedia sharing can also cause serious problems such as privacy disclosure because of illegal use of shared contents, and the existing content sharing scheme cannot meet the privacy protection requirements of users in social networks. Therefore, secure multimedia sharing and privacy protection have become critical and urgent in multimedia social networks. For protecting multimedia sharing in multimedia social networks, a novel joint fingerprinting and encryption (JFE) scheme is proposed. Both fingerprint embedding and encryption are performed in the tree structure haar wavelet transform and singular value decomposition (TSHWT_SVD) domain based on chaotic neural network. First, the social image is decomposed based on the fingerprint code structure by the TSHWT. Then, perform SVD computing for selective subbands for parallel piecewise fingerprint segments embedding. In the end, the fingerprinted coefficient stream is encrypted via block permutation and SVD diffusion. It is worth mentioning that most of the existing image security algorithms encrypt and embed a watermark in the spatial domain, which cannot meet the requirements of the era of multimedia social networks due to lack of scalability. The proposed method, to the best of our knowledge, is the first scalable JFE method for fingerprinting and encryption in the TSHWT_SVD domain. The use of fingerprinting along with encryption in the TSHWT_SVD domain can provide scalable double-layer protection for secure social multimedia sharing. When compared with existing image security algorithms, the scalable selective encryption method greatly improves encryption efficiency. Moreover, experimental results and contrastive analyses show that the proposed JFE scheme has high security, fast speed and can resist various attacks.