Specials (Unicode block) - Wikipedia

文章推薦指數: 80 %
投票人數:10人

U+FFFC  OBJECT REPLACEMENT CHARACTER, placeholder in the text for another unspecified object, for example in a compound document. U+FFFD REPLACEMENT ... Specials(Unicodeblock) FromWikipedia,thefreeencyclopedia Jumptonavigation Jumptosearch Unicodeblockcontainingsomespecialcodepointsandtwonon-characters UnicodecharacterblockSpecialsRangeU+FFF0..U+FFFF(16codepoints)PlaneBMPScriptsCommonAssigned5codepointsUnused9reservedcodepoints2non-charactersUnicodeversionhistory1.0.0(1991)1(+1)2.1(1998)2(+1)3.0(1999)5(+3) ChartCodechartNote:[1][2] SpecialsisashortUnicodeblockofcharactersallocatedattheveryendoftheBasicMultilingualPlane,atU+FFF0–FFFF.Ofthese16codepoints,fivehavebeenassignedsinceUnicode3.0: U+FFF9INTERLINEARANNOTATIONANCHOR,marksstartofannotatedtext U+FFFAINTERLINEARANNOTATIONSEPARATOR,marksstartofannotatingcharacter(s) U+FFFBINTERLINEARANNOTATIONTERMINATOR,marksendofannotationblock U+FFFCOBJECTREPLACEMENTCHARACTER,placeholderinthetextforanotherunspecifiedobject,forexampleinacompounddocument. U+FFFD�REPLACEMENTCHARACTERusedtoreplaceanunknown,unrecognized,orunrepresentablecharacter U+FFFEnotacharacter. U+FFFFnotacharacter. FFFEandFFFFarenotunassignedintheusualsense,butguaranteednottobeUnicodecharactersatall.Theycanbeusedtoguessatext'sencodingscheme,sinceanytextcontainingtheseisbydefinitionnotacorrectlyencodedUnicodetext.Unicode'sU+FEFFBYTEORDERMARKcharactercanbeinsertedatthebeginningofaUnicodetexttosignalitsendianness:aprogramreadingsuchatextandencountering0xFFFEwouldthenknowthatitshouldswitchthebyteorderforallthefollowingcharacters. ItsblocknameinUnicode1.0wasSpecial.[3] Contents 1Replacementcharacter 2Unicodechart 3History 4Seealso 5References Replacementcharacter[edit] Replacementcharacter Thereplacementcharacter�(oftendisplayedasablackrhombuswithawhitequestionmark)isasymbolfoundintheUnicodestandardatcodepointU+FFFDintheSpecialstable.Itisusedtoindicateproblemswhenasystemisunabletorenderastreamofdatatoacorrectsymbol.[4]Itisusuallyseenwhenthedataisinvalidanddoesnotmatchanycharacter: ConsideratextfilecontainingtheGermanwordfür(meaning'for')intheISO-8859-1encoding(0x660xFC0x72).ThisfileisnowopenedwithatexteditorthatassumestheinputisUTF-8.ThefirstandlastbytearevalidUTF-8encodingsofASCII,butthemiddlebyte(0xFC)isnotavalidbyteinUTF-8.Therefore,atexteditorcouldreplacethisbytewiththereplacementcharactersymboltoproduceavalidstringofUnicodecodepoints.Thewholestringnowdisplayslikethis:"f�r". ApoorlyimplementedtexteditormightsavethereplacementinUTF-8form;thetextfiledatawillthenlooklikethis:0x660xEF0xBF0xBD0x72,whichwillbedisplayedinISO-8859-1as"f�r"(thisiscalledmojibake).Sincethereplacementisthesameforallerrorsthismakesitimpossibletorecovertheoriginalcharacter.Abetter(buthardertoimplement)designistopreservetheoriginalbytes,includingtheerror,andonlyconverttothereplacementwhendisplayingthetext.Thiswillallowthetexteditortosavetheoriginalbytesequence,whilestillshowingtheerrorindicatortotheuser. Atonetimethereplacementcharacterwasoftenusedwhentherewasnoglyphavailableinafontforthatcharacter.However,mostmoderntextrenderingsystemsinsteaduseafont's.notdefcharacter,whichinmostcasesisanemptybox(or"?"or"X"inabox[5]),sometimescalleda"tofu"(thisbrowserdisplays􏿾).ThereisnoUnicodecodepointforthissymbol. Thusthereplacementcharacterisnowonlyseenforencodingerrors,suchasinvalidUTF-8.SomesoftwareattemptstohidethisbytranslatingthebytesofinvalidUTF-8tomatchingcharactersinWindows-1252(sincethatisthemostlikelysourceoftheseerrors),sothatthereplacementcharacterisneverseen. Unicodechart[edit] Specials[1][2][3]OfficialUnicodeConsortiumcodechart(PDF)   0 1 2 3 4 5 6 7 8 9 A B C D E F U+FFFx IAA IAS IAT  � Notes 1.^AsofUnicodeversion15.0 2.^Greyareasindicatenon-assignedcodepoints 3.^Blackareasindicatenoncharacters(codepointsthatareguaranteednevertobeassignedasencodedcharactersintheUnicodeStandard) History[edit] ThefollowingUnicode-relateddocumentsrecordthepurposeandprocessofdefiningspecificcharactersintheSpecialsblock: Version Finalcodepoints[a] Count UTC ID L2 ID WG2 ID Document 1.0.0 U+FFFD 1 (tobedetermined) U+FFFE..FFFF 2 (tobedetermined) L2/01-295R Moore,Lisa(2001-11-06),"Motion88-M2",MinutesfromtheUTC/L2meeting#88 L2/01-355 N2369(html,doc) Davis,Mark(2001-09-26),RequesttoallowFFFF,FFFEinUTF-8inthetextofISO/IEC10646 L2/02-154 N2403 Umamaheswaran,V.S.(2002-04-22),"9.3AllowingFFFFandFFFEinUTF-8",DraftminutesofWG2meeting41,HotelPhoenix,Singapore,2001-10-15/19 2.1 U+FFFC 1 UTC/1995-056 Sargent,Murray(1995-12-06),RecommendationtoencodeaWCH_EMBEDDINGcharacter UTC/1996-002 Aliprand,Joan;Hart,Edwin;Greenfield,Steve(1996-03-05),"EmbeddedObjects",UTC#67Minutes N1365 Sargent,Murray(1996-03-18),ProposalSummary–ObjectReplacementCharacter N1353 Umamaheswaran,V.S.;Ksar,Mike(1996-06-25),"8.14",DraftminutesofWG2CopenhagenMeeting#30 L2/97-288 N1603 Umamaheswaran,V.S.(1997-10-24),"7.3",UnconfirmedMeetingMinutes,WG2Meeting#33,Heraklion,Crete,Greece,20June–4July1997 L2/98-004R N1681 TextofISO10646–AMD18forPDAMregistrationandFPDAMballot,1997-12-22 L2/98-070 Aliprand,Joan;Winkler,Arnold,"Additionalcommentsregarding2.1",MinutesofthejointUTCandL2meetingfromthemeetinginCupertino,February25-27,1998 L2/98-318 N1894 Revisedtextof10646-1/FPDAM18,AMENDMENT18:SymbolsandOthers,1998-10-22 3.0 U+FFF9..FFFB 3 L2/97-255R Aliprand,Joan(1997-12-03),"3.DProposalforIn-LineNotation(ruby)",ApprovedMinutes–UTC#73&L2#170jointmeeting,PaloAlto,CA–August4-5,1997 L2/98-055 Freytag,Asmus(1998-02-22),SupportforImplementingInlineandInterlinearAnnotations L2/98-070 Aliprand,Joan;Winkler,Arnold,"3.C.5.Supportforimplementinginlineandinterlinearannotations",MinutesofthejointUTCandL2meetingfromthemeetinginCupertino,February25-27,1998 L2/98-099 N1727 Freytag,Asmus(1998-03-18),SupportforImplementingInterlinearAnnotationsasusedinEastAsianTypography L2/98-158 Aliprand,Joan;Winkler,Arnold(1998-05-26),"InlineandInterlinearAnnotations",DraftMinutes–UTC#76&NCITSSubgroupL2#173jointmeeting,Tredyffrin,Pennsylvania,April20-22,1998 L2/98-286 N1703 Umamaheswaran,V.S.;Ksar,Mike(1998-07-02),"8.14",UnconfirmedMeetingMinutes,WG2Meeting#34,Redmond,WA,USA;1998-03-16--20 L2/98-270 Hiura,Hideki;Kobayashi,Tatsuo(1998-07-29),Suggestiontotheinlineandinterlinearannotationproposal L2/98-281R(pdf,html) Aliprand,Joan(1998-07-31),"In-LineandInterlinearAnnotation(III.C.1.c)",UnconfirmedMinutes–UTC#77&NCITSSubgroupL2#174JOINTMEETING,Redmond,WA--July29-31,1998 L2/98-363 N1861 Sato,T.K.(1998-09-01),Rubymarkers L2/98-372 N1884R2(pdf,doc) Whistler,Ken;et al.(1998-09-22),AdditionalCharactersfortheUCS L2/98-416 N1882.zip SupportforImplementingInterlinearAnnotations,1998-09-23 L2/98-329 N1920 CombinedPDAMregistrationandconsiderationballotonWDforISO/IEC10646-1/Amd.30,AMENDMENT30:AdditionalLatinandothercharacters,1998-10-28 L2/98-421R Suignard,Michel;Hiura,Hideki(1998-12-04),NotesconcerningthePDAM30interlinearannotationcharacters L2/99-010 N1903(pdf,html,doc) Umamaheswaran,V.S.(1998-12-30),"8.2.15",MinutesofWG2meeting35,London,U.K.;1998-09-21--25 L2/98-419(pdf,doc) Aliprand,Joan(1999-02-05),"InterlinearAnnotationCharacters",ApprovedMinutes--UTC#78&NCITSSubgroupL2#175JointMeeting,SanJose,CA--December1-4,1998 UTC/1999-021 Duerst,Martin;Bosak,Jon(1999-06-08),W3CXMLCGstatementonannotationcharacters L2/99-176R Moore,Lisa(1999-11-04),"W3CLiaisonStatementonAnnotationCharacters",MinutesfromthejointUTC/L2meetinginSeattle,June8-10,1999 L2/01-301 Whistler,Ken(2001-08-01),"E.Indicatedas"stronglydiscouraged"forplaintextinterchange",AnalysisofCharacterDeprecationintheUnicodeStandard ^Proposedcodepointsandcharactersnamesmaydifferfromfinalcodepointsandnames Seealso[edit] Unicodecontrolcharacters References[edit] ^"Unicodecharacterdatabase".TheUnicodeStandard.Archivedfromtheoriginalon2017-09-25.Retrieved2016-07-09. ^"EnumeratedVersionsofTheUnicodeStandard".TheUnicodeStandard.Archivedfromtheoriginalon2016-06-29.Retrieved2016-07-09. ^"3.8:Block-by-BlockCharts"(PDF).TheUnicodeStandard.version1.0.UnicodeConsortium.Archived(PDF)fromtheoriginalon2021-02-11.Retrieved2020-09-30. ^Wichary,Marcin."WhenFontsFall".Figma.Archivedfromtheoriginalon13June2021.Retrieved6June2021. ^"RecommendationsforOpenTypeFonts(OpenType1.7)-Typography".docs.microsoft.com.Archivedfromtheoriginalon19October2020.Retrieved18October2020. vteUnicodeUnicode UnicodeConsortium ISO/IEC10646(UniversalCharacterSet) Versions Codepoints Block List UniversalCharacterSet Charactercharts Characterproperty Plane PrivateUseArea CharactersSpecialpurpose BOM Combininggraphemejoiner Left-to-rightmark /Right-to-leftmark Softhyphen Variantform Wordjoiner Zero-widthjoiner Zero-widthnon-joiner Zero-widthspace Lists Characters CJKUnifiedIdeographs Combiningcharacter Duplicatecharacters Numerals Scripts Spaces Symbols Halfwidthandfullwidth Aliasnamesandabbreviations Whitespacecharacters ProcessingAlgorithms Bidirectionaltext Collation ISO/IEC14651 Equivalence Variationsequences InternationalIdeographsCore Comparison BOCU-1 CESU-8 Punycode SCSU UTF-1 UTF-7 UTF-8 UTF-16/UCS-2 UTF-32/UCS-4 UTF-EBCDIC Onpairsofcodepoints Combiningcharacter Compatibilitycharacters Duplicatecharacters Equivalence Homoglyph Precomposedcharacter list Z-variant Variationsequences Regionalindicatorsymbol Emojiskincolor Usage Domainnames(IDN) Email Fonts HTML entityreferences numericreferences Input InternationalIdeographsCore Relatedstandards CommonLocaleDataRepository(CLDR) GB18030 ISO/IEC8859 ISO15924 Relatedtopics Anomalies ConScriptUnicodeRegistry IdeographicResearchGroup InternationalComponentsforUnicode PeopleinvolvedwithUnicode Hanunification ScriptsandsymbolsinUnicodeCommonandinheritedscripts Combiningmarks Diacritics Punctuationmarks Spaces Numbers Modernscripts Adlam Arabic Armenian Balinese Bamum Batak Bengali Bopomofo Braille Buhid Burmese CanadianAboriginal Chakma Cham Cherokee CJKUnifiedIdeographs(Han) Cyrillic Deseret Devanagari Geʽez Georgian Greek Gujarati GunjalaGondi Gurmukhi Hangul HanifiRohingya Hanja Hanunuoo Hebrew Hiragana Javanese Kanji Kannada Katakana KayahLi Khmer Lao Latin Lepcha Limbu Lisu(Fraser) Lontara Malayalam MasaramGondi MendeKikakui Medefaidrin Miao(Pollard) Mongolian Mru N'Ko NagMundari NewTaiLue Nüshu NyiakengPuachueHmong Odia OlChiki Osage Osmanya PahawhHmong PauCinHau Pracalit(Newa) Ranjana Rejang Samaritan Saurashtra Shavian Sinhala SorangSompeng Sundanese Syriac Tagbanwa TaiLe TaiTham TaiViet Tamil Tangsa Telugu Thaana Thai Tibetan Tifinagh Tirhuta Toto Vai Wancho WarangCiti Yi Ancientandhistoricscripts Ahom Anatolianhieroglyphs AncientNorthArabian Avestan BassaVah Bhaiksuki Brāhmī Carian CaucasianAlbanian Coptic Cuneiform Cypriot Cypro-Minoan DivesAkuru Dogra Egyptianhieroglyphs Elbasan Elymaic Glagolitic Gothic Grantha Hatran ImperialAramaic InscriptionalPahlavi InscriptionalParthian Kaithi Kawi Kharosthi Khitansmallscript Khojki Khudawadi Khwarezmian(Chorasmian) LinearA LinearB Lycian Lydian Mahajani Makasar Mandaic Manichaean Marchen MeeteiMayek Meroitic Modi Multani Nabataean Nandinagari Ogham OldHungarian OldItalic OldPermic OldPersiancuneiform OldSogdian OldTurkic OldUyghur Palmyrene ʼPhags-pa Phoenician PsalterPahlavi Runic Sharada Siddham Sogdian SouthArabian Soyombo SylhetiNagri Tagalog(Baybayin) Takri Tangut Ugaritic Vithkuqi Yezidi ZanabazarSquare Notationalscripts Duployan SignWriting Symbols,emojis Cultural,political,andreligioussymbols Currency ControlPictures Mathematicaloperatorsandsymbols Listbysubject Phoneticsymbols(includingIPA) Emoji  Category:Unicode  Category:Unicodeblocks Retrievedfrom"https://en.wikipedia.org/w/index.php?title=Specials_(Unicode_block)&oldid=1114697453" Categories:UnicodeblocksHiddencategories:ArticleswithshortdescriptionShortdescriptionisdifferentfromWikidataArticlescontainingGerman-languagetext Navigationmenu Personaltools NotloggedinTalkContributionsCreateaccountLogin Namespaces ArticleTalk English Views ReadEditViewhistory More Search Navigation MainpageContentsCurrenteventsRandomarticleAboutWikipediaContactusDonate Contribute HelpLearntoeditCommunityportalRecentchangesUploadfile Tools WhatlinkshereRelatedchangesUploadfileSpecialpagesPermanentlinkPageinformationCitethispageWikidataitem Print/export DownloadasPDFPrintableversion Inotherprojects WikimediaCommonsWiktionary Languages العربيةDeutschEspañolFrançais한국어BahasaIndonesiaItalianoJawa日本語РусскийSimpleEnglishاردو中文 Editlinks



請為這篇文章評分?